llvm-project

Commit Graph

Author	SHA1	Message	Date
Yaxun (Sam) Liu	092f15ac40	[HIP] File device library ABI version file name It should be oclc_abi_version* instead of abi_version*. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D120557	2022-02-28 16:24:50 -05:00
Todd Mortimer	bcbb03754e	[Driver][OpenBSD] Enable unwind tables on all architectures	2022-02-27 19:43:49 -05:00
Andrzej Warzynski	2e9439e489	[flang][driver] Add support for `--target`/`--triple` This patch adds support for: * `--target` in the compiler driver (`flang-new`) * `--triple` in the frontend driver (`flang-new -fc1`) The semantics of these flags are inherited from `clangDriver`, i.e. consistent with `clang --target` and `clang -cc1 --triple`, respectively. A new structure is defined, `TargetOptions`, that will hold various Frontend options related to the target. Currently, this is mostly a placeholder that contains the target triple. In the future, it will be used for storing e.g. the CPU to tune for or the target features to enable. Additionally, the following target/triple related options are enabled []: `-print-effective-triple`, `-print-target-triple`. Definitions in Options.td are updated accordingly and, to facilated testing, `-emit-llvm` is added to the list of options available in `flang-new` (previously it was only enabled in `flang-new -fc1`). [] These options were actually available before (like all other options defined in `clangDriver`), but not included in `flang-new --help`. Before this change, `flang-new` would just use `native` for defining the target, so these options were of little value. Differential Revision: https://reviews.llvm.org/D120246	2022-02-25 09:38:10 +00:00
Raúl Peñacoba	ca80c24386	[Driver] Support GCC detection for GCC compiled with --enable-version-specific-runtime-libs GCC's compiled with --enable-version-specific-runtime-libs change the paths where includes and libs are found. This patch adds support for these cases Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D118700	2022-02-25 04:41:03 +00:00
Yaxun (Sam) Liu	9d899d8f01	[HIP] Support `-fgpu-default-stream` Introduce -fgpu-default-stream={legacy\|per-thread} option to support per-thread default stream for HIP runtime. When -fgpu-default-stream=per-thread, HIP kernels are launched through hipLaunchKernel_spt instead of hipLaunchKernel. Also HIP_API_PER_THREAD_DEFAULT_STREAM=1 is defined by the preprocessor to enable other per-thread stream API's. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D120298	2022-02-23 22:28:29 -05:00
Zahira Ammarguellat	1592d88aa7	Add support for floating-point option `ffp-eval-method` and for `pragma clang fp eval_method`. Differential Revision: https://reviews.llvm.org/D109239	2022-02-23 15:00:18 -08:00
Joseph Huber	2b97b16f29	[OpenMP] Add option to make offloading mandatory Currently when we generate OpenMP offloading code we always make fallback code for the CPU. This is necessary for implementing features like conditional offloading and ensuring that unhandled pragmas don't result in missing symbols. However, this is problematic for a few cases. For offloading tests we can silently fail to the host without realizing that offloading failed. Additionally, this makes it impossible to provide interoperabiility to other offloading schemes like HIP or CUDA because those methods do not provide any such host fallback guaruntee. this patch adds the `-fopenmp-offload-mandatory` flag to prevent generating the fallback symbol on the CPU and instead replaces the function with a dummy global and the failed branch with 'unreachable'. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D120353	2022-02-23 16:45:36 -05:00
Fangrui Song	e87c32e390	[Driver] Add -fno-sanitize-address-globals-dead-stripping It's customary for these options to have the -fno- form which is sometimes handy to work around issues. Using the supported driver option is preferred over the internal cl::opt option `-mllvm -asan-globals-live-support=0` Reviewed By: kstoimenov, vitalybuka Differential Revision: https://reviews.llvm.org/D120391	2022-02-23 11:51:30 -08:00
Timm Bäder	2f300d34de	[clang][driver][wasm] Fix libstdc++ target-dependent include dir The triple goes after the gcc version, not before. Also add the /backward version. Differential Revision: https://reviews.llvm.org/D120251	2022-02-23 14:38:34 +01:00
Rainer Orth	b1fc966d2e	[Driver] Support Solaris/amd64 GetTls This is the driver part of D91605 <https://reviews.llvm.org/D91605>, a workaround to allow direct calls to `__tls_get_addr` on Solaris/amd64. Tested on `amd64-pc-solaris2.11` and `sparcv9-sun-solaris2.11`. Differential Revision: https://reviews.llvm.org/D119829	2022-02-22 20:14:33 +01:00
tyb0807	650aec687e	[ARM][AArch64] Add missing v8.x checks Summary: This patch adds checks that were missing in clang for Armv8.5/6/7-A. These include: * ACLE macro defines for AArch32. * Handling of crypto and SM4, SHA and AES feature flags on clang's driver. Reviewers: dmgreen, SjoerdMeijer, tmatheson Differential Revision: https://reviews.llvm.org/D116153	2022-02-22 09:07:59 +00:00
Brad Smith	95fed2b267	[Driver][OpenBSD] Pass sysroot to the linker	2022-02-21 23:11:13 -05:00
Kito Cheng	c1f17b0a9e	[RISCV] Fix the include search path order between sysroot and resource folder (Recommit again) Resource folder[1] should include before sysroot[2] in general (Linux clang toolchain, BareMetal clang toolchain, and GCC using that order), and that prevent sysroot's header file override resource folder's one, this change is reference from BareMetal::AddClangSystemIncludeArgs@BareMetal.cpp[3]. And also fix the behavior of `-nobuiltininc`. [1] Include path from resource folder is something like this: `<toolchain-path>/lib/clang/13.0.0/include/` [2] Include path from sysroot is something like this: `<toolchain-path>/riscv32-unknown-elf/include` [3] https://github.com/llvm/llvm-project/blob/llvmorg-13.0.1/clang/lib/Driver/ToolChains/BareMetal.cpp#L193 Reviewed By: asb Differential Revision: https://reviews.llvm.org/D119837 The recommit fixes the Windows build failure due to path issue.	2022-02-21 15:25:21 +08:00
Kito Cheng	cc279529e8	Revert "[RISCV] Fix the include search path order between sysroot and resource folder (Recommit)" This reverts commit `47b1fa5fc4`.	2022-02-21 14:56:58 +08:00
Kito Cheng	47b1fa5fc4	[RISCV] Fix the include search path order between sysroot and resource folder (Recommit) Resource folder[1] should include before sysroot[2] in general (Linux clang toolchain, BareMetal clang toolchain, and GCC using that order), and that prevent sysroot's header file override resource folder's one, this change is reference from BareMetal::AddClangSystemIncludeArgs@BareMetal.cpp[3]. And also fix the behavior of `-nobuiltininc`. [1] Include path from resource folder is something like this: `<toolchain-path>/lib/clang/13.0.0/include/` [2] Include path from sysroot is something like this: `<toolchain-path>/riscv32-unknown-elf/include` [3] https://github.com/llvm/llvm-project/blob/llvmorg-13.0.1/clang/lib/Driver/ToolChains/BareMetal.cpp#L193 Reviewed By: asb Differential Revision: https://reviews.llvm.org/D119837 The recommit fixes the Windows build failure due to path issue.	2022-02-21 14:39:43 +08:00
Kito Cheng	0a17ee1ebe	Revert "[RISCV] Fix the include search path order between sysroot and resource folder" This reverts commit `079d13668b`.	2022-02-21 14:25:49 +08:00
Kito Cheng	079d13668b	[RISCV] Fix the include search path order between sysroot and resource folder Resource folder[1] should include before sysroot[2] in general (Linux clang toolchain, BareMetal clang toolchain, and GCC using that order), and that prevent sysroot's header file override resource folder's one, this change is reference from BareMetal::AddClangSystemIncludeArgs@BareMetal.cpp[3]. And also fix the behavior of `-nobuiltininc`. [1] Include path from resource folder is something like this: `<toolchain-path>/lib/clang/13.0.0/include/` [2] Include path from sysroot is something like this: `<toolchain-path>/riscv32-unknown-elf/include` [3] https://github.com/llvm/llvm-project/blob/llvmorg-13.0.1/clang/lib/Driver/ToolChains/BareMetal.cpp#L193 Reviewed By: asb Differential Revision: https://reviews.llvm.org/D119837	2022-02-21 14:06:47 +08:00
Yaxun (Sam) Liu	fa0f90bc55	[HIP] Support linking archive of bundled bitcode HIP programs compiled with -c -fgpu-rdc generate clang-offload-bundler bundles which contain bitcode for different GPU's. Such files can be archived to an archive file which can be linked with HIP programs with -fgpu-rdc. This patch adds suppor of linking archive of bundled bitcode. When an archive of bundled bitcode is passed to clang by -l, for each GPU specified through --offload-arch, clang extracts bitcode from the archive and creates a new archive for that GPU and pass it to lld. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D120070 Fixes: SWDEV-321741, SWDEV-315773	2022-02-19 18:37:44 -05:00
Joseph Huber	0870a4f59a	[OpenMP] Add flag for disabling thread state in runtime The runtime uses thread state values to indicate when we use an ICV or are in nested parallelism. This is done for OpenMP correctness, but it not needed in the majority of cases. The new flag added is `-fopenmp-assume-no-thread-state`. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D120106	2022-02-18 08:35:05 -05:00
Florian Hahn	09193f20a1	Revert "Add support for floating-point option `ffp-eval-method` and for" This reverts commit `32b73bc6ab`. This breaks builds on macOS in some configurations, because __FLT_EVAL_METHOD__ is set to an unexpected value. E.g. https://green.lab.llvm.org/green/job/clang-stage1-RA/28282/consoleFull#129538464349ba4694-19c4-4d7e-bec5-911270d8a58c More details available in the review thread https://reviews.llvm.org/D109239	2022-02-18 11:04:00 +00:00
Nico Weber	383ed82dd1	[clang] Pass more flags to ld64.lld * ld64.lld now completely supports -export_dynamic (D119372), so map -rdynamic to -export_dynamic like already done for ld64 * ld64.lld has been supporting -object_path_lto for well over a year (D92537), so pass it like already done for ld64 Differential Revision: https://reviews.llvm.org/D119612	2022-02-17 16:45:52 -05:00
Alex Brachet	5364b36868	Revert "[Driver][Fuchsia][NFC] Use GetLinkerPath to see if linker is lld" This reverts commit `b9f4dff8ab`.	2022-02-17 18:41:49 +00:00
Alex Brachet	b9f4dff8ab	[Driver][Fuchsia][NFC] Use GetLinkerPath to see if linker is lld Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D120074	2022-02-17 18:20:23 +00:00
Zahira Ammarguellat	32b73bc6ab	Add support for floating-point option `ffp-eval-method` and for `pragma clang fp eval_method`. https://reviews.llvm.org/D109239	2022-02-17 08:59:21 -08:00
Joseph Huber	64ecdc1cb1	[OpenMP] Pass AMDGPU math libraries into the linker wrapper This patch passes in the AMDPGU math libraries to the linker wrapper. The wrapper already handles linking OpenMP bitcode libraries via the `--target-library` option. This should be sufficient to link in math libraries for the accompanying architecture. Fixes #53526. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D119841	2022-02-16 16:40:40 -05:00
Joseph Huber	5781839f7a	Revert "[OpenMP] Pass AMDGPU math libraries into the linker wrapper" This hits an assertion in the linker wrapper. Revert for now, will fix later. This reverts commit `61fb260d9d`.	2022-02-16 11:54:44 -05:00
Joseph Huber	61fb260d9d	[OpenMP] Pass AMDGPU math libraries into the linker wrapper This patch passes in the AMDPGU math libraries to the linker wrapper. The wrapper already handles linking OpenMP bitcode libraries via the `--target-library` option. This should be sufficient to link in math libraries for the accompanying architecture. Fixes #53526. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D119841	2022-02-16 11:39:11 -05:00
Adrian Prantl	0604d86c07	Darwin: introduce a global override for debug prefix map entries. This patch adds a new Darwin clang driver environment variable in the spirit of RC_DEBUG_OPTIONS, called RC_DEBUG_PREFIX_MAP, which allows a meta build tool to add one additional -fdebug-prefix-map entry without the knowledge of the build system. rdar://85224675 Differential Revision: https://reviews.llvm.org/D119850	2022-02-16 08:36:26 -08:00
Peter Kasting	c5fb05f663	Reland: Make lld-link work in a non-MSVC shell, add /winsysroot: This relands `73e585e44d` (and `0574b5fc65`), with a fix for the failing test (by using Optional<StringRef>s instead of making StringRef::empty() mean absence of value). Differential Revision: https://reviews.llvm.org/D118070	2022-02-16 09:22:39 -05:00
Nico Weber	125abb61f7	Revert "Add support for floating-point option `ffp-eval-method` and for" This reverts commit `4bafe65c2b`. Breaks at least Misc/warning-flags.c, see comments on https://reviews.llvm.org/D109239	2022-02-15 22:02:25 -05:00
Zahira Ammarguellat	4bafe65c2b	Add support for floating-point option `ffp-eval-method` and for `pragma clang fp eval_method`.	2022-02-15 13:59:27 -08:00
Joseph Huber	24ecafb413	[OpenMP] Add support for CPU offloading in new driver This patch adds support for linking CPU offloading applications in the linker wrapper. We generate the necessary linking job using the host linker's path and library arguments. This may not be true for more complex offloading schemes, but this is sufficient for now. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D119613	2022-02-15 15:05:30 -05:00
Brad Smith	d241ce0f97	[Driver][NetBSD] -r: imply -nostdlib like GCC Similar to D116843 for Gnu.cpp Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D119655	2022-02-14 23:29:13 -05:00
Brad Smith	cbd9d136ef	[Driver][DragonFly] -r: imply -nostdlib like GCC Similar to D116843 for Gnu.cpp Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D119656	2022-02-14 23:24:26 -05:00
Alex Lorenz	d238acd113	[clang][driver] add clang driver support for emitting macho files with two build version load commands This patch extends clang driver to pass the right flags to the clang frontend, and ld64, so that they can emit macho files with two build version load commands. It adds a new 0darwin-target-variant option which complements -target and also can be used to specify different target variants when multi-arch compilations are invoked with multiple -arch commands. Differential Revision: https://reviews.llvm.org/D118862	2022-02-14 12:27:14 -08:00
Joseph Huber	a0e8077d28	[OpenMP][NFC] Simplify identifying the device bitcode library Now that the old device runtime has been deleted there is only a single target that differs by the triple and the architecture. Simplify the scheme for identifying the library but directly using the triple. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D119638	2022-02-12 14:55:47 -05:00
Douglas Yung	437d4e01fe	Revert "try to fix windows build after 73e585e44d" and Revert "Reland "[lld/coff] Make lld-link work in a non-MSVC shell, add /winsysroot:"" This reverts commit `0574b5fc65` and `73e585e44d`. This change is causing the test Driver/cl-options.c to fail on Windows buildbots. https://lab.llvm.org/staging/#/builders/204/builds/1343	2022-02-11 23:47:53 -08:00
Nico Weber	73e585e44d	Reland "[lld/coff] Make lld-link work in a non-MSVC shell, add /winsysroot:" This relands commit `b3b2538df1`, except that the new files in Support are instead in a new library WindowsDriver.	2022-02-11 17:07:33 -05:00
Adrian Prantl	baac665adf	Revert "[lld/coff] Make lld-link work in a non-MSVC shell, add /winsysroot:" This reverts commit `b3b2538df1`, it introduced a cycklic module depenency that broke the -DLLVM_ENABLE_MODULES=1 build.	2022-02-11 13:07:23 -08:00
Peter Kasting	b3b2538df1	[lld/coff] Make lld-link work in a non-MSVC shell, add /winsysroot: Makes lld-link work in a non-MSVC shell by autodetecting MSVC toolchain. Also adds support for /winsysroot and a few other switches. All this is done by refactoring to share code with clang-cl's existing support for the same. Differential Revision: https://reviews.llvm.org/D118070	2022-02-11 13:55:18 -05:00
Yuanfang Chen	f927021410	Reland "[clang-cl] Support the /JMC flag" This relands commit `b380a31de0`. Restrict the tests to Windows only since the flag symbol hash depends on system-dependent path normalization.	2022-02-10 15:16:17 -08:00
Yuanfang Chen	b380a31de0	Revert "[clang-cl] Support the /JMC flag" This reverts commit `bd3a1de683`. Break bots: https://luci-milo.appspot.com/ui/p/fuchsia/builders/toolchain.ci/clang-windows-x64/b8822587673277278177/overview	2022-02-10 14:17:37 -08:00
Yuanfang Chen	bd3a1de683	[clang-cl] Support the /JMC flag The introduction and some examples are on this page: https://devblogs.microsoft.com/cppblog/announcing-jmc-stepping-in-visual-studio/ The `/JMC` flag enables these instrumentations: - Insert at the beginning of every function immediately after the prologue with a call to `void __fastcall __CheckForDebuggerJustMyCode(unsigned char *JMC_flag)`. The argument for `__CheckForDebuggerJustMyCode` is the address of a boolean global variable (the global variable is initialized to 1) with the name convention `__<hash>_<filename>`. All such global variables are placed in the `.msvcjmc` section. - The `<hash>` part of `__<hash>_<filename>` has a one-to-one mapping with a directory path. MSVC uses some unknown hashing function. Here I used DJB. - Add a dummy/empty COMDAT function `__JustMyCode_Default`. - Add `/alternatename:__CheckForDebuggerJustMyCode=__JustMyCode_Default` link option via ".drectve" section. This is to prevent failure in case `__CheckForDebuggerJustMyCode` is not provided during linking. Implementation: All the instrumentations are implemented in an IR codegen pass. The pass is placed immediately before CodeGenPrepare pass. This is to not interfere with mid-end optimizations and make the instrumentation target-independent (I'm still working on an ELF port in a separate patch). Reviewed By: hans Differential Revision: https://reviews.llvm.org/D118428	2022-02-10 10:26:30 -08:00
Yuanfang Chen	b96106af3f	[AArch64][ARM] add -Wunaligned-access only for clang Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D119301	2022-02-10 10:26:30 -08:00
Rainer Orth	a6afa9e6b0	[Driver] Use libatomic for 32-bit SPARC atomics support Even after D86621 <https://reviews.llvm.org/D86621>, `clang -m32` on Solaris/sparcv9 doesn't inline atomics with 8-byte operands, unlike `gcc`. This leads to many link failures in the testsuite (undefined references to `__atomic_load_8` and `__sync_val_compare_and_swap_8`. Until a proper codegen fix can be implemented, this patch works around the first of those by linking with `-latomic`. Tested on `sparcv9-sun-solaris2.11`. Differential Revision: https://reviews.llvm.org/D118021	2022-02-10 12:40:32 +01:00
Martin Storsjö	6cf64b2d28	[clang] [MinGW] Default to DWARF 4 Neither LLDB nor GDB seem to work with DWARF 5 debug information on Windows right now. This applies the same change as in `9c62728610` (Default to DWARFv4 on Windows) to the MinGW driver too. Differential Revision: https://reviews.llvm.org/D119326	2022-02-10 10:59:05 +02:00
Muhammad Omair Javaid	cd817231ec	[clang-cl] Bump default -fms-compatibility-version to 19.14 clang-cl MSVC required version is 19.20 now. Update the default -fms-compatibility-version to 19.14. Differential Revision: https://reviews.llvm.org/D114639	2022-02-09 13:54:25 +05:00
Yaxun (Sam) Liu	1d97cb1f6e	[HIP] Emit amdgpu_code_object_version module flag code object version determines ABI, therefore should not be mixed. This patch emits amdgpu_code_object_version module flag in LLVM IR based on code object version (default 4). The amdgpu_code_object_version value is code object version times 100. LLVM IR with different amdgpu_code_object_version module flag cannot be linked. The -cc1 option -mcode-object-version=none is for ROCm device library use only, which supports multiple ABI. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D119026	2022-02-08 21:58:40 -05:00
Zakk Chen	cfe7f69036	[RISCV][NFC] Refactor RISCVISAInfo. 1. Remove computeDefaultABIFromArch and add computeDefaultABI in RISCVISAInfo. 2. Add parseFeatureBits which may used in D118333. Differential Revision: https://reviews.llvm.org/D119250	2022-02-08 18:37:43 -08:00
Bill Wendling	deaf22bc0e	[X86] Implement -fzero-call-used-regs option The "-fzero-call-used-regs" option tells the compiler to zero out certain registers before the function returns. It's also available as a function attribute: zero_call_used_regs. The two upper categories are: - "used": Zero out used registers. - "all": Zero out all registers, whether used or not. The individual options are: - "skip": Don't zero out any registers. This is the default. - "used": Zero out all used registers. - "used-arg": Zero out used registers that are used for arguments. - "used-gpr": Zero out used registers that are GPRs. - "used-gpr-arg": Zero out used GPRs that are used as arguments. - "all": Zero out all registers. - "all-arg": Zero out all registers used for arguments. - "all-gpr": Zero out all GPRs. - "all-gpr-arg": Zero out all GPRs used for arguments. This is used to help mitigate Return-Oriented Programming exploits. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D110869	2022-02-08 17:42:54 -08:00
Ahmed Bougacha	6ba68a5fc3	[clang][Driver] Use a VersionTuple for darwin linker version checks. This unifies a couple spots that did it manually by checking the flag directly. It does mean that we're now dropping the 5th component, but that's not used in any of these checks, and to my knowledge it's never been used in ld64.	2022-02-08 14:30:39 -08:00
Martin Storsjö	079b6d02d1	[clang] [MinGW] Recognize -lcrtdll as a library replacing -lmsvcrt Differential Revision: https://reviews.llvm.org/D119234	2022-02-08 21:57:07 +02:00
Amilendra Kodithuwakku	424e850f1e	[clang][ARM] Re-word PACBTI warning. The original warning added in D115501 when pacbti is used with an incompatible architecture was not exactly correct because it was not really ignored and can affect codegen. Therefore reword to say that the pacbti option is incompatible with the given architecture. Reviewed By: chill Differential Revision: https://reviews.llvm.org/D119166	2022-02-08 19:13:02 +00:00
Leonard Chan	4ac58b6102	[clang][Fuchsia] Ensure static sanitizer libs are only linked in after the -nostdlib check Differential Revision: https://reviews.llvm.org/D119201	2022-02-08 10:53:22 -08:00
Krzysztof Parzyszek	2ecda9ec9c	[Hexagon] Alter meaning of versionless -mhvx The documentation for the official (downstream) Qualcomm Hexagon Clang states that -mhvx sets the HVX version to be the same as the CPU version. The current implementation upstream would use the most recent versioned -mhvx= flag first (if present), then the CPU version. Change the upstream behavior to match the documented behavior of the downstream compiler.	2022-02-08 09:06:15 -08:00
Alex Lorenz	b58bf76f97	[clang][driver] update the darwin driver to point to correct macho_embedded path Compiler-rt started emitting the macho_embedded libraries in `<resource_dir>/lib/darwin/macho_embedded` after https://reviews.llvm.org/D105765 / `1e03c37b97`, so update the clang's driver to reflect that. Differential Revision: https://reviews.llvm.org/D115403	2022-02-07 16:50:58 -08:00
Mark Murray	3d7662142d	[ARM] Undeprecate complex IT blocks AArch32/Armv8A introduced the performance deprecation of certain patterns of IT instructions. After some debate internal to ARM, this is now being reverted; i.e. no IT instruction patterns are performance deprecated anymore, as the perfomance degredation is not significant enough. This reverts the following: "ARMv8-A deprecates some uses of the T32 IT instruction. All uses of IT that apply to instructions other than a single subsequent 16-bit instruction from a restricted set are deprecated, as are explicit references to the PC within that single 16-bit instruction. This permits the non-deprecated forms of IT and subsequent instructions to be treated as a single 32-bit conditional instruction." The deprecation no longer applies, but the behaviour may be controlled by the -arm-restrict-it and -arm-no-restrict-it command-line options, with the latter being the default. No warnings about complex IT blocks will be generated. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D118044	2022-02-07 15:47:53 +00:00
Brad Smith	1831cbd9d4	[Driver][OpenBSD] -r: imply -nostdlib like GCC Similar to D116843 for Gnu.cpp Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D119071	2022-02-07 04:07:30 -05:00
Kazu Hirata	631b94cc22	[Driver] Remove redundant string initialization (NFC) Identified with readability-redundant-string-init.	2022-02-06 10:54:42 -08:00
Alex Xu (Hello71)	38449c98f3	[Driver] Default to -fno-math-errno for musl musl does not set errno in math functions: https://wiki.musl-libc.org/mathematical-library.html, https://git.musl-libc.org/cgit/musl/tree/include/math.h?id=cfdfd5ea3ce14c6abf7fb22a531f3d99518b5a1b#n26. Reviewed By: srhines, MaskRay Differential Revision: https://reviews.llvm.org/D116753	2022-02-04 19:20:30 -08:00
Joseph Huber	280716e75f	[OpenMP] Change amdgcn to amdgpu in device library handling Summary: The name of the AMDGPU device library was changes. Previously it was called 'libomptarget-amdgcn'. This patch changes fixes the tests to use the new name of the library and adds a new flag with the same name.	2022-02-04 20:51:05 -05:00
Joseph Huber	5966c2ec02	[OpenMP] Fix mismatched device runtime name Summary: The new runtime was deleted. AMD's old runtime used the triple name `amdgcn` while the new runtime used `amdgpu`. This was not updated when the old runtime was removed causing the library to not be found on AMDGPU.	2022-02-04 16:54:31 -05:00
Joseph Huber	034adaf5be	[OpenMP] Completely remove old device runtime This patch completely removes the old OpenMP device runtime. Previously, the old runtime had the prefix `libomptarget-new-` and the old runtime was simply called `libomptarget-`. This patch makes the formerly new runtime the only runtime available. The entire project has been deleted, and all references to the `libomptarget-new` runtime has been replaced with `libomptarget-`. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D118934	2022-02-04 15:31:33 -05:00
Joseph Huber	eeb29c8477	[OpenMP] Add -Bsymbolic to arguments for GNU linker This patch adds the '-Bsymbolic' flag when we perform linking for the offloading device. We already pass '-fvisibility=protected' but this is not properly handled when using the bfd linker as is described in https://maskray.me/blog/2021-05-16-elf-interposition-and-bsymbolic. Previously this caused linker errors when creating the shared library. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D119018	2022-02-04 15:13:32 -05:00
Fangrui Song	679c77ede3	[Driver][Android] Removed obsoleted --warn-shared-textrel --warn-shared-textrel is ignored in ld.lld and obsoleted in GNU ld (https://sourceware.org/bugzilla/show_bug.cgi?id=22909). Note: binutils can be configured with --enable-textrel-check=[yes\|error] to make GNU ld error for text relocations by default, like ld.lld. Reviewed By: srhines Differential Revision: https://reviews.llvm.org/D118942	2022-02-04 09:30:11 -08:00
Yaxun (Sam) Liu	d4e4ef2e81	[HIP] Support code object v5 New device library supporting v4 and v5 has abi_version_400.bc and abi version_500.bc. For v5, abi_version_500.bc is linked. For v2-4, abi_version_400.bc is linked. For old device library, for v2-4, none of the above is linked. For v5, error is emitted about unsupported ABI version. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D118949 Fixes: SWDEV-321313	2022-02-04 09:55:08 -05:00
Joseph Huber	8cc4ca95b0	[OpenMP] Add Cuda path to linker wrapper tool The linker wrapper tool uses the 'nvlink' and 'ptxas' binaries to link and assemble device files. Previously we searched for this using the binaries in the user's path. This didn't work in cases where the user passed in a specific Cuda path to Clang. This patch changes the linker wrapper to accept an argument for the Cuda path we can get from Clang. This should fix #53573. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D118944	2022-02-03 20:39:18 -05:00
Joseph Huber	da20df2115	Revert "[OpenMP] Don't use bound architecture when checking cache on the host" This reverts commit `9138d96f8b`.	2022-02-03 17:43:10 -05:00
Joseph Huber	9138d96f8b	[OpenMP] Don't use bound architecture when checking cache on the host When we are creating jobs for the new driver we first check the cache to see if the job was already created as a part of the offloading toolchain. This would sometimes fail if the bound architecture was set for the host during offloading. We want to ingore this because it is not relevant for looking up host actions. Previously it was set on some machines and would cause the cache lookup to fail. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D118858	2022-02-03 17:17:38 -05:00
Matt Morehouse	95d609b549	[HWASan] Add __hwasan_init to .preinit_array. Fixes segfaults on x86_64 caused by instrumented code running before shadow is set up. Reviewed By: pcc Differential Revision: https://reviews.llvm.org/D118171	2022-02-03 13:07:58 -08:00
Timm Bäder	2dd35e98d3	[clang][driver][wasm] Remove unneeded default labels Fix build fallout from `b5787a0c6c`	2022-02-03 16:52:41 +01:00
Timm Bäder	b5787a0c6c	[clang][driver][wasm] Support -stdlib=libstdc++ for WebAssembly The WebAssembly toolchain currently supports only -stdlib=libc++ and implicitly assumes the c++ stdlib to be libc++. Change this to also support libstdc++. Differential Revision: https://reviews.llvm.org/D117888#3290628	2022-02-03 16:37:52 +01:00
Kirill Stoimenov	d7dd7ad827	Revert "[ASan] Not linking asan_static library for DSO." This reverts commit `cf730d8ce1`. It turned out that D118184 is causing segfaults in some situations. Reviewed By: vitalybuka, kda Differential Revision: https://reviews.llvm.org/D118739	2022-02-01 23:58:04 +00:00
Joseph Huber	9375f1563e	[OpenMP] Cleanup the Linker Wrapper Summary: Various changes and cleanup for the Linker Wrapper tool.	2022-01-31 23:11:42 -05:00
Joseph Huber	bf499c58af	[OpenMP] Implement save temps functionality in linker wrapper Summary: This patch implements the `-save-temps` flag for the linker wrapper. This allows the user to inspect the intermeditary outpout that the linker wrapper creates.	2022-01-31 23:11:42 -05:00
Joseph Huber	cb7cfaec71	[OpenMP] Add extra flag handling to linker wrapper This patch adds support for a few extra flags in the linker wrapper, such as debugging flags, verbose output, and passing arguments to ptxas. We also now forward pass remarks to the LLVM backend so they will show up in the LTO passes. Depends on D117049 Differential Revision: https://reviews.llvm.org/D117156	2022-01-31 23:11:41 -05:00
Joseph Huber	f28c3153ee	[OpenMP] Add support for embedding bitcode images in wrapper tool Summary; This patch adds support for embedding device images in the linker wrapper tool. This will be used for performing JIT functionality in the future. Depends on D117048 Differential Revision: https://reviews.llvm.org/D117049	2022-01-31 23:11:41 -05:00
Joseph Huber	3762111aa9	[OpenMP] Link the bitcode library late for device LTO Summary: This patch adds support for linking the OpenMP device bitcode library late when doing LTO. This simply passes it in as an additional device file when doing the final device linking phase with LTO. This has the advantage that we don't link it multiple times, and the device references do not get inlined and prevent us from doing needed OpenMP optimizations when we have visiblity of the whole module. Fix some failings where the implicit conversion of an Error to an Expected triggered the deleted copy constructor. Depends on D116675 Differential revision: https://reviews.llvm.org/D117048	2022-01-31 23:11:41 -05:00
Joseph Huber	c732c3df74	[OpenMP] Initial Implementation of LTO and bitcode linking in linker wrapper This patch implements the fist support for handling LTO in the offloading pipeline. The flag `-foffload-lto` is used to control if bitcode is embedded into the device. If bitcode is found in the device, the extracted files will be sent to the LTO pipeline to be linked and sent to the backend. This implementation does not separately link the device bitcode libraries yet. Depends on D116675 Differential Revision: https://reviews.llvm.org/D116975	2022-01-31 23:11:41 -05:00
Joseph Huber	95c8f74640	[Clang] Introduce Clang Linker Wrapper Tool This patch introduces a linker wrapper tool that allows us to preprocess files before they are sent to the linker. This adds a dummy action and job to the driver stage that builds the linker command as usual and then replaces the command line with the wrapper tool. Depends on D116543 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D116544	2022-01-31 15:56:04 -05:00
Joseph Huber	12ae095bbb	[OpenMP] Embed device files into the host IR This patch adds support for embedding the device object files into the host IR to create a fat binary. Each offloading file will be inserted into a section with the following naming format `.llvm.offloading.<triple>.<arch>.<filename>`. Depends on D116542 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D116543	2022-01-31 15:56:02 -05:00
Joseph Huber	2f9ace9e9a	[OpenMP] Introduce new flag to change offloading driver pipeline This patch introduces the `-fopenmp-new-driver` option which instructs the compiler to use a new driver scheme for producing offloading code. In this scheme we create a complete offloading object file and then pass it as input to the host compilation phase. This will allow us to embed the object code in the backend phase. This is the start of a series of commits to rework the OpenMP offloading driver pipeline. The goal of this is to simplify the steps required for creating an offloading program. This patch changes the driver's configuration to simply pass the device file back to the host as an input so it can be embedded as an LLVM IR global during the backend, then simply passes that object file to the linker. This driver implementation will currently create the following phases, ``` $ clang input.c -fopenmp -fopenmp-targets=nvptx64 -fopenmp-new-driver -ccc-print-phases +- 0: input, "input.c", c, (host-openmp) +- 1: preprocessor, {0}, cpp-output, (host-openmp) +- 2: compiler, {1}, ir, (host-openmp) \| \| +- 3: input, "input.c", c, (device-openmp) \| \| +- 4: preprocessor, {3}, cpp-output, (device-openmp) \| \|- 5: compiler, {4}, ir, (device-openmp) \| +- 6: offload, "host-openmp (x86_64-unknown-linux-gnu)" {2}, "device-openmp (nvptx64)" {5}, ir \| +- 7: backend, {6}, assembler, (device-openmp) \|- 8: assembler, {7}, object, (device-openmp) +- 9: offload, "host-openmp (x86_64-unknown-linux-gnu)" {2}, "device-openmp (nvptx64)" {8}, ir +- 10: backend, {9}, assembler, (host-openmp) +- 11: assembler, {10}, object, (host-openmp) 12: clang-linker-wrapper, {11}, image, (host-openmp) ``` Which will map to the following bindings ``` # "x86_64-unknown-linux-gnu" - "clang", inputs: ["input.c"], output: "/tmp/input-bae62e.bc" # "nvptx64" - "clang", inputs: ["input.c", "/tmp/input-bae62e.bc"], output: "/tmp/input-76784e.s" # "nvptx64" - "NVPTX::Assembler", inputs: ["/tmp/input-76784e.s"], output: "/tmp/input-8f29db.o" # "x86_64-unknown-linux-gnu" - "clang", inputs: ["/tmp/input-bae62e.bc", "/tmp/input-8f29db.o"], output: "/tmp/input-545450.o" # "x86_64-unknown-linux-gnu" - "Offload::Linker", inputs: ["/tmp/input-545450.o"], output: "a.out" ``` Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D116541	2022-01-31 15:55:58 -05:00
Jon Chesterfield	9b9d08111b	Set rpath on openmp executables Openmp executables need to find libomp and libomptarget at runtime. This currently requires LD_LIBRARY_PATH or the user to specify rpath. Change that to set the expected location of the openmp libraries in the install tree. Whether rpath means rpath or runpath is system dependent. The attached test shows that the Wl,--disable-new-dtags control interacts correctly with this feature. The implicit rpath field is appended to any user specified ones which is ideal. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D118493	2022-01-31 16:35:00 +00:00
Jon Chesterfield	a841a3a579	Revert "Set rpath on openmp executables" Failed some buildbots, bad assumptions about structure of install path This reverts commit `a80d5c34e4`.	2022-01-31 16:18:03 +00:00
Jon Chesterfield	a80d5c34e4	Set rpath on openmp executables Openmp executables need to find libomp and libomptarget at runtime. This currently requires LD_LIBRARY_PATH or the user to specify rpath. Change that to set the expected location of the openmp libraries in the install tree. Whether rpath means rpath or runpath is system dependent. The attached test shows that the Wl,--disable-new-dtags control interacts correctly with this feature. The implicit rpath field is appended to any user specified ones which is ideal. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D118493	2022-01-31 16:01:08 +00:00
Ben Shi	653836251a	[clang][AVR] Set '-fno-use-cxa-atexit' to default AVR is baremetal environment, so the avr-libc does not support '__cxa_atexit()'. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D118445	2022-01-30 02:26:19 +00:00
Ben Shi	ac3894cf1e	[Clang] Move XCore specific options from Clang.cpp to XCore.cpp Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D118535	2022-01-30 02:24:35 +00:00
Joseph Huber	24f88f57de	[OpenMP] Accept shortened triples for -Xopenmp-target= This patch builds on the change in D117634 that expanded the short triples when passed in by the user. This patch adds the same functionality for the `-Xopenmp-target=` flag. Previously it was unintuitive that passing `-fopenmp-targets=nvptx64 -Xopenmp-target=nvptx64 <arg>` would not forward the arg because the triples did not match on account of `nvptx64` being expanded to `nvptx64-nvidia-cuda`. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D118495	2022-01-28 18:22:17 -05:00
Daniele Castagna	6eb826567a	[Driver] Add CUDA support for --offload param The --offload option was added in D110622 to "override the default device target". When it landed it supported only HIP. This patch extends that option to support SPIR-V targets for CUDA. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D117137	2022-01-28 14:50:39 -08:00
Amilendra Kodithuwakku	1f08b08674	[clang][ARM] Emit warnings when PACBTI-M is used with unsupported architectures Branch protection in M-class is supported by - Armv8.1-M.Main - Armv8-M.Main - Armv7-M Attempting to enable this for other architectures, either by command-line (e.g -mbranch-protection=bti) or by target attribute in source code (e.g. __attribute__((target("branch-protection=..."))) ) will generate a warning. In both cases function attributes related to branch protection will not be emitted. Regardless of the warning, module level attributes related to branch protection will be emitted when it is enabled via the command-line. The following people also contributed to this patch: - Victor Campos Reviewed By: chill Differential Revision: https://reviews.llvm.org/D115501	2022-01-28 09:59:58 +00:00
Mike Hommey	fd71493ff0	Add missing namespace to PPCLinux.cpp This fixes a build failure with MSVC introduced in https://reviews.llvm.org/D112906 Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D118211	2022-01-27 09:26:46 +01:00
David Blaikie	9c62728610	Default to DWARFv4 on Windows	2022-01-26 18:01:07 -08:00
Fangrui Song	6bc20eb134	[cc1as] Remove -Wa,--compress-debug-sections=zlib-gnu It's obsoleted and unlikely used. See D117744.	2022-01-26 13:28:51 -08:00
Fangrui Song	35d15222c0	[Driver] Remove obsoleted -gz=zlib-gnu GCC added -gz=zlib-gnu in 2014 for -gz meaning change (.zdebug => SHF_COMPRESSED) and the legacy zlib-gnu hasn't gain adoption. According to Debian Code Search (`gz=zlib-gnu`), no project uses -gz=zlib-gnu (valgrind has a configure to use -gz=zlib). Any possible -gz=zlib-gnu user can switch to -gz smoothly (supported by integrated assemblers for many years; binutils 2.26). Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D117744	2022-01-26 13:26:51 -08:00
Zixu Wang	b1d946cbf7	[clang] Add an extract-api driver option This is the initial commit for the clang-extract-api RFC <https://lists.llvm.org/pipermail/cfe-dev/2021-September/068768.html> Add a new driver option `-extract-api` and associate it with a dummy (for now) frontend action to set up the initial structure for incremental works. Differential Revision: https://reviews.llvm.org/D117809	2022-01-26 11:31:12 -08:00
Qiu Chaofan	b797d5e6b2	[CMake] [Clang] Add option to specify PowerPC long double format This method introduces new CMake variable PPC_LINUX_DEFAULT_IEEELONGDOUBLE (false by default) to enable fp128 as default long double format. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D118110	2022-01-27 00:50:53 +08:00
Benjamin Kramer	f15014ff54	Revert "Rename llvm::array_lengthof into llvm::size to match std::size from C++17" This reverts commit `ef82063207`. - It conflicts with the existing llvm::size in STLExtras, which will now never be called. - Calling it without llvm:: breaks C++17 compat	2022-01-26 16:55:53 +01:00
serge-sans-paille	ef82063207	Rename llvm::array_lengthof into llvm::size to match std::size from C++17 As a conquence move llvm::array_lengthof from STLExtras.h to STLForwardCompat.h (which is included by STLExtras.h so no build breakage expected).	2022-01-26 16:17:45 +01:00
Kirill Stoimenov	cf730d8ce1	[ASan] Not linking asan_static library for DSO. Without this change DSOs fail to link because of missing asan_report_(load\|store)n functions. Reviewed By: kda Differential Revision: https://reviews.llvm.org/D118184	2022-01-26 00:13:26 +00:00
Derek Schuff	d0d8d2d572	[clang][Driver] use DWARF4 for wasm Opt into the old default of DWARF4 for now. Differential Revision: https://reviews.llvm.org/D118082	2022-01-24 15:46:54 -08:00
Qiu Chaofan	c5590396d0	[PowerPC] Emit warning for ieeelongdouble on older GNU toolchain GCC 12 should have proper support for IEEE-754 compliant 128-bit floating point in libstdc++. So warning is needed when linking against older libstdc++ versions or LLVM libc++. Glibc starts supporting float128 in both header and libraries since 2.32. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D112906	2022-01-24 15:23:28 +08:00
David Blaikie	90abe181da	Add missing function implementation from DWARF default change Fix for `d3b26dea16`	2022-01-23 21:10:16 -08:00
David Blaikie	d3b26dea16	Clang: Change the default DWARF version to 5 (except on platforms that already opt in to specific versions - SCE, Android, and Darwin using DWARFv4 explicitly, for instance)	2022-01-23 20:49:57 -08:00
serge-sans-paille	75e164f61d	[llvm] Cleanup header dependencies in ADT and Support The cleanup was manual, but assisted by "include-what-you-use". It consists in 1. Removing unused forward declaration. No impact expected. 2. Removing unused headers in .cpp files. No impact expected. 3. Removing unused headers in .h files. This removes implicit dependencies and is generally considered a good thing, but this may break downstream builds. I've updated llvm, clang, lld, lldb and mlir deps, and included a list of the modification in the second part of the commit. 4. Replacing header inclusion by forward declaration. This has the same impact as 3. Notable changes: - llvm/Support/TargetParser.h no longer includes llvm/Support/AArch64TargetParser.h nor llvm/Support/ARMTargetParser.h - llvm/Support/TypeSize.h no longer includes llvm/Support/WithColor.h - llvm/Support/YAMLTraits.h no longer includes llvm/Support/Regex.h - llvm/ADT/SmallVector.h no longer includes llvm/Support/MemAlloc.h nor llvm/Support/ErrorHandling.h You may need to add some of these headers in your compilation units, if needs be. As an hint to the impact of the cleanup, running clang++ -E -Iinclude -I../llvm/include ../llvm/lib/Support/*.cpp -std=c++14 -fno-rtti -fno-exceptions \| wc -l before: 8000919 lines after: 7917500 lines Reduced dependencies also helps incremental rebuilds and is more ccache friendly, something not shown by the above metric :-) Discourse thread on the topic: https://llvm.discourse.group/t/include-what-you-use-include-cleanup/5831	2022-01-21 13:54:49 +01:00
Joao Moreira	82af95029e	[X86] Enable ibt-seal optimization when LTO is used in Kernel Intel's CET/IBT requires every indirect branch target to be an ENDBR instruction. Because of that, the compiler needs to correctly emit these instruction on function's prologues. Because this is a security feature, it is desirable that only actual indirect-branch-targeted functions are emitted with ENDBRs. While it is possible to identify address-taken functions through LTO, minimizing these ENDBR instructions remains a hard task for user-space binaries because exported functions may end being reachable through PLT entries, that will use an indirect branch for such. Because this cannot be determined during compilation-time, the compiler currently emits ENDBRs to every non-local-linkage function. Despite the challenge presented for user-space, the kernel landscape is different as no PLTs are used. With the intent of providing the most fit ENDBR emission for the kernel, kernel developers proposed an optimization named "ibt-seal" which replaces the ENDBRs for NOPs directly in the binary. The discussion of this feature can be seen in [1]. This diff brings the enablement of the flag -mibt-seal, which in combination with LTO enforces a different policy for ENDBR placement in when the code-model is set to "kernel". In this scenario, the compiler will only emit ENDBRs to address taken functions, ignoring non-address taken functions that are don't have local linkage. A comparison between an LTO-compiled kernel binaries without and with the -mibt-seal feature enabled shows that when -mibt-seal was used, the number of ENDBRs in the vmlinux.o binary patched by objtool decreased from 44383 to 33192, and that the number of superfluous ENDBR instructions nopped-out decreased from 11730 to 540. The 540 missed superfluous ENDBRs need to be investigated further, but hypotheses are: assembly code not being taken care of by the compiler, kernel exported symbols mechanisms creating bogus address taken situations or even these being removed due to other binary optimizations like kernel's static_calls. For now, I assume that the large drop in the number of ENDBR instructions already justifies the feature being merged. [1] - https://lkml.org/lkml/2021/11/22/591 Reviewed By: xiangzhangllvm Differential Revision: https://reviews.llvm.org/D116070	2022-01-21 10:55:34 +08:00
Joseph Huber	0dfe953294	[OpenMP] Change default visibility to protected for device declarations This patch changes the special-case handling of visibility when compiling for an OpenMP target offloading device. This was orignally added as a precaution against the bug encountered in PR41826 when symbols in the device were being preempted by shared library symbols. This should instead be done by making the visibility protected by default. With protected visibility we are asserting that the symbols on the device will never be preempted or preempt another symbol pending a shared library load. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D117806	2022-01-20 21:06:26 -05:00
Alexandre Ganea	83d59e05b2	Re-land [LLD] Remove global state in lldCommon Move all variables at file-scope or function-static-scope into a hosting structure (lld::CommonLinkerContext) that lives at lldMain()-scope. Drivers will inherit from this structure and add their own global state, in the same way as for the existing COFFLinkerContext. See discussion in https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html The previous land `f860fe3622` caused issues in https://lab.llvm.org/buildbot/#/builders/123/builds/8383, fixed by `22ee510dac`. Differential Revision: https://reviews.llvm.org/D108850	2022-01-20 14:53:26 -05:00
Alexandre Ganea	5af2433e17	[clang-cl] Support the /HOTPATCH flag This patch adds support for the MSVC /HOTPATCH flag: https://docs.microsoft.com/sv-se/cpp/build/reference/hotpatch-create-hotpatchable-image?view=msvc-170&viewFallbackFrom=vs-2019 The flag is translated to a new -fms-hotpatch flag, which in turn adds a 'patchable-function' attribute for each function in the TU. This is then picked up by the PatchableFunction pass which would generate a TargetOpcode::PATCHABLE_OP of minsize = 2 (which means the target instruction must resolve to at least two bytes). TargetOpcode::PATCHABLE_OP is only implemented for x86/x64. When targetting ARM/ARM64, /HOTPATCH isn't required (instructions are always 2/4 bytes and suitable for hotpatching). Additionally, when using /Z7, we generate a 'hot patchable' flag in the CodeView debug stream, in the S_COMPILE3 record. This flag is then picked up by LLD (or link.exe) and is used in conjunction with the linker /FUNCTIONPADMIN flag to generate extra space before each function, to accommodate for live patching long jumps. Please see: `d703b92296/lld/COFF/Writer.cpp (L1298)` The outcome is that we can finally use Live++ or Recode along with clang-cl. NOTE: It seems that MSVC cl.exe always enables /HOTPATCH on x64 by default, although if we did the same I thought we might generate sub-optimal code (if this flag was active by default). Additionally, MSVC always generates a .debug$S section and a S_COMPILE3 record, which Clang doesn't do without /Z7. Therefore, the following MSVC command-line "cl /c file.cpp" would have to be written with Clang such as "clang-cl /c file.cpp /HOTPATCH /Z7" in order to obtain the same result. Depends on D43002, D80833 and D81301 for the full feature. Differential Revision: https://reviews.llvm.org/D116511	2022-01-20 12:57:19 -05:00
Sander de Smalen	990bab89ff	[ScalableVectors] Warn instead of error for invalid size requests. This was intended to be fixed by D98856, but that only seemed to have the desired behaviour when compiling to assembly using `-S`, not when compiling into an object file or executable. Given that this was not the intention of D98856, this patch fixes the behaviour.	2022-01-20 16:42:08 +00:00
Mubashar Ahmad	35737df4dc	[Clang][AArch64][ARM] Unaligned Access Warning Added Added warning for potential cases of unaligned access when option -mno-unaligned-access has been specified Differential Revision: https://reviews.llvm.org/D116221	2022-01-20 14:12:49 +00:00
Johannes Doerfert	6f2ee1ca5e	[OpenMP][AMDGPU] Optimize the linked in math libraries Once we linked in math files, potentially even if we link in only other "system libraries", we want to optimize the code again. This is not only reasonable but also helps to hide various problems with the missing attribute annotations in the math libraries. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D116906	2022-01-19 23:36:36 -06:00
Joseph Huber	28d718602a	[OpenMP] Expand short verisions of OpenMP offloading triples The OpenMP offloading libraries are built with fixed triples and linked in during compile time. This would cause un-helpful errors if the user passed in the wrong expansion of the triple used for the bitcode library. because we only support these triples for OpenMP offloading we can normalize them to the full verion used in the bitcode library. Reviewed By: jdoerfert, JonChesterfield Differential Revision: https://reviews.llvm.org/D117634	2022-01-19 20:26:37 -05:00
Joseph Huber	a9935b5db7	[openmp] Unconditionally set march commandline argument Extracted from D117246. This reflects the march value used by the compile back into the toolchain arguments, letting downstream processes such as LTO rely on it being present. Subsequent patches should also be able to remove the two other calls to checkSystemForAMDGPU. Reviewed By: jonchesterfield Differential Revision: https://reviews.llvm.org/D117706	2022-01-19 19:14:47 +00:00
Masoud Ataei	d261660af9	Fix the use of -fno-approx-func along with -Ofast or -ffast-math Fix how -fapprox-func interact correctly with the other floating point options. Reported bug Number 52565: https://bugs.llvm.org/show_bug.cgi?id=52565 Differential: https://reviews.llvm.org/D114564 Reviewer: @andrew.w.kaylor	2022-01-19 08:05:08 -08:00
Qichao Gu	67ac3f1fbe	[Driver] Pass the flag -dI to cc1 invocation Hook up the flag -dI in the driver to pass it to cc1 invocation. Differential Revision: https://reviews.llvm.org/D117292	2022-01-18 06:16:44 -08:00
Kagami Sascha Rosylight	9c195bae31	[clang] Add include path for cppwinrt on Windows SDK 10.0.17134+ This fixes https://github.com/llvm/llvm-project/issues/53112 by adding cppwinrt to the include path when the SDK version is higher than 10.0.17134.0. Differential revision: https://reviews.llvm.org/D117407	2022-01-18 09:14:23 +01:00
Fangrui Song	427d3b93ee	[Driver][FreeBSD] -r: imply -nostdlib like GCC Similar to D116843 for Gnu.cpp Reviewed By: dim Differential Revision: https://reviews.llvm.org/D117388	2022-01-16 19:44:48 -08:00
Kevin Athey	a0458b531c	Add -fsanitize-address-param-retval to clang. With the introduction of this flag, it is no longer necessary to enable noundef analysis with 4 separate flags. (-Xclang -enable-noundef-analysis -mllvm -msan-eager-checks=1). This change only covers the introduction into the compiler. This is a follow up to: https://reviews.llvm.org/D116855 Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D116633	2022-01-14 00:41:28 -08:00
Fangrui Song	e289561205	[Driver][Fuchsia] -r: imply -nostdlib like GCC Similar to D116843. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D116844	2022-01-13 15:49:19 -08:00
Fangrui Song	64da6eb065	[Driver][Gnu] -r: imply -nostdlib like GCC See `gcc -dumpspecs` that -r essentially implies -nostdlib and suppresses default -l* and crt*.o. The behavior makes sense because otherwise there will be assuredly conflicting definitions when the relocatable output is linked into the final executable/shared object. Reviewed By: thesamesam, phosek Differential Revision: https://reviews.llvm.org/D116843	2022-01-13 11:25:23 -08:00
Kirill Stoimenov	a3b9edf8b8	[ASan] Driver changes to always link-in asan_static library. This enables the changes from D116182. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D116670	2022-01-11 15:31:41 +00:00
Anastasia Stulova	0eef65028e	[SPIR-V] Remove unused variable	2022-01-11 13:45:59 +00:00
Anastasia Stulova	dbb8d08637	[SPIR-V] Add linking using spirv-link. Add support of linking files compiled into SPIR-V objects using spirv-link. Command line inteface examples: clang --target=spirv64 test1.cl test2.cl clang --target=spirv64 test1.cl -o test1.o clang --target=spirv64 test1.o test2.cl -o test_app.out This works independently from the SPIR-V generation method (via an external tool or an internal backend) and applies to either approach that is being used. Differential Revision: https://reviews.llvm.org/D116266	2022-01-11 13:11:38 +00:00
Martin Storsjö	50ec1306d0	[clang] Add --start-no-unused-arguments/--end-no-unused-arguments to silence some unused argument warnings When passing a set of flags to configure defaults for a specific target (similar to the cmake settings `CLANG_DEFAULT_RTLIB`, `CLANG_DEFAULT_UNWINDLIB`, `CLANG_DEFAULT_CXX_STDLIB` and `CLANG_DEFAULT_LINKER`, but without hardcoding them in the binary), some of the flags may cause warnings (e.g. `-stdlib=` when compiling C code). Allow requesting selectively ignoring unused arguments among some of the arguments on the command line, without needing to resort to `-Qunused-arguments` or `-Wno-unused-command-line-argument`. Fix up the existing diagnostics.c testcase. It was added in response to PR12181 to fix handling of `-Werror=unused-command-line-argument`, but the command line option in the test (`-fzyzzybalubah`) now triggers "error: unknown argument" instead of the intended warning. Change it into a linker input (`-lfoo`) which triggers the intended diagnostic. Extend the existing test case to check more cases and make sure that it keeps testing the intended case. Add testing of the new option to this existing test. Differential Revision: https://reviews.llvm.org/D116503	2022-01-11 09:22:00 +02:00
Yaxun (Sam) Liu	98ab43a1d2	[HIP] Fix device only linking for -fgpu-rdc Currently when -fgpu-rdc is specified, HIP toolchain always does host linking even if --cuda-device-only is specified. This patch fixes that. Only device linking is performed when --cuda-device-only is specified. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D116840	2022-01-10 17:38:02 -05:00
Archibald Elliott	3aec4b3d34	Revert "Unaligned Access Warning Added" This reverts commits: - `2cd2600aba` - `11c67e5a4e` Due to test failures on Windows.	2022-01-07 13:07:30 +00:00
Archibald Elliott	11c67e5a4e	[clang][driver] Don't pass -Wunaligned-access to cc1as This is to fix some failing assembler tests.	2022-01-07 10:45:26 +00:00
Mubashar Ahmad	2cd2600aba	Unaligned Access Warning Added Added warning for potential cases of unaligned access when option -mno-unaligned-access has been specified	2022-01-07 09:54:20 +00:00
Qiu Chaofan	c2cc70e4f5	[NFC] Fix endif comments to match with include guard	2022-01-07 15:52:59 +08:00
Collin Baker	7e08a12088	[clang] Fall back on Android triple w/o API level for runtimes search Clang searches for runtimes (e.g. libclang_rt) first in a subdirectory named for the target triple (corresponding to LLVM_ENABLE_PER_TARGET_RUNTIME_DIR=ON), then if it's not found uses .../lib/<os>/libclang_rt with a suffix corresponding to the arch and environment name. Android triples optionally include an API level indicating the minimum Android version to be run on (e.g. aarch64-unknown-linux-android21). When compiler-rt is built with LLVM_ENABLE_PER_TARGET_RUNTIME_DIR=ON this API level is part of the output path. Linking code built for a later API level against a runtime built for an earlier one is safe. In projects with several API level targets this is desireable to avoid re-building the same runtimes many times. This is difficult with the current runtime search method: if the API levels don't exactly match Clang gives up on the per-target runtime directory path. To enable this more simply, this change tries target triple without the API level before falling back on the old layout. Another option would be to try every API level in the triple, e.g. check aarch-64-unknown-linux-android21, then ...20, then ...19, etc. Differential Revision: https://reviews.llvm.org/D115049	2022-01-05 16:00:48 -05:00
Nico Weber	085f078307	Revert "Revert D109159 "[amdgpu] Enable selection of `s_cselect_b64`."" This reverts commit `859ebca744`. The change contained many unrelated changes and e.g. restored unit test failes for the old lld port.	2022-01-05 13:10:25 -05:00
David Salinas	859ebca744	Revert D109159 "[amdgpu] Enable selection of `s_cselect_b64`." This reverts commit `640beb38e7`. That commit caused performance degradtion in Quicksilver test QS:sGPU and a functional test failure in (rocPRIM rocprim.device_segmented_radix_sort). Reverting until we have a better solution to s_cselect_b64 codegen cleanup Change-Id: Ibf8e397df94001f248fba609f072088a46abae08 Reviewed By: kzhuravl Differential Revision: https://reviews.llvm.org/D115960 Change-Id: Id169459ce4dfffa857d5645a0af50b0063ce1105	2022-01-05 17:57:32 +00:00
Saiyedul Islam	32357266fd	[Clang][NFC] Fix multiline comment prefixes in function headers Cleanup of D105191 after latest clang-format changes. Reviewed By: MyDeveloperDay Differential Revision: https://reviews.llvm.org/D111545	2022-01-04 11:51:31 +00:00
Mikael Holmen	304d30bc59	[clang] Fix warning about unused variable [NFC]	2022-01-04 07:28:16 +01:00
Alexandre Ganea	e32936aef4	[MSVC] Silence -Wnon-virtual-dtor on DIA APIs Differential Revision: https://reviews.llvm.org/D116313	2022-01-03 13:29:08 -05:00
Tomas Matheson	4435d1819e	[ARM][AArch64] clang support for Armv9.3-A This patch introduces support for targetting the Armv9.3-A architecture, which should map to the existing Armv8.8-A extensions. Differential Revision: https://reviews.llvm.org/D116159	2022-01-03 16:02:36 +00:00
Martin Storsjö	a8877c5ccc	[clang] [MinGW] Pass --no-demangle through to the mingw linker Clang has custom handling of --no-demangle, where it is removed from the input -Wl and -Xlinker options, and readded specifically by the drivers where it's known to be supported. Both ld.bfd and lld support the --no-demangle option. This handles the option in the same way as in ToolChains/Gnu.cpp. Differential Revision: https://reviews.llvm.org/D114064	2022-01-03 00:22:40 +02:00
Kazu Hirata	d677a7cb05	[clang] Remove redundant member initialization (NFC) Identified with readability-redundant-member-init.	2022-01-02 10:20:23 -08:00
Markus Böck	dbeeb136ab	[clang][MinGW] Explicitly ignore `-fPIC` & friends GCC on Windows ignores this flag completely [0] which some build systems sadly rely on when compiling for Windows using MinGW. The current behaviour of clang however is to error out as -fPIC & friends has no effect on Windows. This patch instead changes the behaviour for MinGW to ignore the option for the sake of compatibility Fixes https://github.com/llvm/llvm-project/issues/52947 [0] https://gcc.gnu.org/legacy-ml/gcc-patches/2015-08/msg00836.html Differential Revision: https://reviews.llvm.org/D116485	2022-01-02 12:06:54 +01:00
Kazu Hirata	f4ffcab178	Remove redundant string initialization (NFC) Identified by readability-redundant-string-init.	2022-01-01 12:34:11 -08:00
Simon Tatham	d50072f74e	[ARM] Introduce an empty "armv8.8-a" architecture. This is the first commit in a series that implements support for "armv8.8-a" architecture. This should contain all the necessary boilerplate to make the 8.8-A architecture exist from LLVM and Clang's point of view: it adds the new arch as a subtarget feature, a definition in TargetParser, a name on the command line, an appropriate set of predefined macros, and adds appropriate tests. The new architecture name is supported in both AArch32 and AArch64. However, in this commit, no actual _functionality_ is added as part of the new architecture. If you specify -march=armv8.8a, the compiler will accept it and set the right predefines, but generate no code any differently. Differential Revision: https://reviews.llvm.org/D115694	2021-12-31 16:43:53 +00:00
Random	2edcde00cb	[MIPS] Add -mfix4300 flag to enable vr4300 mulmul bugfix pass Early revisions of the VR4300 have a hardware bug where two consecutive multiplications can produce an incorrect result in the second multiply. This revision adds the `-mfix4300` flag to llvm (and clang) which, when passed, provides a software fix for this issue. More precise description of the "mulmul" bug: ``` mul.[s,d] fd,fs,ft mul.[s,d] fd,fs,ft or [D]MULT[U] rs,rt ``` When the above sequence is executed by the CPU, if at least one of the source operands of the first mul instruction happens to be `sNaN`, `0` or `Infinity`, then the second mul instruction may produce an incorrect result. This can happen both if the two mul instructions are next to each other and if the first one is in a delay slot and the second is the first instruction of the branch target. Description of the fix: This fix adds a backend pass to llvm which scans for mul instructions in each basic block and inserts a nop whenever the following conditions are met: - The current instruction is a single or double-precision floating-point mul instruction. - The next instruction is either a mul instruction (any kind) or a branch instruction. Differential Revision: https://reviews.llvm.org/D116238	2021-12-31 15:59:44 +03:00
Kazu Hirata	298367ee6e	[clang] Use nullptr instead of 0 or NULL (NFC) Identified with modernize-use-nullptr.	2021-12-29 08:34:20 -08:00
Kazu Hirata	1b329fe282	[clang] Remove unused "using" (NFC)	2021-12-29 08:27:29 -08:00
Nick Desaulniers	cd284b7ac0	[clang][ARM] re-use arm::isHardTPSupported for hardware TLS check This conditional check for -mstack-protector-guard=tls got out of sync with the conditional check for -mtp=cp15 by me in D114116, because I forgot about the similar check added in D113026. Re-use the code in arm::isHardTPSupported so that these aren't out of sync. Interestingly, our CI reported this when testing -mstack-protector-guard=tls; it was only reproducible with Debian's LLVM and not upstream LLVM due to this out of tree patch: https://salsa.debian.org/pkg-llvm-team/llvm-toolchain/-/blob/snapshot/debian/patches/930008-arm.diff Fixes: https://github.com/ClangBuiltLinux/linux/issues/1502 Reviewed By: ardb Differential Revision: https://reviews.llvm.org/D116233	2021-12-28 13:28:34 -08:00
Kazu Hirata	31cfb3f4f6	[clang] Remove redundant calls to c_str() (NFC) Identified with readability-redundant-string-cstr.	2021-12-26 13:31:40 -08:00
Kazu Hirata	2d303e6781	Remove redundant return and continue statements (NFC) Identified with readability-redundant-control-flow.	2021-12-24 23:17:54 -08:00
Krasimir Georgiev	969a51ff36	Revert "[ASan] Moved optimized callbacks into a separate library." We need some internal updates for this, shared directly with the author. This reverts commit `71b3bfde9c`.	2021-12-24 12:01:36 +01:00
Kirill Stoimenov	71b3bfde9c	[ASan] Moved optimized callbacks into a separate library. This will allow linking in the callbacks directly instead of using PLT. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D116182	2021-12-24 00:40:44 +00:00
Krzysztof Parzyszek	a67c0fc1fb	[Hexagon] Revamp HVX flag verification in driver Generalize warning/error messages (for reuse), refactor flag verification code, rewrite HVX flag driver testcase.	2021-12-23 15:18:08 -08:00
Nathan Chancellor	be8180af58	[clang][driver] Warn when '-mno-outline-atomics' is used with a non-AArch64 triple The Linux kernel has a make macro called cc-option that invokes the compiler with an option in isolation to see if it is supported before adding it to CFLAGS. The exit code of the compiler is used to determine if the flag is supported and should be added to the compiler invocation. A call to cc-option with '-mno-outline-atomics' was added to prevent linking errors with newer GCC versions but this call succeeds with a non-AArch64 target because there is no warning from clang with '-mno-outline-atomics', just '-moutline-atomics'. Because the call succeeds and adds '-mno-outline-atomics' to the compiler invocation, there is a warning from LLVM because the 'outline-atomics target feature is only supported by the AArch64 backend. $ echo \| clang -target x86_64 -moutline-atomics -Werror -x c -c -o /dev/null - clang-14: error: The 'x86_64' architecture does not support -moutline-atomics; flag ignored [-Werror,-Woption-ignored] $ echo $? 1 $ echo \| clang -target x86_64 -mno-outline-atomics -Werror -x c -c -o /dev/null - '-outline-atomics' is not a recognized feature for this target (ignoring feature) $ echo $? 0 This does not match GCC's behavior, which errors when the flag is added to a non-AArch64 target. $ echo \| gcc -moutline-atomics -x c -c -o /dev/null - gcc: error: unrecognized command-line option ‘-moutline-atomics’; did you mean ‘-finline-atomics’? $ echo \| gcc -mno-outline-atomics -x c -c -o /dev/null - gcc: error: unrecognized command-line option ‘-mno-outline-atomics’; did you mean ‘-fno-inline-atomics’? $ echo \| aarch64-linux-gnu-gcc -moutline-atomics -x c -c -o /dev/null - $ echo \| aarch64-linux-gnu-gcc -mno-outline-atomics -x c -c -o /dev/null - To get closer to GCC's behavior, issue a warning when '-mno-outline-atomics' is used without an AArch64 triple and do not add '{-,+}outline-atomic" to the list of target features in these cases. Link: https://github.com/ClangBuiltLinux/linux/issues/1552 Reviewed By: melver, nickdesaulniers Differential Revision: https://reviews.llvm.org/D116128	2021-12-23 12:36:42 -07:00
Krzysztof Parzyszek	1d1b5efdef	[Hexagon] Driver/preprocessor options for Hexagon v69	2021-12-23 10:17:08 -08:00
Kirill Stoimenov	4bf31659fa	Revert "[ASan] Moved optimized callbacks into a separate library." This reverts commit `ab3640aa0e`. Reviewed By: kstoimenov Differential Revision: https://reviews.llvm.org/D116223	2021-12-23 17:13:18 +00:00
Kirill Stoimenov	ab3640aa0e	[ASan] Moved optimized callbacks into a separate library. This will allow linking in the callbacks directly instead of using PLT. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D116182	2021-12-23 16:40:36 +00:00
Anastasia Stulova	0045d01af9	[SPIR-V] Add a toolchain for SPIR-V in clang This patch adds a toolchain (TC) for SPIR-V along with the following changes in Driver and base ToolChain and Tool. This is required to provide a mechanism in clang to bypass SPIR-V backend in LLVM for SPIR-V until it lands in LLVM and matures. The SPIR-V code is generated by the SPIRV-LLVM translator tool named 'llvm-spirv' that is sought in 'PATH'. The compilation phases/actions should be bound for SPIR-V in the meantime as following: compile -> tools::Clang backend -> tools::SPIRV::Translator assemble -> tools::SPIRV::Translator However, Driver’s ToolSelector collapses compile-backend-assemble and compile-backend sequences to tools::Clang. To prevent this, added new {use,has}IntegratedBackend properties in ToolChain and Tool to which the ToolSelector reacts on, and which SPIR-V TC overrides. Linking of multiple input files is currently not supported but can be added separately. Differential Revision: https://reviews.llvm.org/D112410 Co-authored-by: Henry Linjamäki <henry.linjamaki@parmance.com>	2021-12-23 15:10:09 +00:00
Alexandre Ganea	a282ea4898	Reland - [CodeView] Emit S_OBJNAME record Reland integrates build fixes & further review suggestions. Thanks to @zturner for the initial S_OBJNAME patch! Differential Revision: https://reviews.llvm.org/D43002	2021-12-21 19:02:14 -05:00
Alexandre Ganea	5bb5142e80	Revert [CodeView] Emit S_OBJNAME record Also revert all subsequent fixes: - `abd1cbf5e5` [Clang] Disable debug-info-objname.cpp test on Unix until I sort out the issue. - `00ec441253` [Clang] debug-info-objname.cpp test: explictly encode a x86 target when using %clang_cl to avoid falling back to a native CPU triple. - `cd407f6e52` [Clang] Fix build by restricting debug-info-objname.cpp test to x86.	2021-12-21 19:02:14 -05:00
Alexandre Ganea	d26520f6f7	[Clang] Own the CommandLineArgs in CodeGenOptions Fixes PR52704 : https://github.com/llvm/llvm-project/issues/52704 Differential Revision: https://reviews.llvm.org/D116011	2021-12-21 17:41:35 -05:00
Alexandre Ganea	f44e3fbadd	[CodeView] Emit S_OBJNAME record Thanks to @zturner for the initial patch! Differential Revision: https://reviews.llvm.org/D43002	2021-12-21 09:26:36 -05:00
Yaxun (Sam) Liu	a6786cdd57	[HIPSPV][3/4] Enable SPIR-V emission for HIP This patch enables SPIR-V binary emission for HIP device code via the HIPSPV tool chain. ‘--offload’ option, which is envisioned in [1], is added for specifying offload targets. This option is used to override default device target (amdgcn-amd-amdhsa) for HIP compilation for emitting device code as SPIR-V binary. The option is handled in getHIPOffloadTargetTriple(). getOffloadingDeviceToolChain() function (based on the design in the SYCL repository) is added to select HIPSPVToolChain when HIP offload target is ‘spirv64’. The HIPActionBuilder is modified to produce LLVM IR at the backend phase. HIPSPV tool chain expects to receive HIP device code as LLVM IR so it can run external LLVM passes over them. HIPSPV TC is also responsible for emitting the SPIR-V binary. A Cuda GPU architecture ‘generic’ is added. The name is picked from the LLVM SPIR-V Backend. In the HIPSPV code path the architecture name is inserted to the bundle entry ID as target ID. Target ID is expected to be always present so a component in the target triple is not mistaken as target ID. Tests are added for checking the HIPSPV tool chain. [1]: https://lists.llvm.org/pipermail/cfe-dev/2020-December/067362.html Patch by: Henry Linjamäki Reviewed by: Yaxun Liu, Artem Belevich, Alexey Bader Differential Revision: https://reviews.llvm.org/D110622	2021-12-20 10:45:09 -05:00
Ed Maste	b41bb6c1b7	[Driver] Default to contemporary FreeBSD profiling behaviour Prior to FreeBSD 14, FreeBSD provided special _p.a libraries for use with -pg. They are no longer used or provided. If the target does not specify a major version (e.g. amd64-unknown-freebsd, rather than amd64-unknown-freebsd12) default to the new behaviour. Differential Revision: https://reviews.llvm.org/D114396	2021-12-15 09:05:35 -05:00
Henry Linjamäki	4e94cba5b4	[HIPSPV][2/4] Add HIPSPV tool chain This patch adds a new tool chain, HIPSPVToolChain, for emitting HIP device code as SPIR-V binary. The SPIR-V binary is emitted by using an external tool, SPIRV-LLVM-Translator, temporarily. We intend to switch the translator to the llc tool when the SPIR-V backend lands on LLVM and proves to work well on HIP implementations which consume SPIR-V. Before the SPIR-V emission the tool chain loads an optional external pass plugin, either automatically from a HIP installation or from a path pointed by --hipspv-pass-plugin, and runs passes that are meant to expand/lower HIP features that do not have direct counterpart in SPIR-V (e.g. dynamic shared memory). Code emission for SPIR-V will be enabled and HIPSPVToolChain tests will be added in the follow up patch part 3. Other changes: New option ‘-nohipwrapperinc’ is added to exclude HIP include wrappers. The reason for the addition is that they cause compile errors when compiling HIP sources for the host side for HIPCL and HIPLZ implementations. New option is added to avoid this issue. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D110618	2021-12-14 10:22:38 -08:00
Fangrui Song	1042de9058	[Driver] Add CLANG_DEFAULT_PIE_ON_LINUX to emulate GCC --enable-default-pie In 2015-05, GCC added the configure option `--enable-default-pie`. When enabled, * in the absence of -fno-pic/-fpie/-fpic (and their upper-case variants), -fPIE is the default. * in the absence of -no-pie/-pie/-shared/-static/-static-pie, -pie is the default. This has been adopted by all(?) major distros. I think default PIE is the majority in the Linux world, but --disable-default-pie users is not that uncommon because GCC upstream hasn't switched the default yet (https://gcc.gnu.org/PR103398). This patch add CLANG_DEFAULT_PIE_ON_LINUX which allows distros to use default PIE. The option is justified as its adoption can be very high among Linux distros to make Clang default match GCC, and is likely a future-new-default, at which point we will remove CLANG_DEFAULT_PIE_ON_LINUX. The lit feature `default-pie-on-linux` can be handy to exclude default PIE sensitive tests. Reviewed By: foutrelis, sylvestre.ledru, thesamesam Differential Revision: https://reviews.llvm.org/D113372	2021-12-14 10:09:00 -08:00
Yaxun (Sam) Liu	006fb62434	Fix build failure of HIPUtility.cpp on Windows	2021-12-13 11:53:06 -05:00
Yaxun (Sam) Liu	240be6541d	Fix warning about unused variable in HIPAMD.cpp	2021-12-13 11:25:48 -05:00
Yaxun (Sam) Liu	78b0f3701d	[HIPSPV][1/4] Refactor HIP tool chain This patch refactors the HIP tool chain for new HIP tool chain, HIPSPV tool chain, which is added in the follow up patch part 2. Rename HIPToolChain to HIPAMDToolChain and Renames HIP.* files to HIPAMD.. Introduce HIPUtility. file where common HIP utilities, shared among HIP tool chain implementations, are placed in. Move constructHIPFatbinCommand() and constructGenerateObjFileFromHIPFatBinary() to HIPUtility. HIPSPV tool chain is going to use them. Tweak bundle target ID in constructHIPFatbinCommand(): extra dashes are dropped if the Target ID is empty and 'hip' offload kind is made default for non-AMD targets. Patch by: Henry Linjamäki Reviewed by: Yaxun Liu, Artem Belevich, Eric Christopher Differential Revision: https://reviews.llvm.org/D110549	2021-12-13 10:50:25 -05:00
Kazu Hirata	c2bb9637d9	Use llvm::any_of and llvm::all_of (NFC)	2021-12-11 11:54:37 -08:00
Zakk Chen	57b5f4b2ec	[RISCV][Clang] Compute the default target-abi if it's empty. Every generated IR has a corresponding target-abi value, so encoding a non-empty value would improve the robustness and correctness. Reviewed By: asb, jrtc27, arichardson Differential Revision: https://reviews.llvm.org/D105555	2021-12-10 08:54:23 -08:00
Archibald Elliott	52faad83c9	[AArch64] Use Feature for A53 Erratum 835769 Fix When this pass was originally implemented, the fix pass was enabled using a llvm command-line flag. This works fine, except in the case of LTO, where the flag is not passed into the linker plugin in order to enable the function pass in the LTO backend. Now LTO exists, the expectation now is to use target features rather than command-line arguments to control code generation, as this ensures that different command-line arguments in different files are correctly represented, and target-features always get to the LTO plugin as they are encoded into LLVM IR. The fall-out of this change is that the fix pass has to always be added to the backend pass pipeline, so now it makes no changes if the function does not have the right target feature to enable it. This should make a minimal difference to compile time. One advantage is it's now much easier to enable when compiling for a Cortex-A53, as CPUs imply their own individual sets of target-features, in a more fine-grained way. I haven't done this yet, but it is an option, if the fix should be enabled in more places. Existing tests of the user interface are unaffected, the changes are to reflect that the argument is now turned into a target feature. Reviewed By: tmatheson Differential Revision: https://reviews.llvm.org/D114703	2021-12-10 15:09:59 +00:00
Brian Cain	1e68c79987	Reapply [xray] add support for hexagon Adds x-ray support for hexagon to llvm codegen, clang driver, compiler-rt libs. Differential Revision: https://reviews.llvm.org/D113638 Reapplying this after `543a9ad7c4`, which fixes the leak introduced there.	2021-12-10 05:32:28 -08:00
Brian Cain	ab28cb1c5c	Revert "[xray] add support for hexagon" This reverts commit `543a9ad7c4`.	2021-12-09 07:30:40 -08:00
Brian Cain	543a9ad7c4	[xray] add support for hexagon Adds x-ray support for hexagon to llvm codegen, clang driver, compiler-rt libs. Differential Revision: https://reviews.llvm.org/D113638	2021-12-09 05:47:53 -08:00
James Farrell	219672b8dd	Revert "Revert "Use VersionTuple for parsing versions in Triple, fixing issues that caused the original change to be reverted. This makes it possible to distinguish between "16" and "16.0" after parsing, which previously was not possible."" This reverts commit `63a6348cad`. Differential Revision: https://reviews.llvm.org/D115254	2021-12-07 23:15:21 +00:00
Yaxun (Sam) Liu	3b172f60c6	[HIP] Fix -fgpu-rdc for Windows This patch fixes issues for -fgpu-rdc for Windows MSVC toolchain: Fix COFF specific section flags and remove section types in llvm-mc input file for Windows. Escape fatbin path in llvm-mc input file. Add -triple option to llvm-mc. Put __hip_gpubin_handle in comdat when it has linkonce_odr linkage. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D115039	2021-12-06 16:42:23 -05:00
Nick Desaulniers	73ee4e1cbd	[clang][ARM] only check -mtp=cp15 for non-asm sources This diagnostic is really to highlight lack of support for hard thread pointers in post-RA instruction scheduling for non-armv6k+ targets; something that isn't run for assembler sources. Fixes: https://github.com/ClangBuiltLinux/linux/issues/1502 Link: https://lore.kernel.org/all/814585495.6773.1636629846970@jenkins.jenkins/ Reviewed By: ardb Differential Revision: https://reviews.llvm.org/D114124	2021-12-06 11:31:23 -08:00
James Farrell	63a6348cad	Revert "Use VersionTuple for parsing versions in Triple, fixing issues that caused the original change to be reverted. This makes it possible to distinguish between "16" and "16.0" after parsing, which previously was not possible." This reverts commit `5032467034`.	2021-12-06 17:35:26 +00:00
Jon Chesterfield	6bb2a4f3e6	[openmp] Default to new rtl for amdgpu Reverts D114965 as the compiler backend appears to be working again Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D115157	2021-12-06 16:56:14 +00:00
Simon Moll	f6ba645039	Revert "[Clang] Ignore CLANG_DEFAULT_LINKER for custom-linker toolchains" Reverted until all Toolchains are fixed for the new behavior. This reverts commit `34a43f2115`.	2021-12-06 16:44:36 +01:00
James Farrell	5032467034	Use VersionTuple for parsing versions in Triple, fixing issues that caused the original change to be reverted. This makes it possible to distinguish between "16" and "16.0" after parsing, which previously was not possible. This reverts commit `40d5eeac6c`. Differential Revision: https://reviews.llvm.org/D114885	2021-12-06 14:57:47 +00:00
Simon Moll	34a43f2115	[Clang] Ignore CLANG_DEFAULT_LINKER for custom-linker toolchains Before, the CLANG_DEFAULT_LINKER cmake option was a global override for the linker that shall be used on all toolchains. The linker binary specified that way may not be available on toolchains with custom linkers. Eg, the only linker for VE is named 'nld' - any other linker invalidates the toolchain. This patch removes the hard override and instead lets the generic toolchain implementation default to CLANG_DEFAULT_LINKER. Toolchains can now deviate with a custom linker name or deliberatly default to CLANG_DEFAULT_LINKER. Reviewed By: MaskRay, phosek Differential Revision: https://reviews.llvm.org/D115045	2021-12-06 13:31:51 +01:00
Ties Stuij	0fbb17458a	[ARM] Implement setjmp BTI placement for PACBTI-M This patch intends to guard indirect branches performed by longjmp by inserting BTI instructions after calls to setjmp. Calls with 'returns-twice' are lowered to a new pseudo-instruction named t2CALL_BTI that is later expanded to a bundle of {tBL,t2BTI}. This patch is part of a series that adds support for the PACBTI-M extension of the Armv8.1-M architecture, as detailed here: https://community.arm.com/arm-community-blogs/b/architectures-and-processors-blog/posts/armv8-1-m-pointer-authentication-and-branch-target-identification-extension The PACBTI-M specification can be found in the Armv8-M Architecture Reference Manual: https://developer.arm.com/documentation/ddi0553/latest The following people contributed to this patch: - Alexandros Lamprineas - Ties Stuij Reviewed By: labrinea Differential Revision: https://reviews.llvm.org/D112427	2021-12-06 11:07:10 +00:00
Kazushi (Jam) Marukawa	83f572527e	[VE] Support multiple architectures installation Change C++ header files placement to support multiple LLVM_RUNTIME_TARGETS build. Also modifies regression test for it. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D114527	2021-12-06 19:56:41 +09:00
Jack Andersen	296ebeb808	Test commit to check access.	2021-12-05 14:35:33 -05:00
Nick Desaulniers	9f95bc7dc1	[clang][ARM] relax -mtp=cp15 for non-thumb cases Building -march=armv6k Linux kernels with -mtp=cp15 fails to compile: error: hardware TLS register is not supported for the arm sub-architecture @ardb found docs for ARM1176JZF-S (ARMv6K) that reference hard thread pointer. Relax our ARMv6 check for cases where we're targeting ARM via -marm (vs Thumb1 via -mthumb). This more closely matches the KConfig requirements for where we plan to use these (ie. ARMv6K, ARMv7 (arm or thumb2)). As @peter.smith mentions: on armv5 we can write the instruction to read/write to CP15 C13 with the ThreadID opcode. However on no armv5 implementation will the CP15 C13 have a Thread ID register. The GCC intent seems to be whether the instruction is encodable rather than check what the CPU supports. Link: https://github.com/ClangBuiltLinux/linux/issues/1502 Link: https://developer.arm.com/documentation/ddi0301/h/system-control-coprocessor/system-control-processor-registers/c13--thread-and-process-id-registers Reviewed By: ardb, peter.smith Differential Revision: https://reviews.llvm.org/D114116	2021-12-03 14:00:00 -08:00
Keith Smiley	ace03d0df4	[clang][Darwin] Remove old lld implementation handling This now assumes that for the darwin driver any lld is the "new" macho lld implementation. Differential Revision: https://reviews.llvm.org/D114974	2021-12-02 16:29:26 -08:00
Joseph Huber	96ff74a0d5	[OpenMP] Remove the new runtime default for AMDGPU The new runtime is currently broken for AMD offloading. This patch makes the default the old runtime only for the AMD target. Reviewed By: ronlieb Differential Revision: https://reviews.llvm.org/D114965	2021-12-02 12:35:58 -05:00
Joseph Huber	c99407e31c	[OpenMP] Make the new device runtime the default This patch changes the `-fopenmp-target-new-runtime` option which controls if the new or old device runtime is used to be true by default. Disabling this to use the old runtime now requires using `-fno-openmp-target-new-runtime`. Reviewed By: JonChesterfield, tianshilei1992, gregrodgers, ronlieb Differential Revision: https://reviews.llvm.org/D114890	2021-12-02 11:11:45 -05:00
Ties Stuij	e3b2f0226b	[clang][ARM] PACBTI-M frontend support Handle branch protection option on the commandline as well as a function attribute. One patch for both mechanisms, as they use the same underlying parsing mechanism. These are recorded in a set of LLVM IR module-level attributes like we do for AArch64 PAC/BTI (see https://reviews.llvm.org/D85649): - command-line options are "translated" to module-level LLVM IR attributes (metadata). - functions have PAC/BTI specific attributes iff the __attribute__((target("branch-protection=...))) was used in the function declaration. - command-line option -mbranch-protection to armclang targeting Arm, following this grammar: branch-protection ::= "-mbranch-protection=" <protection> protection ::= "none" \| "standard" \| "bti" [ "+" <pac-ret-clause> ] \| <pac-ret-clause> [ "+" "bti"] pac-ret-clause ::= "pac-ret" [ "+" <pac-ret-option> ] pac-ret-option ::= "leaf" ["+" "b-key"] \| "b-key" ["+" "leaf"] b-key is simply a placeholder to make it consistent with AArch64's version. In Arm, however, it triggers a warning informing that b-key is unsupported and a-key will be selected instead. - Handle _attribute_((target(("branch-protection=..."))) for AArch32 with the same grammer as the commandline options. This patch is part of a series that adds support for the PACBTI-M extension of the Armv8.1-M architecture, as detailed here: https://community.arm.com/arm-community-blogs/b/architectures-and-processors-blog/posts/armv8-1-m-pointer-authentication-and-branch-target-identification-extension The PACBTI-M specification can be found in the Armv8-M Architecture Reference Manual: https://developer.arm.com/documentation/ddi0553/latest The following people contributed to this patch: - Momchil Velikov - Victor Campos - Ties Stuij Reviewed By: vhscampos Differential Revision: https://reviews.llvm.org/D112421	2021-12-01 10:37:16 +00:00
modimo	47f230ba2c	Add toggling for -fnew-infallible/-fno-new-infallible Allow toggling of -fnew-infallible so last instance takes precedence Testing: ninja check-all Reviewed By: bruno Differential Revision: https://reviews.llvm.org/D113523	2021-11-30 17:19:53 -08:00
Nikita Popov	40d5eeac6c	Revert "Use VersionTuple for parsing versions in Triple. This makes it possible to distinguish between "16" and "16.0" after parsing, which previously was not possible." This reverts commit `1e82864670`. llvm/test/Transforms/LoopStrengthReduce/X86/2009-11-10-LSRCrash.ll fails with assertion failure: llc: /home/nikic/llvm-project/llvm/include/llvm/ADT/Optional.h:196: T& llvm::optional_detail::OptionalStorage<T, true>::getValue() & [with T = unsigned int]: Assertion `hasVal' failed. ... #8 0x00005633843af5cb llvm::MCStreamer::emitVersionForTarget(llvm::Triple const&, llvm::VersionTuple const&) #9 0x0000563383b47f14 llvm::AsmPrinter::doInitialization(llvm::Module&)	2021-11-30 18:36:32 +01:00
Paul Robinson	b8e03be88d	[PS4][DWARF] Explicitly set default DWARF version to 4	2021-11-30 08:58:40 -08:00
James Farrell	1e82864670	Use VersionTuple for parsing versions in Triple. This makes it possible to distinguish between "16" and "16.0" after parsing, which previously was not possible. See also https://github.com/android/ndk/issues/1455. Differential Revision: https://reviews.llvm.org/D114163	2021-11-30 15:44:23 +00:00
Patrick Oppenlander	b3163c1cdd	[Driver] Support PowerPC SPE musl dynamic linker name ld-musl-powerpc-sf.so.1 Musl treats PowerPC SPE as a soft-float target (as the PowerPC SPE ABI is soft-float compatible). Reviewed By: jhibbits, MaskRay Differential Revision: https://reviews.llvm.org/D105869	2021-11-28 15:39:55 -08:00
Dimitry Andric	df08b2fe8b	[AArch64] Avoid crashing on invalid -Wa,-march= values As reported in https://bugs.freebsd.org/260078, the gnutls Makefiles pass -Wa,-march=all to compile a number of assembly files. Clang does not support this -march value, but because of a mistake in handling the arguments, an unitialized Arg pointer is dereferenced, which can cause a segfault. Work around this by adding a check if the local WaMArch variable is initialized, and if so, using its value in the diagnostic message. Reviewed By: tschuett Differential Revision: https://reviews.llvm.org/D114677	2021-11-28 22:23:42 +01:00
Quinn Pham	b11c66accf	[NFC] Inclusive language: rename master flag to main flag [NFC] As part of using inclusive language within the llvm project, this patch renames master flag to main flag in these comments. Reviewed By: ZarkoCA Differential Revision: https://reviews.llvm.org/D114090	2021-11-25 15:16:11 -06:00
Timm Bäder	3e67cf21a1	[clang][driver] Add -fplugin-arg- to pass arguments to plugins From GCC's manpage: -fplugin-arg-name-key=value Define an argument called key with a value of value for the plugin called name. Since we don't have a key-value pair similar to gcc's plugin_argument struct, simply accept key=value here anyway and pass it along as-is to plugins. This translates to the already existing '-plugin-arg-pluginname arg' that clang cc1 accepts. There is an ambiguity here because in clang, both the plugin name as well as the option name can contain dashes, so when e.g. passing -fplugin-arg-foo-bar-foo it is not clear whether the plugin is foo-bar and the option is foo, or the plugin is foo and the option is bar-foo. GCC solves this by interpreting all dashes as part of the option name. So dashes can't be part of the plugin name in this case. Differential Revision: https://reviews.llvm.org/D113250	2021-11-25 10:47:55 +01:00
Jan Beich	2dec2aa3ad	[Driver] Default to libc++ on FreeBSD All supported FreeBSD releases use libc++, so default to it if the target's major version is not specified. Reviewed by: dim, emaste Differential Revision: https://reviews.llvm.org/D77776	2021-11-22 16:47:03 -05:00
$Alfredo Dal'\''Ava Junior$ Alfredo Dal'\''Ava Junior	8e2fd879e6	[PowerPC] [Clang] Enable Intel intrinsics support on FreeBSD This enables Intel intrinsics support on FreeBSD. Thanks to @pkubaj who noticed this feature was missing Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D113451	2021-11-22 20:42:10 +00:00
Peter Klausler	996ef895cd	[flang] Add -fno-automatic, refine IsSaved() This legacy option (available in other Fortran compilers with various spellings) implies the SAVE attribute for local variables on subprograms that are not explicitly RECURSIVE. The SAVE attribute essentially implies static rather than stack storage. This was the default setting in Fortran until surprisingly recently, so explicit SAVE statements & attributes could be and often were omitted from older codes. Note that initialized objects already have an implied SAVE attribute, and objects in COMMON effectively do too, as data overlays are extinct; and since objects that are expected to survive from one invocation of a procedure to the next in static storage should probably be explicit initialized in the first place, so the use cases for this option are somewhat rare, and all of them could be handled with explicit SAVE statements or attributes. This implicit SAVE attribute must not apply to automatic (in the Fortran sense) local objects, whose sizes cannot be known at compilation time. To get the semantics of IsSaved() right, the IsAutomatic() predicate was moved into Evaluate/tools.cpp to allow for dynamic linking of the compiler. The redundant predicate IsAutomatic() was noticed, removed, and its uses replaced. GNU Fortran's spelling of the option (-fno-automatic) was added to the clang-based driver and used for basic sanity testing. Differential Revision: https://reviews.llvm.org/D114209	2021-11-22 10:06:38 -08:00
Zarko Todorovski	d8e5a0c42b	[clang][NFC] Inclusive terms: replace some uses of sanity in clang Rewording of comments to avoid using `sanity test, sanity check`. Reviewed By: aaron.ballman, Quuxplusone Differential Revision: https://reviews.llvm.org/D114025	2021-11-19 14:58:35 -05:00
Bradley Smith	26f56438e3	[Clang][SVE] Properly enable/disable dependant SVE target features based upon +(no)sve.* options Co-authored-by: Graham Hunter <graham.hunter@arm.com> Differential Revision: https://reviews.llvm.org/D113776	2021-11-18 15:52:28 +00:00
Douglas Yung	b10562612f	Fix Windows build after commit `49682f1`.	2021-11-18 00:23:22 -08:00
Henry Linjamäki	49682f14bf	[SPIR-V] Add translator tool Add a tool for constructing commands for translating LLVM IR to SPIR-V. Used by HIPSPV tool chain (D110618). Reviewed By: bader Differential Revision: https://reviews.llvm.org/D112404	2021-11-18 03:41:24 +03:00
Kazu Hirata	74115602e8	[clang] Use range-based for loops with llvm::reverse (NFC)	2021-11-17 19:40:48 -08:00
Phoebe Wang	de34a940ae	[X86] Add -mskip-rax-setup support to align with GCC AMD64 ABI mandates caller to specify the number of used SSE registers when passing variable arguments. GCC also provides option -mskip-rax-setup to skip the setup of rax when SSE is disabled. This helps to reduce the code size, see pr23258. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D112413	2021-11-18 11:20:32 +08:00
Fangrui Song	062ef8f6b4	[Driver][Android] Remove unneeded isNoExecStackDefault ld.lld used by Android ignores .note.GNU-stack and defaults to noexecstack, so the `-z noexecstack` linker option is unneeded. The `--noexecstack` assembler option is unneeded because AsmPrinter.cpp prints `.section .note.GNU-stack,"",@progbits` (when `llvm.init.trampoline` is unused), so the assembler won't synthesize an executable .note.GNU-stack. Reviewed By: danalbert Differential Revision: https://reviews.llvm.org/D113840	2021-11-17 18:15:24 -08:00
Nico Weber	ae98182cf7	[clang] Make -masm=intel affect inline asm style With this, void f() { __asm__("mov eax, ebx"); } now compiles with clang with -masm=intel. This matches gcc. The flag is not accepted in clang-cl mode. It has no effect on MSVC-style `__asm {}` blocks, which are unconditionally in intel mode both before and after this change. One difference to gcc is that in clang, inline asm strings are "local" while they're "global" in gcc. Building the following with -masm=intel works with clang, but not with gcc where the ".att_syntax" from the 2nd __asm__() is in effect until file end (or until a ".intel_syntax" somewhere later in the file): __asm__("mov eax, ebx"); __asm__(".att_syntax\nmovl %ebx, %eax"); __asm__("mov eax, ebx"); This also updates clang's intrinsic headers to work both in -masm=att (the default) and -masm=intel modes. The official solution for this according to "Multiple assembler dialects in asm templates" in gcc docs->Extensions->Inline Assembly->Extended Asm is to write every inline asm snippet twice: bt{l %[Offset],%[Base] \| %[Base],%[Offset]} This works in LLVM after D113932 and D113894, so use that. (Just putting `.att_syntax` at the start of the snippet works in some but not all cases: When LLVM interpolates in parameters like `%0`, it uses at&t or intel syntax according to the inline asm snippet's flavor, so the `.att_syntax` within the snippet happens to late: The interpolated-in parameter is already in intel style, and then won't parse in the switched `.att_syntax`.) It might be nice to invent a `#pragma clang asm_dialect push "att"` / `#pragma clang asm_dialect pop` to be able to force asm style per snippet, so that the inline asm string doesn't contain the same code in two variants, but let's leave that for a follow-up. Fixes PR21401 and PR20241. Differential Revision: https://reviews.llvm.org/D113707	2021-11-17 13:41:59 -05:00
Jon Chesterfield	0e738323a9	[openmp][amdgpu] Add comment warning that libm may be broken Using llvm-link to add rocm device-libs probably doesn't work Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D112639	2021-11-15 15:56:01 +00:00
Zarko Todorovski	05f34ffa21	[clang] Inclusive language: change instances of blacklist/whitelist to allowlist/ignorelist Change the error message to use ignorelist, and changed some variable and function names in related code and test. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D113189	2021-11-12 15:46:16 +00:00
Benjamin Kramer	98f80d248d	[Driver] Fix unused variable warning in release builds. NFC.	2021-11-12 00:20:21 +01:00
Yaxun (Sam) Liu	0309e50f33	[Driver] Fix ToolChain::getSanitizerArgs The driver uses class SanitizerArgs to store parsed sanitizer arguments. It keeps a cached SanitizerArgs object in ToolChain and uses it for different jobs. This does not work if the sanitizer options are different for different jobs, which could happen when an offloading toolchain translates the options for different jobs. To fix this, SanitizerArgs should be created by using the actual arguments passed to jobs instead of the original arguments passed to the driver, since the toolchain may change the original arguments. And the sanitizer arguments should be diagnose once. This patch also fixes HIP toolchain for handling -fgpu-sanitize: a warning is emitted for GPU's not supporting sanitizer and skipped. This is for backward compatibility with existing -fsanitize options. -fgpu-sanitize is also turned on by default. Reviewed by: Artem Belevich, Evgenii Stepanov Differential Revision: https://reviews.llvm.org/D111443	2021-11-11 17:17:08 -05:00
Zahira Ammarguellat	f04e387055	Making the code compliant to the documentation about Floating Point support default values for C/C++. FPP-MODEL=PRECISE enables FFP-CONTRACT(FMA is enabled). Fix for https://bugs.llvm.org/show_bug.cgi?id=50222	2021-11-11 07:40:35 -05:00
Fangrui Song	a77d1f68a0	[Driver] Change Linux::isPIEDefault to true for all Android versions Currently any API level>=16 uses default PIE. If API level<16 is too old to be supported, we can clean up some code. Reviewed By: danalbert Differential Revision: https://reviews.llvm.org/D113370	2021-11-11 00:12:07 -08:00
Roland McGrath	ff11f0aa5d	[Clang] Pass -z rel to linker for Fuchsia Fuchsia already supports the more compact relocation format. Make it the default. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D113136	2021-11-10 13:31:22 -08:00
Kostya Serebryany	b7f3a4f4fa	[sancov] add tracing for loads and store add tracing for loads and stores. The primary goal is to have more options for data-flow-guided fuzzing, i.e. use data flow insights to perform better mutations or more agressive corpus expansion. But the feature is general puspose, could be used for other things too. Pipe the flag though clang and clang driver, same as for the other SanitizerCoverage flags. While at it, change some plain arrays into std::array. Tests: clang flags test, LLVM IR test, compiler-rt executable test. Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D113447	2021-11-09 14:35:13 -08:00
Ard Biesheuvel	24772720c5	[ARM] reject -mtp=cp15 if target subarch does not support it Currently, we permit -mtp=cp15 even for targets that don't implement the TLS register. When building for ARMv6 or earlier, this means we emit instructions that will UNDEF at runtime. For Thumb1, passing -mtp=cp15 will trigger an assert in the backend. So let's add some diagnostics to ensure that -mtp=cp15 is only accepted for ARMv6T2 or newer. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D113026	2021-11-09 18:29:30 +01:00
Ard Biesheuvel	a19da876ab	[ARM] implement support for TLS register based stack protector Implement support for loading the stack canary from a memory location held in the TLS register, with an optional offset applied. This is used by the Linux kernel to implement per-task stack canaries, which is impossible on SMP systems when using a global variable for the stack canary. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D112768	2021-11-09 18:19:47 +01:00
Carlos Galvez	7ecec3f0f5	[CUDA] Bump supported CUDA version to 11.5 Differential Revision: https://reviews.llvm.org/D113249	2021-11-09 08:20:53 +00:00
Aaron Ballman	190bde404c	Revert "Making the code compliant to the documentation about Floating Point" This reverts commit `438437cbb6`. There are still broken bots from this: https://lab.llvm.org/buildbot/#/builders/188/builds/5495 https://lab.llvm.org/buildbot/#/builders/171/builds/5710	2021-11-08 11:43:49 -05:00
Zahira Ammarguellat	438437cbb6	Making the code compliant to the documentation about Floating Point support default values for C/C++. FPP-MODEL=PRECISE enables FFP-CONTRACT FMA is enabled. Fix for https://bugs.llvm.org/show_bug.cgi?id=50222	2021-11-08 08:35:19 -05:00
Anastasia Stulova	a10a69fe9c	[SPIR-V] Add SPIR-V triple and clang target info. Add new triple and target info for ‘spirv32’ and ‘spirv64’ and, thus, enabling clang (LLVM IR) code emission to SPIR-V target. The target for SPIR-V is mostly reused from SPIR by derivation from a common base class since IR output for SPIR-V is mostly the same as SPIR. Some refactoring are made accordingly. Added and updated tests for parts that are different between SPIR and SPIR-V. Patch by linjamaki (Henry Linjamäki)! Differential Revision: https://reviews.llvm.org/D109144	2021-11-08 13:34:10 +00:00
Nico Weber	0425087b8b	Revert "Making the code compliant to the documentation about Floating Point" This reverts commit `17d9560294`. Breaks check-clang everywhere, see e.g.: https://lab.llvm.org/buildbot/#/builders/105/builds/17229 https://lab.llvm.org/buildbot/#/builders/109/builds/25831 https://lab.llvm.org/buildbot/#/builders/188/builds/5493 https://lab.llvm.org/buildbot/#/builders/123/builds/7073	2021-11-08 08:32:42 -05:00
Zahira Ammarguellat	17d9560294	Making the code compliant to the documentation about Floating Point support default values for C/C++. FPP-MODEL=PRECISE enables FFP-CONTRACT FMA is enabled. Fix for https://bugs.llvm.org/show_bug.cgi?id=50222	2021-11-08 07:51:29 -05:00
Benjamin Kramer	2e20ff8c1a	[AVR] Remove a global initializer. NFCI.	2021-11-07 16:30:18 +01:00
Zarko Todorovski	a83a6c22e6	[clang] [Objective C] Inclusive language: use objcmt-allowlist-dir-path=<arg> instead of objcmt-white-list-dir-path=<arg> Trying to update some options that don't at least have an inclusive language version. This patch adds `objcmt-allowlist-dir-path` as a default alternative. Reviewed By: akyrtzi Differential Revision: https://reviews.llvm.org/D112591	2021-11-05 12:27:05 -04:00
Kazushi (Jam) Marukawa	3d32218d1a	[VE] Change to omitting the frame pointer on leaf functions Change to omitting the frame pointer on leaf functions by default for VE. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D113087	2021-11-03 17:45:18 +09:00
Yaxun (Sam) Liu	60a085beb0	Revert "[clang] deprecate frelaxed-template-template-args, make it on by default" This reverts commit `2d7fba5f95`. The patch was reverted because it caused regression with rocThrust due to ambiguity of template specialization. For details please see https://reviews.llvm.org/D109496	2021-11-02 17:02:19 -04:00
Duncan P. N. Exon Smith	9902362701	Support: Use sys::path::is_style_{posix,windows}() in a few places Use the new sys::path::is_style_posix() and is_style_windows() in a few places that need to detect the system's native path style. In llvm/lib/Support/Path.cpp, this patch removes most uses of the private `real_style()`, where is_style_posix() and is_style_windows() are just a little tidier. Elsewhere, this removes `_WIN32` macro checks. Added a FIXME to a FileManagerTest that seemed fishy, but maintained the existing behaviour. Differential Revision: https://reviews.llvm.org/D112289	2021-10-29 12:09:41 -07:00
Zarko Todorovski	c001775a3a	[clang] Inclusive language: change error message to use allowlist Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D112627	2021-10-29 13:12:46 -04:00
Keith Smiley	bd8a9507ef	[clang][driver] Fix multiarch output name with -Wl arg Previously if you passed a `-Wl,-foo` _before_ the source filename, the first `InputInfos`, which is used for the base input name would be an `InputArg` kind, which would never have a base input name. Now we use that by default, but pick the first `InputInfo` that is of kind `Filename` to get the name from if there is one. Differential Revision: https://reviews.llvm.org/D112767	2021-10-29 10:09:38 -07:00
Martin Storsjö	d758069f5e	[clang] [MinGW] Guess the right ix86 arch name spelling as sysroot For x86, most contempory mingw toolchains use i686 as 32 bit x86 arch target. As long as the target triple is set to the right form, this works fine, either as the compiler's default target, or via e.g. a triple prefix like i686-w64-mingw32-clang. However, if the unprefixed toolchain targets x86_64, but the user tries to switch it to target 32 bit by adding the -m32 option, the computeTargetTriple function in Clang, together with Triple::get32BitArchVariant, sets the arch to i386. This causes the right sysroot to not be found. When targeting an arch where there are potential spelling ambiguities with respect to the sysroots (i386 and arm), check if the driver can find a sysroot with the arch name - if not, try a couple other candidates. Differential Revision: https://reviews.llvm.org/D111952	2021-10-29 09:32:36 +03:00
Alex Lorenz	3d0d7d8c5b	[clang][driver][darwin] support -target with Mac Catalyst triple without OS version Some users might omit the version and assume the compiler will target the initial Mac Catalyst version.	2021-10-28 18:46:10 -07:00
Jon Chesterfield	4d50803ce4	[libomptarget] Build DeviceRTL for amdgpu Passes same tests as the current deviceRTL. Includes cmake change from D111987. CI is showing a different set of pass/fails to local, committing this without the tests enabled by default while debugging that difference. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D112227	2021-10-28 12:34:01 +01:00
Martin Storsjö	897c86dec5	[clang] [MinGW] Rename the 'Arch' member to 'SubdirName'. NFC. This string isn't a plain architecture name, but contains the whole subdir name used for the sysroot, which often is equal to the target triple. Differential Revision: https://reviews.llvm.org/D112387	2021-10-28 10:26:54 +03:00
YunQiang Su	284c2ebc5e	[clang][MIPS] Fix search path for Debian multilib O32 In the situation of multilib, the gcc objects are in a /32 directory. On Debian, the libraries is under /libo32 to avoid confliction. This patch enables clang find gcc in /32, and C lib in /libo32. Differential Revision: https://reviews.llvm.org/D112158	2021-10-28 10:23:06 +03:00
Jon Chesterfield	6c7b203d1d	Revert "[libomptarget] Build DeviceRTL for amdgpu" - more tests failing on CI than failed locally when writing this patch This reverts commit `33427fdb7b`.	2021-10-28 01:01:53 +01:00
Jon Chesterfield	33427fdb7b	[libomptarget] Build DeviceRTL for amdgpu Passes same tests as the current deviceRTL. Includes cmake change from D111987. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D112227	2021-10-28 00:41:45 +01:00
Matheus Izvekov	2d7fba5f95	[clang] deprecate frelaxed-template-template-args, make it on by default A resolution to the ambiguity issues created by P0522, which is a DR solving CWG 150, did not come as expected, so we are just going to accept the change, and watch how users digest it. For now we deprecate the flag with a warning, and make it on by default. We don't remove the flag completely in order to give users a chance to work around any problems by disabling it. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D109496	2021-10-27 22:48:27 +02:00
Alexandros Lamprineas	8689f5e6e7	[AArch64] Add support for the 'R' architecture profile. This change introduces subtarget features to predicate certain instructions and system registers that are available only on 'A' profile targets. Those features are not present when targeting a generic CPU, which is the default processor. In other words the generic CPU now means the intersection of 'A' and 'R' profiles. To maintain backwards compatibility we enable the features that correspond to -march=armv8-a when the architecture is not explicitly specified on the command line. References: https://developer.arm.com/documentation/ddi0600/latest Differential Revision: https://reviews.llvm.org/D110065	2021-10-27 12:32:30 +01:00
Kazu Hirata	16ceb44e62	[clang] Use llvm::{count,count_if,find_if,all_of,none_of} (NFC)	2021-10-25 09:14:45 -07:00
Bradley Smith	0ce46a1d43	[AArch64][Driver][SVE] Allow -msve-vector-bits=<n>+ syntax to mean no maximum vscale This patch splits the existing SveVectorBits LangOpt into VScaleMin and VScaleMax LangOpts such that we can represent such an option. The cc1 option has also been split into -mvscale-{min,max}=<n> options so that the cc1 arguments better reflect the vscale_range IR attribute. Differential Revision: https://reviews.llvm.org/D111790	2021-10-25 11:10:52 +00:00
Kazu Hirata	4bd46501c3	Use llvm::any_of and llvm::none_of (NFC)	2021-10-24 17:35:33 -07:00
Kazu Hirata	7cc8fa2dd2	Use llvm::is_contained (NFC)	2021-10-24 09:32:57 -07:00
Sylvestre Ledru	a709787cd9	Add support of the next Ubuntu (Ubuntu 22.04 - Jammy Jellyfish) It is going to be a LTS release	2021-10-23 23:55:50 +02:00
Kazu Hirata	d8e4170b0a	Ensure newlines at the end of files (NFC)	2021-10-23 08:45:29 -07:00
Kristof Beyls	3b93dc6880	Add basic aarch64-none-elf bare metal driver. Differential Revision: https://reviews.llvm.org/D111134	2021-10-22 08:06:17 +01:00
Arthur Eubanks	19b07ec000	Reland [clang] Pass -clear-ast-before-backend in Clang::ConstructJob() This clears the memory used for the Clang AST before we run LLVM passes. https://llvm-compile-time-tracker.com/compare.php?from=d0a5f61c4f6fccec87fd5207e3fcd9502dd59854&to=b7437fee79e04464dd968e1a29185495f3590481&stat=max-rss shows significant memory savings with no slowdown (in fact -O0 slightly speeds up). For more background, see https://lists.llvm.org/pipermail/cfe-dev/2021-September/068930.html. Turn this off for the interpreter since it does codegen multiple times. Relanding with fix for -print-stats: D111973 Relanding with fix for plugins: D112190 If you'd like to use this even with plugins, consider using the features introduced in D112096. This can be turned off with -Xclang -no-clear-ast-before-backend. Differential Revision: https://reviews.llvm.org/D111270	2021-10-21 09:25:53 -07:00
Brad Smith	34188f237f	[Driver][OpenBSD] Some improvements to the external assembler handling - Pass CPU variant for ARM - Pass MIPS CPU in addition to the ABI	2021-10-20 21:05:14 -04:00
Fangrui Song	922bf57fc8	[Driver][Gnu] Delete unneeded -Bstatic dispatch for arm/thumb Historically -static and -Bstatic are synonym. gold made the semantics of -static slightly stronger but that does not matter.	2021-10-19 15:24:07 -07:00
Keith Smiley	17386cb4dc	[clang][Driver] Make multiarch output file basenames reproducible When building a multiarch MachO binary, previously the intermediate output file names would contain random characters. On macOS this filename, since it's used when linking, ended up being used as a stable-ish identifier for the adhoc codesignature of the binary, leading to non-reproducible binaries. This change uses the architecture, when available, to create a stable, but unique, basename for the file. Differential Revision: https://reviews.llvm.org/D111269	2021-10-19 13:49:47 -07:00
Volodymyr Sapsai	91e19f66e5	[driver] Explicitly specify `-fbuild-session-timestamp` in seconds. Representation of the file's last modification time depends on the file system and isn't guaranteed to be in seconds. Cast to seconds explicitly and tighten the test case to check the magnitude of the calculated value, so we can catch passing milliseconds or nanoseconds. rdar://83915615 Differential Revision: https://reviews.llvm.org/D111205	2021-10-19 13:30:26 -07:00
Zequan Wu	57553ce432	Revert "Reland [clang] Pass -clear-ast-before-backend in Clang::ConstructJob()" This reverts commit `1fb24fe85a`. This causes clang crash on chromium. See repro at https://bugs.chromium.org/p/chromium/issues/detail?id=1261551#c1.	2021-10-19 12:39:34 -07:00
Kazu Hirata	cf68e1b2fb	[Driver, Frontend] Use StringRef::contains (NFC)	2021-10-19 08:54:02 -07:00
David Sherwood	607fb1bb8c	[AArch64] Always add -tune-cpu argument to -cc1 driver This patch ensures that we always tune for a given CPU on AArch64 targets when the user specifies the "-mtune=xyz" flag. In the AArch64Subtarget if the tune flag is unset we use the CPU value instead. I've updated the release notes here: llvm/docs/ReleaseNotes.rst and added tests here: clang/test/Driver/aarch64-mtune.c Differential Revision: https://reviews.llvm.org/D110258	2021-10-19 14:57:51 +01:00
Matt Morehouse	e1e2635327	[HWASan] Use tagged-globals feature on x86. Allows us to use the small code model when we disable relocation relaxation. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D111344	2021-10-19 05:56:50 -07:00
Fangrui Song	408e6de8c0	[Driver][Gnu] Support -shared -static: pass -shared to ld and use crtbeginS.o This mode never works (mismatching crtbeginT.o and crtendS.o) and probably unsupported by GCC on glibc based Linux distro (incorrect crtbeginT.o causes linker error) but makes sense (-shared means building a shared object, -static means avoid shared object dependencies) and can be used on musl based Linux distro. mingw supports this mode as well.	2021-10-19 01:09:41 -07:00
Anshil Gandhi	0567f03331	[HIP] [AlwaysInliner] Disable AlwaysInliner to eliminate undefined symbols By default clang emits complete contructors as alias of base constructors if they are the same. The backend is supposed to emit symbols for the alias, otherwise it causes undefined symbols. @yaxunl observed that this issue is related to the llvm options `-amdgpu-early-inline-all=true` and `-amdgpu-function-calls=false`. This issue is resolved by only inlining global values with internal linkage. The `getCalleeFunction()` in AMDGPUResourceUsageAnalysis also had to be extended to support aliases to functions. inline-calls.ll was corrected appropriately. Reviewed By: yaxunl, #amdgpu Differential Revision: https://reviews.llvm.org/D109707	2021-10-18 16:53:15 -06:00
Craig Topper	1053e0b27c	[RISCV] Use a lambda to avoid having the Support library depend on Option library. RISCVISAInfo::toFeatures needs to allocate strings using ArgList::MakeArgString, but toFeatures lives in Support and MakeArgString lives in Option. toFeature only has one caller, so the simple fix is to have that caller pass a lamdba that wraps MakeArgString to break the dependency. Differential Revision: https://reviews.llvm.org/D112032	2021-10-18 13:39:37 -07:00
Arthur Eubanks	1fb24fe85a	Reland [clang] Pass -clear-ast-before-backend in Clang::ConstructJob() This clears the memory used for the Clang AST before we run LLVM passes. https://llvm-compile-time-tracker.com/compare.php?from=d0a5f61c4f6fccec87fd5207e3fcd9502dd59854&to=b7437fee79e04464dd968e1a29185495f3590481&stat=max-rss shows significant memory savings with no slowdown (in fact -O0 slightly speeds up). For more background, see https://lists.llvm.org/pipermail/cfe-dev/2021-September/068930.html. Turn this off for the interpreter since it does codegen multiple times. Relanding with fix for -print-stats: D111973 Differential Revision: https://reviews.llvm.org/D111270	2021-10-18 09:08:16 -07:00
Kazu Hirata	d245f2e859	[clang] Use llvm::erase_if (NFC)	2021-10-17 13:50:29 -07:00
Kito Cheng	8efa6512e0	[RISCV][NFC] Fix build error	2021-10-17 16:38:53 +08:00
Kito Cheng	ff13189c5d	[RISCV] Unify the arch string parsing logic to to RISCVISAInfo. How many place you need to modify when implementing a new extension for RISC-V? At least 7 places as I know: - Add new SubtargetFeature at RISCV.td - -march parser in RISCV.cpp - RISCVTargetInfo::initFeatureMap@RISCV.cpp for handling feature vector. - RISCVTargetInfo::getTargetDefines@RISCV.cpp for pre-define marco. - Arch string parser for ELF attribute in RISCVAsmParser.cpp - ELF attribute emittion in RISCVAsmParser.cpp, and make sure it's in canonical order... - ELF attribute emittion in RISCVTargetStreamer.cpp, and again, must in canonical order... And now, this patch provide an unified infrastructure for handling (almost) everything of RISC-V arch string. After this patch, you only need to update 2 places for implement an extension for RISC-V: - Add new SubtargetFeature at RISCV.td, hmmm, it's hard to avoid. - Add new entry to RISCVSupportedExtension@RISCVISAInfo.cpp or SupportedExperimentalExtensions@RISCVISAInfo.cpp . Most codes are come from existing -march parser, but with few new feature/bug fixes: - Accept version for -march, e.g. -march=rv32i2p0. - Reject version info with `p` but without minor version number like `rv32i2p`. Differential Revision: https://reviews.llvm.org/D105168	2021-10-17 16:25:23 +08:00
Arthur Eubanks	49562d3dfe	Revert "[clang] Pass -clear-ast-before-backend in Clang::ConstructJob()" This reverts commit `47eb99aa44`. This causes crashes with -print-stats: PR52193.	2021-10-16 12:05:41 -07:00
Anshil Gandhi	1830ec94ac	Revert "[HIP] [AlwaysInliner] Disable AlwaysInliner to eliminate undefined symbols" This reverts commit `03375a3fb3`.	2021-10-15 16:16:18 -06:00
Anshil Gandhi	03375a3fb3	[HIP] [AlwaysInliner] Disable AlwaysInliner to eliminate undefined symbols By default clang emits complete contructors as alias of base constructors if they are the same. The backend is supposed to emit symbols for the alias, otherwise it causes undefined symbols. @yaxunl observed that this issue is related to the llvm options `-amdgpu-early-inline-all=true` and `-amdgpu-function-calls=false`. This issue is resolved by only inlining global values with internal linkage. The `getCalleeFunction()` in AMDGPUResourceUsageAnalysis also had to be extended to support aliases to functions. inline-calls.ll was corrected appropriately. Reviewed By: yaxunl, #amdgpu Differential Revision: https://reviews.llvm.org/D109707	2021-10-15 11:39:15 -06:00
Arthur Eubanks	47eb99aa44	[clang] Pass -clear-ast-before-backend in Clang::ConstructJob() This clears the memory used for the Clang AST before we run LLVM passes. https://llvm-compile-time-tracker.com/compare.php?from=d0a5f61c4f6fccec87fd5207e3fcd9502dd59854&to=b7437fee79e04464dd968e1a29185495f3590481&stat=max-rss shows significant memory savings with no slowdown (in fact -O0 slightly speeds up). For more background, see https://lists.llvm.org/pipermail/cfe-dev/2021-September/068930.html. Turn this off for the interpreter since it does codegen multiple times. Differential Revision: https://reviews.llvm.org/D111270	2021-10-15 10:13:17 -07:00
Frederic Cambus	ecef035953	[Driver][NetBSD] Use Triple reference instead of ToolChain.getTriple(). Differential Revision: https://reviews.llvm.org/D111805	2021-10-15 16:36:19 +02:00
Frederic Cambus	8ecbcd058f	[Driver][Darwin] Use T reference instead of getToolChain().getTriple(). Differential Revision: https://reviews.llvm.org/D111793	2021-10-14 21:30:39 +02:00
Frederic Cambus	f7a3214306	[Driver][WebAssembly] Use ToolChain reference instead of getToolChain(). Differential Revision: https://reviews.llvm.org/D111786	2021-10-14 19:43:59 +02:00
Craig Topper	f7ba572483	[RISCV] Update Zba, Zbb, Zbc, and Zbs version from 0.93 to 1.0. I've removed the Zbs W instructions that are not part of the frozen spec. References to B as an extension name have been removed. Tests are updated or split accordingly. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D110669	2021-10-14 09:25:03 -07:00
Martin Storsjö	b541845ea0	[clang] [Windows] Mark PIC as implicitly enabled for aarch64, just like for x86_64 This doesn't practically affect the code generation. Differential Revision: https://reviews.llvm.org/D111707	2021-10-13 22:55:00 +03:00
Kazu Hirata	57b40b5f34	[AST, CodeGen, Driver] Use llvm::is_contained (NFC)	2021-10-12 09:19:49 -07:00
Saiyedul Islam	f56548829c	[Clang][clang-nvlink-wrapper] Pass nvlink path to the wrapper Added support of a "--nvlink-path" option in clang-nvlink-wrapper which takes the path of nvlink binary. Static Device Library support for OpenMP (D105191) now searches for nvlink binary and passes its location via this option. In absence of this option, nvlink binary is searched in locations in PATH. Differential Revision: https://reviews.llvm.org/D111488	2021-10-12 16:15:52 +00:00
Haowei Wu	998e067a0a	Reland "[clang][Fuchsia] Support availability attr on Fuchsia" This reland commit `1131b1eb35`, which adds support to __attribute__((availability)) annotation for Fuchsia platform. This patch also adds '-ffuchsia-api-level' to allow specify Fuchsia API level from the command line. Differential Revision: https://reviews.llvm.org/D108592	2021-10-11 18:41:29 -07:00
Haowei Wu	b5e8348bf2	Revert "[clang][Fuchsia] Support availability attr on Fuchsia" This reverts commit `1131b1eb35`, which breaks several llvm bots.	2021-10-11 17:32:38 -07:00
Haowei Wu	1131b1eb35	[clang][Fuchsia] Support availability attr on Fuchsia This patch adds support to __attribute__((availability)) annotation for Fuchsia platform. This patch also adds '-ffuchsia-api-level' to allow specify Fuchsia API level from the command line. Differential Revision: https://reviews.llvm.org/D108592	2021-10-11 15:33:04 -07:00
Victor Campos	3550e242fa	[Clang][ARM][AArch64] Add support for Armv9-A, Armv9.1-A and Armv9.2-A armv9-a, armv9.1-a and armv9.2-a can be targeted using the -march option both in ARM and AArch64. - Armv9-A maps to Armv8.5-A. - Armv9.1-A maps to Armv8.6-A. - Armv9.2-A maps to Armv8.7-A. - The SVE2 extension is enabled by default on these architectures. - The cryptographic extensions are disabled by default on these architectures. The Armv9-A architecture is described in the Arm® Architecture Reference Manual Supplement Armv9, for Armv9-A architecture profile (https://developer.arm.com/documentation/ddi0608/latest). Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D109517	2021-10-11 17:44:09 +01:00
Frederic Cambus	6417260a57	[Driver][OpenBSD] Use ToolChain reference instead of getToolChain(). Differential Revision: https://reviews.llvm.org/D111462	2021-10-09 13:21:39 +02:00
Reid Kleckner	955dc3449a	Fix TargetRegistry shlib build, clang edition	2021-10-08 15:43:56 -07:00
Reid Kleckner	89b57061f7	Move TargetRegistry.(h\|cpp) from Support to MC This moves the registry higher in the LLVM library dependency stack. Every client of the target registry needs to link against MC anyway to actually use the target, so we might as well move this out of Support. This allows us to ensure that Support doesn't have includes from MC/*. Differential Revision: https://reviews.llvm.org/D111454	2021-10-08 14:51:48 -07:00
Masoud Ataei	b0f68791f0	[clang] Option control afn flag Clang option to set/unset afn fast-math flag. Differential: https://reviews.llvm.org/D106191 Reviewd with: aaron.ballman, erichkeane, and others	2021-10-08 14:26:14 -04:00
Saiyedul Islam	35ebe4cc24	[Clang][OpenMP] Add partial support for Static Device Libraries An archive containing device code object files can be passed to clang command line for linking. For each given offload target it creates a device specific archives which is either passed to llvm-link if the target is amdgpu, or to clang-nvlink-wrapper if the target is nvptx. -L/-l flags are used to specify these fat archives on the command line. E.g. clang++ -fopenmp -fopenmp-targets=nvptx64 main.cpp -L. -lmylib It currently doesn't support linking an archive directly, like: clang++ -fopenmp -fopenmp-targets=nvptx64 main.cpp libmylib.a Linking with x86 offload also does not work. Reviewed By: ye-luo Differential Revision: https://reviews.llvm.org/D105191	2021-10-08 09:37:51 +00:00
Frederic Cambus	1f90b365bd	[Driver][NetBSD] Use ToolChain reference instead of getToolChain(). Differential Revision: https://reviews.llvm.org/D111340	2021-10-08 11:13:22 +02:00
Craig Topper	f2ad8c9dc6	[RISCV] Remove experimental-b extension that includes all Zb* extensions At this point it looks like a B extension will never exist. Instead Zba, Zbb, Zbc, and Zbs are individual extensions being ratified together as a package. Unknown at this time when or if the other Zb* extensions will be ratified. This patch removes references to the B extension. I've updated and split tests accordingly. This has been split from D110669 to make review a little easier. Differential Revision: https://reviews.llvm.org/D111338	2021-10-07 20:47:17 -07:00
Joseph Huber	9efdca87c7	[OpenMP] Introduce new flags to assert thread and team usage in the runtime This patch adds two flags to be supported for the new runtime. The flags are `-fopenmp-assume-threads-oversubscription` and -fopenmp-assume-teams-oversubscription`. These add global values that can be checked by the work sharing runtime functions to make better judgements about how to distribute work between the threads. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D111348	2021-10-07 22:23:09 -04:00
Saiyedul Islam	94e2b0258a	Revert "[Clang][OpenMP] Add partial support for Static Device Libraries" This reverts commit `4c41170895`.	2021-10-07 14:13:24 +00:00
Saiyedul Islam	3eb44f4d28	Revert "[Clang][OpenMP] Fix windows buildbot failure for D105191" This reverts commit `06404d5488`.	2021-10-07 14:13:24 +00:00
Saiyedul Islam	06404d5488	[Clang][OpenMP] Fix windows buildbot failure for D105191 Fixes `4c41170895`.	2021-10-07 05:54:56 +00:00
Saiyedul Islam	4c41170895	[Clang][OpenMP] Add partial support for Static Device Libraries An archive containing device code object files can be passed to clang command line for linking. For each given offload target it creates a device specific archives which is either passed to llvm-link if the target is amdgpu, or to clang-nvlink-wrapper if the target is nvptx. -L/-l flags are used to specify these fat archives on the command line. E.g. clang++ -fopenmp -fopenmp-targets=nvptx64 main.cpp -L. -lmylib It currently doesn't support linking an archive directly, like: clang++ -fopenmp -fopenmp-targets=nvptx64 main.cpp libmylib.a Linking with x86 offload also does not work. Reviewed By: ye-luo Differential Revision: https://reviews.llvm.org/D105191	2021-10-07 04:45:19 +00:00
Jinsong Ji	9c31969e8d	[AIX] Don't pass namedsects in LTO mode LTO don't need binder option , don't pass it in LTO mode. Reviewed By: Whitney Differential Revision: https://reviews.llvm.org/D110955	2021-10-01 19:22:40 +00:00
Craig Topper	a21c557955	[RISCV] Remove Zbproposedc extension This consists of 3 compressed instructions, c.not, c.neg, and c.zext.w. I believe these have been picked up by the Zce effort using different encodings. I don't think it makes sense to keep them in bitmanip. It will eventually cause a conflict if/when Zce is implemented in llvm. Differential Revision: https://reviews.llvm.org/D110871	2021-09-30 14:23:05 -07:00
Jinsong Ji	2443320d68	[AIX] Rename binder option for PGO support Update the binder option.	2021-09-30 19:58:42 +00:00
Nico Weber	e31899c708	Reland "[clang-cl] Accept `#pragma warning(disable : N)` for some N" This reverts commit `0cd9d8a48b` and adds the changes described in https://reviews.llvm.org/D110668#3034461.	2021-09-30 15:03:23 -04:00
Nico Weber	8dfbe9b0ae	[clang] Make crash reproducer work with clang-cl When clang crashes, it writes a standalone source file and shell script to reproduce the crash. The Driver used to set `Mode = CPPMode` in generateCompilationDiagnostics() to force preprocessing mode. This has the side effect of making IsCLMode() return false, which in turn meant Clang::AddClangCLArgs() didn't get called when creating the standalone source file, which meant the stand-alone file was preprocessed with the gcc driver's defaults In particular, exceptions default to on with the gcc driver, but to off with the cl driver. The .sh script did use the original command line, so in the reproducer for a clang-cl crash, the standalone source file could contain exception-using code after preprocessing that the compiler invocation in the shell script would then complain about. This patch removes the `Mode = CPPMode;` line and instead additionally checks for `CCGenDiagnostics` in most places that check `CCCIsCPP(). This also matches the strategy Clang::ConstructJob() uses to add -frewrite-includes for creating the standalone source file for a crash report. Fixes PR52007. Differential Revision: https://reviews.llvm.org/D110783	2021-09-30 14:33:14 -04:00
Nico Weber	fa32fd3bf7	[clang] Remove duplication in types::getCompilationPhases() Call Driver::getFinalPhase() instead of duplicating it. https://reviews.llvm.org/D65993 added the duplication, then `02e35832c3` maded it more obviously a copy of getFinalPhase(). The only difference is that getCompilationPhases() used to use LastPhase / IfsMerge where getFinalPhase() used Link. Adapt getFinalPhase() to return IfsMerge when needed. No intentional behavior change. Differential Revision: https://reviews.llvm.org/D110770	2021-09-30 14:17:14 -04:00
Amy Huang	0cd9d8a48b	Revert "[clang-cl] Accept `#pragma warning(disable : N)` for some N" because it causes `error: error reading '/wd4091'` errors in compiler-rt builds.	2021-09-29 18:46:55 -07:00
Nico Weber	2240deb976	[clang] Minor cleanups after `b2de52bec`	2021-09-29 14:28:13 -04:00
Nico Weber	b2de52bec1	[clang-cl] Accept `#pragma warning(disable : N)` for some N clang-cl maps /wdNNNN to -Wno-flags for a few warnings that map cleanly from cl.exe concepts to clang concepts. This patch adds support for the same numbers to `#pragma warning(disable : NNNN)`. It also lets `#pragma warning(push)` and `#pragma warning(pop)` have an effect, since these are used together with `warning(disable)`. The optional numeric argument to `warning(push)` is ignored, as are the other non-`disable` `pragma warning()` arguments. (Supporting `error` would be easy, but we also don't support `/we`, and those should probably be added together.) The motivating example is that a bunch of code (including in LLVM) uses this idiom to locally disable warnings about calls to deprecated functions in Windows-only code, and 4996 maps nicely to -Wno-deprecated-declarations: #pragma warning(push) #pragma warning(disable: 4996) f(); #pragma warning(pop) Implementation-wise: - Move `/wd` flag handling from Options.td to actual Driver-level code - Extract the function mapping cl.exe IDs to warning groups to the new file clang/lib/Basic/CLWarnings.cpp - Create a diag::Group enum so that CLWarnings.cpp can refer to existing groups by ID (and give DllexportExplicitInstantiationDecl a named group), and add a function to map a diag::Group to the spelling of it's associated commandline flag - Call that new function from PragmaWarningHandler Differential Revision: https://reviews.llvm.org/D110668	2021-09-29 13:14:23 -04:00
Jinsong Ji	1e48951c73	[AIX] Enable PGO without LTO On AIX, we relied on LTO to merge the csects for profiling data/counter sections. AIX binder now get the namedcsect support to support the merging, so now we can enable PGO without LTO with the new binder. Reviewed By: Whitney Differential Revision: https://reviews.llvm.org/D110671	2021-09-29 02:00:11 +00:00
Artem Belevich	fd582eeffe	[CUDA] Move CUDA SDK include path further down the include search path. This allows clang to work on Linux distributions like Debian where <CUDA-PATH>/include may be a symlink to /usr/include. We only need `cuda_wrappers` to be present before the standard C++ library headers. The CUDA SDK headers themselves do not need to be found that early. This addresses https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=995122 mentioned in post-commit comments on D108247 Differential Revision: https://reviews.llvm.org/D110596	2021-09-28 11:29:28 -07:00
Fangrui Song	75f0194d3d	[Driver] Remove confusing *-linux-android detection with non-android --target= These values allow, for example, `--target=aarch64` and `--target=aarch64-linux-gnu` to detect `aarch64-linux-android`. This is confusing. Users should specify `--target=aarch64-linux-android` to get Android GCC installation. Reverts D53463. Reviewed By: nickdesaulniers, danalbert Differential Revision: https://reviews.llvm.org/D110379	2021-09-27 13:28:40 -07:00
Yaxun (Sam) Liu	c4afb5f81b	[HIP] Fix linking of asanrt.bc HIP currently uses -mlink-builtin-bitcode to link all bitcode libraries, which changes the linkage of functions to be internal once they are linked in. This works for common bitcode libraries since these functions are not intended to be exposed for external callers. However, the functions in the sanitizer bitcode library is intended to be called by instructions generated by the sanitizer pass. If their linkage is changed to internal, their parameters may be altered by optimizations before the sanitizer pass, which renders them unusable by the sanitizer pass. To fix this issue, HIP toolchain links the sanitizer bitcode library with -mlink-bitcode-file, which does not change the linkage. A struct BitCodeLibraryInfo is introduced in ToolChain as a generic approach to pass the bitcode library information between ToolChain and Tool. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D110304	2021-09-27 13:25:46 -04:00
Nico Weber	63bb2d585e	[clang] Put original flags on 'Driver args:' crash report line We used to put the canonical spelling of flags after alias processing on that line. For clang-cl in particular, that meant that we put flags on that line that the clang-cl driver doesn't even accept, and the "Driver args:" line wasn't usable. Differential Revision: https://reviews.llvm.org/D110458	2021-09-27 10:24:46 -04:00
Nico Weber	6ece82e900	Revert "[Driver] Correctly handle static C++ standard library" This reverts commit `03142c5f67`. Breaks check-asan if system ld doesn't support --push-state, even if lld was built and is used according to lit's output. See comments on https://reviews.llvm.org/D110128	2021-09-24 18:44:53 -04:00
Petr Hosek	03142c5f67	[Driver] Correctly handle static C++ standard library When statically linking C++ standard library, we shouldn't add -Bdynamic after including the library on the link line because that might override user settings like -static and -static-pie. Rather, we should surround the library with --push-state/--pop-state to make sure that -Bstatic only applies to C++ standard library and nothing else. This has been supported since GNU ld 2.25 (2014) so backwards compatibility should no longer be a concern. Differential Revision: https://reviews.llvm.org/D110128	2021-09-24 00:40:16 -07:00
Fangrui Song	afab3c488f	[Driver] Default Generic_GCC x86 to -fasynchronous-unwind-tables to match GCC and Clang's own x86-64.	2021-09-23 19:39:50 -07:00
Fangrui Song	7647a8413b	Fix -fno-unwind-tables -fasynchronous-unwind-tables to emit unwind tables This matches GCC. Change the CC1 option to encode the unwind table level (1: needed by exceptions, 2: asynchronous) so that we can support two modes in the future.	2021-09-23 16:15:40 -07:00
Hongtao Yu	e9d1a679a1	[CSSPGO] Do not pass -fpseudo-probe-for-profiling to the linker. The correponding linker switch has been removed by https://reviews.llvm.org/D110209, so do not pass it in clang. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D110371	2021-09-23 15:50:40 -07:00
Petr Hosek	904ca7d2ed	Revert "[Driver] Correctly handle static C++ standard library" This reverts commit `5e28c892d0` as the linker on the clang-ppc64le-rhel bot doesn't seem to support --push-state/--pop-state.	2021-09-23 01:13:10 -07:00
Petr Hosek	5e28c892d0	[Driver] Correctly handle static C++ standard library When statically linking C++ standard library, we shouldn't add -Bdynamic after including the library on the link line because that might override user settings like -static and -static-pie. Rather, we should surround the library with --push-state/--pop-state to make sure that -Bstatic only applies to C++ standard library and nothing else. This has been supported since GNU ld 2.25 (2014) so backwards compatibility should no longer be a concern. Differential Revision: https://reviews.llvm.org/D110128	2021-09-23 01:00:11 -07:00
David Blaikie	38c09ea2d2	DebugInfo: Add (initially no-op) -gsimple-template-names={simple,mangled} This is to build the foundation of a new debug info feature to use only the base name of template as its debug info name (eg: "t1" instead of the full "t1<int>"). The intent being that a consumer can still retrieve all that information from the DW_TAG_template_*_parameters. So gno-simple-template-names is business as usual/previously ("t1<int>") =simple is the simplified name ("t1") =mangled is a special mode to communicate the full information, but also indicate that the name should be able to be simplified. The data is encoded as "_STNt1\|<int>" which will be matched with an llvm-dwarfdump --verify feature to deconstruct this name, rebuild the original name, and then try to rebuild the simple name via the DWARF tags - then compare the latter and the former to ensure that all the data necessary to fully rebuild the name is present.	2021-09-22 11:11:49 -07:00
Fangrui Song	a07727199d	Revert code change of D63497 & D74399 for riscv64-*-linux GCC detection This partially reverts commits `1fc2a47f0b` and `9816e726e7`. See D109727. Replacing config.guess in favor of {gcc,clang} -dumpmachine can avoid the riscv64-{redhat,suse}-linux GCC detection. Acked-by: Luís Marques <luismarques@lowrisc.org>	2021-09-20 10:28:32 -07:00
Keith Smiley	80d62993d0	[clang][darwin] Add support for --emit-static-lib This uses darwin's default libtool since llvm-ar isn't normally available. Differential Revision: https://reviews.llvm.org/D109461	2021-09-17 12:11:05 -07:00
Martin Storsjö	d13d9da1fb	[clang] [ARM] Don't set the strict alignment flag for armv7 on Windows Windows on armv7 is as alignment tolerant as Linux. The alignment considerations in the Windows on ARM ABI are documented at https://docs.microsoft.com/en-us/cpp/build/overview-of-arm-abi-conventions?view=msvc-160#alignment. The document doesn't explicitly say in which state the OS configures the SCTLR.A register (and it's not accessible from user space to inspect), but in practice, unaligned loads/stores do work and seem to be as fast as aligned loads and stores. (Unaligned strd also does seem to work, contrary to Linux, but significantly slower, as they're handled by the kernel - exactly as the document describes.) Differential Revision: https://reviews.llvm.org/D109960	2021-09-17 21:39:25 +03:00
Arnold Schwaighofer	f670c5aeee	Add a new frontend flag `-fswift-async-fp={auto\|always\|never}` Summary: Introduce a new frontend flag `-fswift-async-fp={auto\|always\|never}` that controls how code generation sets the Swift extended async frame info bit. There are three possibilities: * `auto`: which determines how to set the bit based on deployment target, either statically or dynamically via `swift_async_extendedFramePointerFlags`. * `always`: default, always set the bit statically, regardless of deployment target. * `never`: never set the bit, regardless of deployment target. Differential Revision: https://reviews.llvm.org/D109451	2021-09-16 08:48:51 -07:00
Alexandros Lamprineas	1bd5ea968e	[ARM] Mitigate the cve-2021-35465 security vulnurability. Recently a vulnerability issue is found in the implementation of VLLDM instruction in the Arm Cortex-M33, Cortex-M35P and Cortex-M55. If the VLLDM instruction is abandoned due to an exception when it is partially completed, it is possible for subsequent non-secure handler to access and modify the partial restored register values. This vulnerability is identified as CVE-2021-35465. The mitigation sequence varies between v8-m and v8.1-m as follows: v8-m.main --------- mrs r5, control tst r5, #8 /* CONTROL_S.SFPA / it ne .inst.w 0xeeb00a40 / vmovne s0, s0 / 1: vlldm sp / Lazy restore of d0-d16 and FPSCR. / v8.1-m.main ----------- vscclrm {vpr} / Clear VPR. / vlldm sp / Lazy restore of d0-d16 and FPSCR. */ More details on developer.arm.com/support/arm-security-updates/vlldm-instruction-security-vulnerability Differential Revision: https://reviews.llvm.org/D109157	2021-09-16 12:56:43 +01:00
Nico Weber	951f362e25	[clang-cl] Add a /diasdkdir flag and make /winsysroot imply it D109708 added "DIA SDK" to our win sysroot for hermetic builds that use LLVM_ENABLE_DIA_SDK. But the build system still has to manually pass flags pointing to it. Since we have a /winsysroot flag, make it look at DIA SDK in the sysroot. With this, the following is enough to compile the DIA2Dump example: out\gn\bin\clang-cl ^ "sysroot\DIA SDK\Samples\DIA2Dump\DIA2Dump.cpp" ^ "sysroot\DIA SDK\Samples\DIA2Dump\PrintSymbol.cpp" ^ "sysroot\DIA SDK\Samples\DIA2Dump\regs.cpp" ^ /diasdkdir "sysroot\DIA SDK" ^ ole32.lib oleaut32.lib diaguids.lib Differential Revision: https://reviews.llvm.org/D109828	2021-09-16 07:42:32 -04:00
Yaxun (Sam) Liu	ab5f2b505a	[HIP] Diagnose -fopenmp-targets for HIP programs Diagnose -fopenmp-targets for HIP programs since dual HIP and OpenMP offloading in the same compilation is currently not supported by HIP toolchain. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D109718	2021-09-15 13:03:57 -04:00
David Tenty	1f3925e25a	[clang][driver][AIX] Add system libc++ header paths to driver This change adds the system libc++ header location to the driver. As well we define the `__LIBC_NO_CPP_MATH_OVERLOADS__` macro when using those headers, in order to suppress conflicting C++ overloads in the system libc headers that were used by XL C++. Reviewed By: ZarkoCA Differential Revision: https://reviews.llvm.org/D109078	2021-09-15 10:41:18 -04:00
Nico Weber	b7bac5a172	[clang] Revert gcc-driver part of `648feabc65` See discussion on https://reviews.llvm.org/D109624	2021-09-13 19:04:29 -04:00
Nico Weber	648feabc65	[clang] Make the driver not diagnose errors on nonexistent linker inputs When nonexistent linker inputs are passed to the driver, the linker now errors out, instead of the compiler. If the linker does not run, clang now emits a "warning: linker input unused" instead of an error for nonexistent files. The motivation for this change is that I noticed that `clang-cl /winsysroot sysroot main.cc ole32.lib` emitted a "ole32.lib not found" error, even though the linker finds it just fine when I run `clang-cl /winsysroot sysroot main.cc /link ole32.lib`. The same problem occurs if running `clang-cl main.cc ole32.lib` in a non-MSVC shell. The problem is that DiagnoseInputExistence() only looked for libs in %LIB%, but MSVCToolChain uses much more involved techniques. For this particular problem, we could make DiagnoseInputExistence() ask the toolchain to see if it can find a .lib file, but in general the driver can't know what the linker will do to find files, so it shouldn't try. For example, if we implement PR24616, lld-link will look in the registry to determine a good default for %LIB% if it isn't set. This is less or a problem for the gcc driver, since .a paths there are either passed via -l flags (which honor -L), or via a qualified path (that doesn't honor -L) -- but for example ld.lld's --chroot flag can also trigger this problem. Without this patch, `clang -fuse-ld=lld -Wl,--chroot,some/dir /file.o` will complain that `/file.o` doesn't exist, even though `clang -fuse-ld=lld -Wl,--chroot,some/dir -Wl,/file.o` succeeds just fine. This implements rnk's suggestion on the old bug PR27234. Differential Revision: https://reviews.llvm.org/D109624	2021-09-13 08:57:38 -04:00
Joseph Huber	29b44ca896	[OpenMP] Add flag for setting debug in the offloading device This patch introduces the flags `-fopenmp-target-debug` and `-fopenmp-target-debug=` to set the value of a global in the device. This will be used to enable or disable debugging features statically in the device runtime library. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D109544	2021-09-10 18:19:19 -04:00
Jon Chesterfield	2a581710c1	[openmp] No longer use LIBRARY_PATH to find devicertl Given D109057, change test runner to use the libomptarget-x-bc-path argument instead of the LIBRARY_PATH environment variable to find the device library. Also drop the use of LIBRARY_PATH environment variable as it is far too easy to pull in the device library from an unrelated toolchain by accident with the current setup. No loss in flexibility to developers as the clang commandline used here is still available. Reviewed By: jdoerfert, tianshilei1992 Differential Revision: https://reviews.llvm.org/D109061	2021-09-09 17:16:41 +01:00
Usman Nadeem	0a9d740c23	[clang][Driver] Update/cleanup LTO logic to ensure that the last lto argument is honored - Make flto an alias of flto=full. - Make foffload-lto an alias of foffload-lto=full. - Make flto_EQ_jobserver, flto_EQ_auto aliases of flto=full, since they are being treated as full lto right now. - Clean up the code for parseLTOMode and setLTOMode. - Replace uses of OPT_flto with OPT_flto_EQ since they alias now. Differential Revision: https://reviews.llvm.org/D108881 Change-Id: I5d867db83a680434fba5c8d85c9a83135d3b81ee	2021-09-08 15:53:49 -07:00
Usman Nadeem	54612a037a	Revert "[clang][Driver] Update/cleanup LTO logic to ensure that the last lto argument is honored" This reverts commit `d2d2e5ea48`.	2021-09-08 15:49:35 -07:00
Usman Nadeem	d2d2e5ea48	[clang][Driver] Update/cleanup LTO logic to ensure that the last lto argument is honored - Make flto an alias of flto=full. - Make foffload-lto an alias of foffload-lto=full. - Make flto_EQ_jobserver, flto_EQ_auto aliases of flto=full, since they are being treated as full lto right now. - Clean up the code for parseLTOMode and setLTOMode. - Replace uses of OPT_flto with OPT_flto_EQ since they alias now. Change-Id: Iea5338c20cb800b43529b20745e92600e2cfd2b1	2021-09-08 15:40:32 -07:00
Saiyedul Islam	98380762c3	[clang-offload-bundler] Make Bundle Entry ID backward compatible Earlier BundleEntryID used to be <OffloadKind>-<Triple>-<GPUArch>. This used to work because the clang-offload-bundler didn't need GPUArch explicitly for any bundling/unbundling action. With unbundleArchive it needs GPUArch to ensure compatibility between device specific code objects. D93525 enforced triples to have separators for all 4 components irrespective of number of components, like "amdgcn-amd-amdhsa--". It was required to to correctly parse a possible 4th environment component or a GPU. But, this condition is breaking backward compatibility with archive libraries compiled with compilers older than D93525. This patch allows triples to have any number of components with and without extra separator for empty environment field. Thus, both the following bundle entry IDs are same: openmp-amdgcn-amd-amdhsa--gfx906 openmp-amdgcn-amd-amdhsa-gfx906 Reviewed By: yaxunl, grokos Differential Revision: https://reviews.llvm.org/D106809	2021-09-08 16:06:12 +05:30
Kadir Cetinkaya	73c00d40bd	[clang][Driver] Pick the last --driver-mode in case of multiple ones This was an accidental behaviour change in D106789 and this patch restores it back to original state. Differential Revision: https://reviews.llvm.org/D109361	2021-09-07 15:33:45 +02:00
Kazu Hirata	15cd16aaf0	[Driver] Drop unnecessary const from return types (NFC) Identified with readability-const-return-type.	2021-09-04 08:05:27 -07:00
Brad Smith	775ab780fd	Support linking against OpenMP runtime on OpenBSD.	2021-09-03 19:33:09 -04:00
Brad Smith	b989662eb0	OpenBSD also needs execinfo	2021-09-03 17:33:48 -04:00
Frederic Cambus	466451c661	[clang] Allow the OpenBSD driver to link the libclang_rt.profile library. Differential Revision: https://reviews.llvm.org/D109244	2021-09-03 17:18:40 -04:00
Ben Shi	12fee64daf	[CUDA][NFC] Fix wrong assert information Reviewed By: fodinabor Differential Revision: https://reviews.llvm.org/D109232	2021-09-03 22:35:42 +08:00
Nico Weber	cc2d4dc3e0	Reland "Try to unbreak Win build differently after 973519826edb76"" Build should be fixed by https://github.com/llvm/llvm-project/commit/9d22754389 This reverts commit `df052e1732`. Differential Revision: https://reviews.llvm.org/D109181	2021-09-02 16:19:58 -07:00
Geoffrey Martin-Noble	df052e1732	Revert "Try to unbreak Win build differently after 973519826edb76" Breaks the build and failed pre-merge checks: https://buildkite.com/llvm-project/premerge-checks/builds/54930#07373971-3d37-49cf-9def-22c0d724ee23 > llvm-project/lld/wasm/Writer.cpp:521:16: error: non-const lvalue reference to > type 'llvm::StringRef' cannot bind to a temporary of type 'llvm::StringRef' > for (auto &feature : used.keys()) { This reverts commit `5881dcff7e`.	2021-09-02 12:05:33 -07:00
Nico Weber	5881dcff7e	Try to unbreak Win build differently after `973519826e` Looks like the MS STL wants StringMapKeyIterator::operator*() to be const. Return the result by copy instead of reference to do that. Assigning to a hash map key iterator doesn't make sense anyways. Also reverts `123f811fe5` which is now hopefully no longer needed. Differential Revision: https://reviews.llvm.org/D109167	2021-09-02 14:45:56 -04:00
Nico Weber	123f811fe5	Try to unbreak Win build after `973519826e` Apparently some versions of the MS STL don't like constructing a vector from a StringMapKeyIterator<>: http://45.33.8.238/win/44999/step_4.txt It builds fine with the MS STL on my Windows box, so just sidestep the issue. Full error for posterity: VC\Tools\MSVC\14.14.26428\include\xmemory(218,75): error: indirection requires pointer operand ('const llvm::StringMapKeyIterator<llvm::StringRef>' invalid) _Uses_default_construct_t<_Alloc, decltype(_Unfancy(_UDest)), decltype(*_UFirst)>()))); VC\Tools\MSVC\14.14.26428\include\vector(1922,11): note: in instantiation of function template specialization 'std::_Uninitialized_copy<...>' requested here return (_Uninitialized_copy(_First, _Last, _Dest, this->_Getal())); VC\Tools\MSVC\14.14.26428\include\vector(757,22): note: in instantiation of function template specialization 'std::vector<llvm::StringRef>::_Ucopy<llvm::StringMapKeyIterator<llvm::StringRef>>' requested here this->_Mylast() = _Ucopy(_First, _Last, this->_Myfirst()); VC\Tools\MSVC\14.14.26428\include\vector(772,3): note: in instantiation of function template specialization 'std::vector<llvm::StringRef>::_Range_construct_or_tidy<llvm::StringMapKeyIterator<llvm::StringRef>>' requested here _Range_construct_or_tidy(_Unchecked(_First), _Unchecked(_Last), _Iter_cat_t<_Iter>{}); ../../clang/lib/Driver/ToolChains/Arch/X86.cpp(62,30): note: in instantiation of function template specialization 'std::vector<llvm::StringRef>::vector<llvm::StringMapKeyIterator<llvm::StringRef>, void>' requested here std::vector<StringRef> ValidArchs{ArchMap.keys().begin(),	2021-09-02 12:06:53 -04:00
Nico Weber	973519826e	[clang-cl] Emit nicer warning on unknown /arch: arguments Now prints the list of known archs. This requires plumbing a Driver arg through a few functions. Also add two more convenience insert() overlods to StringMap. Differential Revision: https://reviews.llvm.org/D109105	2021-09-02 10:37:32 -04:00
Jon Chesterfield	c7cbf1a03e	[openmp] Accept directory for libomptarget-bc-path The commandline flag to specify a particular openmp devicertl library currently errors like: ``` fatal error: cannot open file './runtimes/runtimes-bins/openmp/libomptarget': Is a directory ``` CommonArgs successfully appends the directory to the commandline args then mlink-builtin-bitcode rejects it. This patch is a point fix to that. If --libomptarget-amdgcn-bc-path=directory then append the expected name for the current architecture and go on as before. This is useful for test runners that don't hardcode the architecture. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D109057	2021-09-01 21:22:35 +01:00
Jon Chesterfield	6b0636ce53	Revert "[openmp] Accept directory for libomptarget-bc-path" Windows separator problem. Fixing that broke another regex. This reverts commit `0173e024fd`.	2021-09-01 20:45:41 +01:00
Jon Chesterfield	cef1199686	Revert "[openmp] No longer use LIBRARY_PATH to find devicertl" This reverts commit `7a228f872f`. Failing test case under CI	2021-09-01 20:44:12 +01:00
Jon Chesterfield	7a228f872f	[openmp] No longer use LIBRARY_PATH to find devicertl Given D109057, change test runner to use the libomptarget-x-bc-path argument instead of the LIBRARY_PATH environment variable to find the device library. Also drop the use of LIBRARY_PATH environment variable as it is far too easy to pull in the device library from an unrelated toolchain by accident with the current setup. No loss in flexibility to developers as the clang commandline used here is still available. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D109061	2021-09-01 20:24:34 +01:00
Nico Weber	3d157cfcc4	[clang] Add a -canonical-prefixes option In https://reviews.llvm.org/D47480 I complained that there's no positive form of this flag, so let's add one :) https://gcc.gnu.org/PR29931 also has a pending patch to add the positive form to gcc (but there's admittedly not a lot of movement on that bug). This doesn't change any defaults. Differential Revision: https://reviews.llvm.org/D108818	2021-09-01 14:51:06 -04:00
Jon Chesterfield	0173e024fd	[openmp] Accept directory for libomptarget-bc-path The commandline flag to specify a particular openmp devicertl library currently errors like: ``` fatal error: cannot open file './runtimes/runtimes-bins/openmp/libomptarget': Is a directory ``` CommonArgs successfully appends the directory to the commandline args then mlink-builtin-bitcode rejects it. This patch is a point fix to that. If --libomptarget-amdgcn-bc-path=directory then append the expected name for the current architecture and go on as before. This is useful for test runners that don't hardcode the architecture. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D109057	2021-09-01 19:46:21 +01:00
Zahira Ammarguellat	cec7c2b32e	Revert "[CLANG][PATCH][FPEnv] Add support for option -ffp-eval-method and extend #pragma float_control similarly" The intent of this patch is to add support of -fp-model=[source\|double\|extended] to allow the compiler to use a wider type for intermediate floating point calculations. As a side effect to that, the value of FLT_EVAL_METHOD is changed according to the pragma float_control. Unfortunately some issue was uncovered with this change in preprocessing. See details in https://reviews.llvm.org/D93769 . We are therefore reverting this patch until we find a way to reconcile the value of FLT_EVAL_METHOD, the pragma and the -E flow. This reverts commit `66ddac22e2`.	2021-09-01 04:48:50 -07:00
Joel E. Denny	83ddfa0d22	[OpenMP][OpenACC] Implement `ompx_hold` map type modifier extension in Clang (1/2) This patch implements Clang support for an original OpenMP extension we have developed to support OpenACC: the `ompx_hold` map type modifier. The next patch in this series, D106510, implements OpenMP runtime support. Consider the following example: ``` #pragma omp target data map(ompx_hold, tofrom: x) // holds onto mapping of x { foo(); // might have map(delete: x) #pragma omp target map(present, alloc: x) // x is guaranteed to be present printf("%d\n", x); } ``` The `ompx_hold` map type modifier above specifies that the `target data` directive holds onto the mapping for `x` throughout the associated region regardless of any `target exit data` directives executed during the call to `foo`. Thus, the presence assertion for `x` at the enclosed `target` construct cannot fail. (As usual, the standard OpenMP reference count for `x` must also reach zero before the data is unmapped.) Justification for inclusion in Clang and LLVM's OpenMP runtime: * The `ompx_hold` modifier supports OpenACC functionality (structured reference count) that cannot be achieved in standard OpenMP, as of 5.1. * The runtime implementation for `ompx_hold` (next patch) will thus be used by Flang's OpenACC support. * The Clang implementation for `ompx_hold` (this patch) as well as the runtime implementation are required for the Clang OpenACC support being developed as part of the ECP Clacc project, which translates OpenACC to OpenMP at the directive AST level. These patches are the first step in upstreaming OpenACC functionality from Clacc. * The Clang implementation for `ompx_hold` is also used by the tests in the runtime implementation. That syntactic support makes the tests more readable than low-level runtime calls can. Moreover, upstream Flang and Clang do not yet support OpenACC syntax sufficiently for writing the tests. * More generally, the Clang implementation enables a clean separation of concerns between OpenACC and OpenMP development in LLVM. That is, LLVM's OpenMP developers can discuss, modify, and debug LLVM's extended OpenMP implementation and test suite without directly considering OpenACC's language and execution model, which can be handled by LLVM's OpenACC developers. * OpenMP users might find the `ompx_hold` modifier useful, as in the above example. See new documentation introduced by this patch in `openmp/docs` for more detail on the functionality of this extension and its relationship with OpenACC. For example, it explains how the runtime must support two reference counts, as specified by OpenACC. Clang recognizes `ompx_hold` unless `-fno-openmp-extensions`, a new command-line option introduced by this patch, is specified. Reviewed By: ABataev, jdoerfert, protze.joachim, grokos Differential Revision: https://reviews.llvm.org/D106509	2021-08-31 16:13:49 -04:00
Kazu Hirata	b8debabb77	[clang] Remove redundant calls to c_str() (NFC) Identified with readability-redundant-string-cstr.	2021-08-31 08:53:51 -07:00
Simon Moll	a5791badde	[clang] Add gcc-toolset-10 support (RHEL/CentOS 8) Clang only adds GCC paths for RHEL <= 7 'devtoolset-<N>' Software Collections (SCL). This generalizes this support to also include the 'gcc-toolset-10' SCL in RHEL/CentOS 8. Reviewed By: stephan.dollberg Differential Revision: https://reviews.llvm.org/D108908	2021-08-30 13:33:30 +02:00
Lin Sun	d280a76908	[Driver][Linux] Fix regression when -DLIBCXX_LIBDIR_SUFFIX=64 This patch allows an installed (`ninja install-clang`) Clang to find `../lib64/libc++.so` Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D108286	2021-08-25 23:50:17 -07:00
Heejin Ahn	a947b40caf	[WebAssembly] Add Wasm SjLj option support for clang This adds support for Wasm SjLj in clang. Also this sets the new `-mllvm -wasm-enable-eh` option for Wasm EH. Note there is a little unfortunate inconsistency there: Wasm EH is enabled by a clang option `-fwasm-exceptions`, which sets `-mllvm -wasm-enable-eh` in the backend options. It also sets `-exception-model=wasm` but this is done in the common code. Wasm SjLj doesn't have a clang-level option like `-fwasm-exceptions`. `-fwasm-exceptions` was added because each exception model has its corresponding `-f*-exceptions`, but I'm not sure if adding a new option like `-fwasm-sjlj` or something is a good idea. So the current plan is Emscripten sets `-mllvm -wasm-enable-sjlj` if Wasm SJLj is enabled in its settings.js, as it does for Emscripten EH/SjLj (it sets `-mllvm -enable-emscripten-cxx-exceptions` for Emscripten EH and `-mllvm -enable-emscripten-sjlj` for Emscripten SjLj). And setting this enables the exception handling feature, and also sets `-exception-model=wasm`, but this time this is not done in the common code so we do it ourselves. Also note that other exception models have 1-to-1 correspondance with their `-f-exceptions` flag and their `-exception-model=**` flag, but because we use `-exception-model=wasm` also for Wasm SjLj while `-fwasm-exceptions` still means Wasm EH, there is also a little inconsistency there, but I think it is manageable. Also this adds various error checking and tests. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D108582	2021-08-24 18:12:52 -07:00
Ed Maste	6609892a2d	[clang] allow -fstack-clash-protection on FreeBSD -fstack-clash-protection was added in Clang commit `e67cbac812` but was enabled only on Linux. Allow it on FreeBSD as well, as it works fine. Reviewed By: serge-sans-paille Differential Revision: https://reviews.llvm.org/D108571	2021-08-24 21:02:36 -04:00
Artem Belevich	3db8e486e5	[CUDA] Improve CUDA version detection and diagnostics. Always use cuda.h to detect CUDA version. It's a more universal approach compared to version.txt which is no longer present in recent CUDA versions. Split the 'unknown CUDA version' warning in two: * when detected CUDA version is partially supported by clang. It's expected to work in general, at the feature parity with the latest supported CUDA version. and may be missing support for the new features/instructions/GPU variants. Clang will issue a warning. * when detected version is new. Recent CUDA versions have been working with clang reasonably well, and will likely to work similarly to the partially supported ones above. Or it may not work at all. Clang will issue a warning and proceed as if the latest known CUDA version was detected. Differential Revision: https://reviews.llvm.org/D108247	2021-08-23 13:24:48 -07:00
Artem Belevich	49d982d8cb	[CUDA] Add support for CUDA-11.4 Differential Revision: https://reviews.llvm.org/D108239	2021-08-23 13:24:46 -07:00
Artem Belevich	0060fffc82	[CUDA] Bump default GPU architecture to sm_35. It's the oldest GPU architecture currently supported by all CUDA versions clang can use. Differential Revision: https://reviews.llvm.org/D108235	2021-08-23 13:24:45 -07:00
Brian Cain	59dfde7d94	[clang] enable sanitizers for hexagon	2021-08-17 19:59:24 -07:00
Ben Shi	b31199bab4	[AVR][clang] Improve search for avr-libc installation path Search avr-libc path according to avr-gcc installation at first, then other possible installed pathes. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D107682	2021-08-17 11:51:35 +08:00
Sylvestre Ledru	b8d451da86	Add support of the future Debian (Debian 12 - Bookworm) https://wiki.debian.org/DebianBookworm ETA: 2023	2021-08-16 09:11:31 +02:00
Pushpinder Singh	60e07a9568	[AMDGPU][OpenMP] Use llvm-link to link ocml libraries This fixes the 'unused linker option: -lm' warning when compiling program with -c. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D107952	2021-08-13 13:36:57 +05:30
Sarah Purohit	ee620b1743	[clang][Arm] Fix the default floating point ABI for 'armv7-pc-win32-macho' It is incorrect to select the hardware floating point ABI on Mach-O platforms using the Windows triple if the ABI is "apcs-gnu". rdar://81810554 Differential Revision: https://reviews.llvm.org/D107939	2021-08-12 21:46:30 -07:00
Hongtao Yu	ccb5b9bbfb	[CSSPGO] Allow the use of debug-info-for-profiling and pseudo-probe-for-profiling together Previoulsy debug-info-for-profiling and pseudo-probe-for-profiling are mutual exclusive because they compete the dwarf discrimnator for callsites on the IR. This changes allows to use the two switches together. The side effect is that callsite discriminators will be taken by pseudo probe, while discriminators for other instructions are still available for AutoFDO use. This is less than ideal, however, it still allows us a chance to smoothly transition from AutoFDO to CSSPGO, by collecting both profiles from a CSSPGO binary. Reviewed By: wenlei, wmi Differential Revision: https://reviews.llvm.org/D107876	2021-08-12 08:52:49 -07:00
Martin Storsjö	5ed9e5c2c0	[clang] [MinGW] Consider the per-target libc++ include directory too The existing logic for per-target libc++ include directories only seem to exist for the Gnu and Fuchsia drivers, added in `ea12d779bc` / D89013. This is less generic than the corresponding case in the Gnu driver, but matches the existing level of genericity in the MinGW driver (and others too). Differential Revision: https://reviews.llvm.org/D107893	2021-08-12 13:27:09 +03:00
Joseph Huber	01d59c0de8	[OpenMP]Fix PR50336: Remove temporary files in the offload bundler tool Temporary files created by the offloading device toolchain are not removed after compilation when using a two-step compilation. The offload-bundler uses a different filename for the device binary than the `.o` file present in the Job's input list. This is not listed as a temporary file so it is never removed. This patch explicitly adds the device binary as a temporary file to consume it. This fixes PR50336. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D107668	2021-08-11 08:50:47 -04:00
Petr Hosek	389dc94d4b	[InstrProfiling] Generate runtime hook for Fuchsia When none of the translation units in the binary have been instrumented we shouldn't need to link the profile runtime. However, because we pass -u__llvm_profile_runtime on Linux and Fuchsia, the runtime would still be pulled in and incur some overhead. On Fuchsia which uses runtime counter relocation, it also means that we cannot reference the bias variable unconditionally. This change modifies the InstrProfiling pass to pull in the profile runtime only when needed by declaring the __llvm_profile_runtime symbol in the translation unit only when needed. For now we restrict this only for Fuchsia, but this can be later expanded to other platforms. This approach was already used prior to `9a041a7522`, but we changed it to always generate the __llvm_profile_runtime due to a TAPI limitation, but that limitation may no longer apply, and it certainly doesn't apply on platforms like Fuchsia. Differential Revision: https://reviews.llvm.org/D98061	2021-08-10 23:21:15 -07:00
Brian Cain	888876ba27	[clang] [hexagon] Add resource include dir	2021-08-10 08:37:58 -05:00
Ettore Tiotto	41e3ac398c	[AIX]: Fix option processing for -b Code added by D106688 has a problem. It passes the option -bxyz to the system linker as -b xyz xyz (duplication of the string 'xyz' is incorrect). This patch fixes that oversight. Reviewed by: hubert.reinterpretcast, jsji Differential Revision: https://reviews.llvm.org/D107786	2021-08-09 19:52:31 -04:00
Craig Topper	618543bb12	[clang][NFC] Fix a -Wparentheses warning.	2021-08-07 08:56:31 -07:00
Matt Jacobson	71e71067f3	[AVR][clang] Add '$SYSROOT/avr' to possible avr-libc locations Reviewed by: benshi001 Differential Revision: https://reviews.llvm.org/D107672	2021-08-07 10:24:14 +08:00
Zahira Ammarguellat	4389a413e2	Revert "[clang][fpenv][patch] Change clang option -ffp-model=precise to select ffp-contract=on" This reverts commit `48ad446a0f`.	2021-08-06 12:01:47 -07:00
Artem Belevich	6a9cf21f5a	[CUDA, MemCpyOpt] Add a flag to force-enable memcpyopt and use it for CUDA. Attempt to enable MemCpyOpt unconditionally in D104801 uncovered the fact that there are users that do not expect LLVM to materialize `memset` intrinsic. While other passes can do that, too, MemCpyOpt triggers it more frequently and breaks sanitizers and some downstream users. For now introduce a flag to force-enable the flag and opt-in only CUDA compilation with NVPTX back-end. Differential Revision: https://reviews.llvm.org/D106401	2021-08-06 11:13:52 -07:00
Matt Jacobson	dae7adda94	[AVR][clang] Pass '-fno-use-init-array' to cc1 as default On AVR, '.ctors' is used, not '.init_array'. Make this the default unless specifically overridden by driver argument. This matches gcc, and it matches the behavior in (e.g.) the NetBSD driver (for certain OS variants). Reviewed by: MaskRay Differential Revision: https://reviews.llvm.org/D107610	2021-08-06 10:14:23 +08:00
Fangrui Song	c38efb4899	[clang] Implement -falign-loops=N (N is a power of 2) for non-LTO GCC supports multiple forms of -falign-loops=. -falign-loops= is currently ignored in Clang. This patch implements the simplest but the most useful form where N is a power of 2. The underlying implementation uses a `llvm::TargetOptions` option for now. Bitcode generation ignores this option. Differential Revision: https://reviews.llvm.org/D106701	2021-08-05 12:17:50 -07:00
Aaron Ballman	530ea28fef	Correct a lot of diagnostic wordings for the driver Clang diagnostics should not start with a capital letter or use trailing punctuation (https://clang.llvm.org/docs/InternalsManual.html#the-format-string), but quite a few driver diagnostics were not following this advice. This corrects the grammar and punctuation to improve consistency, but does not change the circumstances under which the diagnostics are produced.	2021-08-05 07:04:55 -04:00
Martin Storsjö	ce49fd024b	[clang] [MinGW] Let the last of -mconsole/-mwindows have effect Don't just check for the existence of one, but check which one was specified last, if any. This fixes https://llvm.org/PR51296. Differential Revision: https://reviews.llvm.org/D107261	2021-08-03 10:55:44 +03:00
modimo	b40a2a533a	[clang] Add support for optional flag -fnew-infallible to restrict exception propagation The declaration for the global new function in C++ is generated in the compiler front-end. When examining exception propagation, we found that this is the largest root throw site propagator requiring unwind code to be generated for callers up the stack. Allowing this to be handled immediately with termination stops upward propagation and leads to significantly less landing pads generated. This in turns leads to a performance and .text size win. With `-fnew-infallible` this annotates the declaration with `throw()` and `__attribute__((returns_nonnull))`. `throw()` allows the compiler to assume exceptions do not propagate out of new and eliminate it as a root throw site. Note that the definition of global new is user-replaceable so users should ensure that the one used follows these semantics. Measuring internally, we're seeing at 0.5% CPU win in one of our large internal FB workload. Measuring on clang self-build (`cd0a1226b5`) we get: thinlto/ "dwarfehprepare.NumCleanupLandingPadsRemaining": 153494, "dwarfehprepare.NumNoUnwind": 26309, thinlto_newinfallible/ "dwarfehprepare.NumCleanupLandingPadsRemaining": 143660, "dwarfehprepare.NumNoUnwind": 28744, a 1-143660/153494 = 6.4% reduction in landing pads and a 28744/26309 = 9.3% increase in the number of nounwind functions. Testing: ninja check-all new test case to make sure these attributes are added correctly to global new. Reviewed By: urnathan Differential Revision: https://reviews.llvm.org/D105225	2021-08-02 15:45:06 -07:00
Alex Lorenz	f575f37182	[clang][darwin] Add support for the -mtargetos= option to the driver The new -mtargetos= option is a replacement for the existing, OS-specific options like -miphoneos-version-min=. This allows us to introduce support for new darwin OSes easier as they won't require the use of a new option. The older options will be deprecated and the use of the new option will be encouraged instead. Differential Revision: https://reviews.llvm.org/D106316	2021-08-02 12:45:40 -07:00
Scott Linder	635c5ba45b	[AMDGPU][HIP] Switch default DWARF version to 5 Another attempt at changing this default, now that tooling has greater support for DWARF 5. Differential Revision: https://reviews.llvm.org/D107190	2021-08-02 18:04:01 +00:00
Pushpinder Singh	713a5d12cd	[OpenMP][AMDGCN] Initial math headers support With this patch, OpenMP on AMDGCN will use the math functions provided by ROCm ocml library. Linking device code to the ocml will be done in the next patch. Reviewed By: JonChesterfield, jdoerfert, scchan Differential Revision: https://reviews.llvm.org/D104904	2021-08-02 14:38:52 +00:00
Justas Janickas	b13fc7311e	[OpenCL] __cpp_threadsafe_static_init is by default undefined in OpenCL mode. Definition of `__cpp_threadsafe_static_init` macro is controlled by language option Opts.ThreadsafeStatics. This patch sets language option to false by default in OpenCL mode, resulting in macro `__cpp_threadsafe_static_init` being undefined. Default value can be overridden using command line option -fthreadsafe-statics. Change is supposed to address portability because not all OpenCL vendors support thread safe implementation of static initialization. Fixes llvm.org/PR48012 Differential Revision: https://reviews.llvm.org/D107163	2021-08-02 14:10:15 +01:00
peter klausler	3338ef93b0	[flang] Produce proper "preprocessor output" for -E option Rename the current -E option to "-E -Xflang -fno-reformat". Add a new Parsing::EmitPreprocessedSource() routine to convert the cooked character stream output of the prescanner back to something more closely resembling output from a traditional preprocessor; call this new routine when -E appears. The new -E output is suitable for use as fixed form Fortran source to compilation by (one hopes) any Fortran compiler. If the original top-level source file had been free form source, the output will be suitable for use as free form source as well; otherwise there may be diagnostics about missing spaces if they were indeed absent in the original fixed form source. Unless the -P option appears, #line directives are interspersed with the output (but be advised, f18 will ignore these if presented with them in a later compilation). An effort has been made to preserve original alphabetic character case and source indentation. Add -P and -fno-reformat to the new drivers. Tweak test options to avoid confusion with prior -E output; use -fno-reformat where needed, but prefer to keep -E, sometimes in concert with -P, on most, updating expected results accordingly. Differential Revision: https://reviews.llvm.org/D106727	2021-07-30 15:13:56 -07:00
Jon Chesterfield	7f97ddaf8a	Revert "[OpenMP][AMDGCN] Initial math headers support" Broke nvptx compilation on files including <complex> This reverts commit `12da97ea10`.	2021-07-30 22:07:00 +01:00
Anjan Kumar	aa35c496cf	[AIX] Pass the -b option to linker on AIX (with fix to build break) This patch will re-enable the patch posted under https://reviews.llvm.org/D106688 originally which was reverted due to buildbreak that was caused by mismatched diagnostic message arguments. Reviewed By: Zarko Todorovski Differential Revision: https://reviews.llvm.org/D107105	2021-07-30 15:50:52 +00:00
Pushpinder Singh	12da97ea10	[OpenMP][AMDGCN] Initial math headers support With this patch, OpenMP on AMDGCN will use the math functions provided by ROCm ocml library. Linking device code to the ocml will be done in the next patch. Reviewed By: JonChesterfield, jdoerfert, scchan Differential Revision: https://reviews.llvm.org/D104904	2021-07-30 14:52:41 +00:00
Pushpinder Singh	9830f902e4	[AMDGPU][OpenMP] Support linking of math libraries Math libraries are linked only when -lm is specified. This is because host system could be missing rocm-device-libs. Reviewed By: JonChesterfield, yaxunl Differential Revision: https://reviews.llvm.org/D105981	2021-07-30 13:53:44 +00:00
Matt Jacobson	1e6a93f15c	[AVR][clang] Pass '--start-group' and '--end-group' options to avr-ld Reviewed By: Ben Shi Differential Revision: https://reviews.llvm.org/D106854	2021-07-30 08:25:14 +08:00
Anjan Kumar	7645cdcb48	Revert "[AIX] Pass the -b option to linker on AIX" This reverts commit `109954410c`.	2021-07-29 19:40:25 +00:00
Anjan Kumar	109954410c	[AIX] Pass the -b option to linker on AIX Parse the -b option in the driver and pass it to the linker if the target OS is AIX. This will establish compatibility with the other AIX compilers. Reviewed By: Zarko Todorovski Differential Revision: https://reviews.llvm.org/D106688	2021-07-29 18:14:41 +00:00
Jamie Schmeiser	c3c1826c31	Set TargetCPUName for AIX to default to pwr7. Summary: Set the TargetCPUName for AIX to default to pwr7, removing the setting of it based on the major/minor of the OS version, which previously set it to pwr4 for AIX 7.1 and earlier. The old code would also set it to pwr4 when the OS version was not specified and with the change, it will default it to pwr7 in all cases. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By:hubert.reinterpretcast (Hubert Tong) Differential Revision: https://reviews.llvm.org/D107063	2021-07-29 09:59:24 -04:00
Melanie Blower	66ddac22e2	[CLANG][PATCH][FPEnv] Add support for option -ffp-eval-method and extend #pragma float_control similarly The Intel compiler ICC supports the option "-fp-model=(source\|double\|extended)" which causes the compiler to use a wider type for intermediate floating point calculations. Also supported is a way to embed this effect in the source program with #pragma float_control(source\|double\|extended). This patch extends pragma float_control syntax, and also adds support for a new floating point option "-ffp-eval-method=(source\|double\|extended)". source: intermediate results use source precision double: intermediate results use double precision extended: intermediate results use extended precision Reviewed By: Aaron Ballman Differential Revision: https://reviews.llvm.org/D93769	2021-07-28 10:50:32 -04:00
Melanie Blower	48ad446a0f	[clang][fpenv][patch] Change clang option -ffp-model=precise to select ffp-contract=on Change the ffp-model=precise to enables -ffp-contract=on (previously -ffp-model=precise enabled -ffp-contract=fast). This is a follow-up to Andy Kaylor's comments in the llvm-dev discussion "Floating Point semantic modes". From the same email thread, I put Andy's distillation of floating point options and floating point modes into UsersManual.rst Also fixes bugs.llvm.org/show_bug.cgi?id=50222 I had to revert this a few times because of failures on the x86-64 buildbot but I think we finally have that fixed by LNT/79f2b03c51. Reviewed By: rjmccall, andrew.kaylor Differential Revision: https://reviews.llvm.org/D74436	2021-07-27 13:55:31 -04:00
Kadir Cetinkaya	ce90b60bd0	[clang][Driver] Expose driver mode detection logic Also use it in other places that performed it on their own. Differential Revision: https://reviews.llvm.org/D106789	2021-07-27 14:49:53 +02:00
Nico Weber	452095fe2f	[clang/darwin] Pass libclang_rt.profile last on linker command This reverts the functional change of https://reviews.llvm.org/D35385 because it sounds like this is no longer necessary (https://bugs.llvm.org/show_bug.cgi?id=51135#c11) and makes clang's behavior more uniform across platforms. Differential Revision: https://reviews.llvm.org/D106733	2021-07-27 07:51:06 -04:00
Jan Svoboda	b76c7c6faf	[clang][driver] NFC: Expose InputInfo in Job instead of plain filenames This patch exposes `InputInfo` in `Job` instead of plain filenames. This is useful in a follow-up patch that uses this to recognize `-cc1` commands interesting for Clang tooling. Depends on D106787. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D106788	2021-07-27 09:18:58 +02:00
Jan Svoboda	60426f33b1	[clang][driver] NFC: Move InputInfo.h from lib to include Moving `InputInfo.h` from `lib/Driver/` into `include/Driver` to be able to expose it in an API consumed from outside of `clangDriver`. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D106787	2021-07-27 09:17:39 +02:00
Amy Huang	1a3bf2953a	[DebugInfo] Switch to using constructor homing (-debug-info-kind=constructor) by default when debug info is enabled Constructor homing reduces the amount of class type info that is emitted by emitting conmplete type info for a class only when a constructor for that class is emitted. This will mainly reduce the amount of duplicate debug info in object files. In Chrome enabling ctor homing decreased total build directory sizes by about 30%. It's also expected that some class types (such as unused classes) will no longer be emitted in the debug info. This is fine, since we wouldn't expect to need these types when debugging. In some cases (e.g. libc++, https://reviews.llvm.org/D98750), classes are used without calling the constructor. Since this is technically undefined behavior, enabling constructor homing should be fine. However Clang now has an attribute `__attribute__((standalone_debug))` that can be used on classes to ignore ctor homing. Bug: https://bugs.llvm.org/show_bug.cgi?id=46537 Differential Revision: https://reviews.llvm.org/D106084	2021-07-26 17:24:42 -07:00
Joseph Huber	d297211692	[OpenMP] Add a driver flag to enable the new device runtime library This patch adds a driver flag `-fopenmp-target-new-runtime` to optionally enable the new device runtime bitcode library. This allows users to enable the new experimental runtime before it becomes the default in the future. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106793	2021-07-26 16:35:56 -04:00
Michael Kruse	ae6b400002	[Preprocessor] Implement -fminimize-whitespace. This patch adds the -fminimize-whitespace with the following effects: * If combined with -E, remove as much non-line-breaking whitespace as possible. * If combined with -E -P, removes as much whitespace as possible, including line-breaks. The motivation is to reduce the amount of insignificant changes in the preprocessed output with source files where only whitespace has been changed (add/remove comments, clang-format, etc.) which is in particular useful with ccache. A patch for ccache for using this flag has been proposed to ccache as well: https://github.com/ccache/ccache/pull/815, which will use -fnormalize-whitespace when clang-13 has been detected, and additionally uses -P in "unify_mode". ccache already had a unify_mode in an older version which was removed because of problems that using the preprocessor itself does not have (such that the custom tokenizer did not recognize C++11 raw strings). This patch slightly reorganizes which part is responsible for adding newlines that are required for semantics. It is now either startNewLineIfNeeded() or MoveToLine() but never both; this avoids the ShouldUpdateCurrentLine workaround and avoids redundant lines being inserted in some cases. It also fixes a mandatory newline not inserted after a _Pragma("...") that is expanded into a #pragma. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D104601	2021-07-25 23:30:57 -05:00
Fangrui Song	7290ddd6b1	Revert "[clang] -falign-loops=" This reverts commit `42896eeed9`. Unfinished. Accidentally pushed when reverting a clangd commit.	2021-07-23 09:58:35 -07:00
Fangrui Song	42896eeed9	[clang] -falign-loops=	2021-07-23 09:50:43 -07:00
Yaxun (Sam) Liu	44dbbe6106	[HIP] Preserve ASAN bitcode library functions Address sanitizer passes may generate call of ASAN bitcode library functions after bitcode linking in lld, therefore lld cannot add those symbols since it does not know they will be used later. To solve this issue, clang emits a reference to a bicode library function which calls all ASAN functions which need to be preserved. This basically force all ASAN functions to be linked in. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D106315	2021-07-23 10:35:52 -04:00
Yaxun (Sam) Liu	9a977daaf6	Fix __hip_fabin visibility In -fgpu-rdc case, fat binary is embedded as global variable __hip_fatbin. It needs to have protected visibility to avoid conflict between shared libraries. Reviewed by: Siu Chi Chan Differential Revision: https://reviews.llvm.org/D106571 Fixes: SWDEV-292290	2021-07-23 10:14:29 -04:00

... 6 7 8 9 10 ...

7007 Commits