llvm-project

Commit Graph

Author	SHA1	Message	Date
Kazu Hirata	cb2c8f694d	[clang] Use value instead of getValue (NFC)	2022-07-13 23:39:33 -07:00
Joseph Huber	b370be37cc	[CUDA] Allow the new driver to compile CUDA in non-RDC mode The new driver primarily allows us to support RDC-mode compilations with proper linking. This is not needed for non-RDC mode compilation, but we still would like the new driver to be able to handle this mode so we can transition away from the old driver in the future. This patch adds the necessary code to support creating a fatbinary for CUDA code generation as well as removing old assumptions and errors about RDC-mode with the new driver. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D129655	2022-07-13 21:49:15 -04:00
Ben Langmuir	3ce78cbd23	[clang][deps] Fix handling of -MT in module command-line Follow-up to `6626f6fec3`, this fixes the handling of -MT * If no targets are provided, we need to invent one since cc1 expects the driver to have handled it. The default is to use -o, quoting as necessary for a make target. * Fix the splitting for empty string, which was incorrectly treated as {""} instead of {}. * Add a way to test this behaviour in clang-scan-deps. Differential Revision: https://reviews.llvm.org/D129607	2022-07-13 13:36:15 -07:00
Joseph Huber	6abaa8e210	[CUDA][HIP][NFC] Do not add `-fgpu-rdc` flag twice Summary: We currently add the `-fgpu-rdc` flag twice. Once unconditionally for both the host and device phases of compilation, and a second time only for the host. When we moved to an unconditional addition of this flag it the old one was most likely not removed. This patch simply removes the redundant flag and changes no functionality.	2022-07-13 15:57:53 -04:00
Kai Nacke	880eb839e6	[SystemZ] Enable `-mtune=` option in clang. https://reviews.llvm.org/D128910 enabled handling of attribute "tune-cpu" in LLVM. This PR now enables option `-mtune` in clang, which then generates the new attribute. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D129562	2022-07-13 11:39:24 -04:00
Nick Desaulniers	2240d72f15	[X86] initial -mfunction-return=thunk-extern support Adds support for: * `-mfunction-return=<value>` command line flag, and * `__attribute__((function_return("<value>")))` function attribute Where the supported <value>s are: * keep (disable) * thunk-extern (enable) thunk-extern enables clang to change ret instructions into jmps to an external symbol named __x86_return_thunk, implemented as a new MachineFunctionPass named "x86-return-thunks", keyed off the new IR attribute fn_ret_thunk_extern. The symbol __x86_return_thunk is expected to be provided by the runtime the compiled code is linked against and is not defined by the compiler. Enabling this option alone doesn't provide mitigations without corresponding definitions of __x86_return_thunk! This new MachineFunctionPass is very similar to "x86-lvi-ret". The <value>s "thunk" and "thunk-inline" are currently unsupported. It's not clear yet that they are necessary: whether the thunk pattern they would emit is beneficial or used anywhere. Should the <value>s "thunk" and "thunk-inline" become necessary, x86-return-thunks could probably be merged into x86-retpoline-thunks which has pre-existing machinery for emitting thunks (which could be used to implement the <value> "thunk"). Has been found to build+boot with corresponding Linux kernel patches. This helps the Linux kernel mitigate RETBLEED. * CVE-2022-23816 * CVE-2022-28693 * CVE-2022-29901 See also: * "RETBLEED: Arbitrary Speculative Code Execution with Return Instructions." * AMD SECURITY NOTICE AMD-SN-1037: AMD CPU Branch Type Confusion * TECHNICAL GUIDANCE FOR MITIGATING BRANCH TYPE CONFUSION REVISION 1.0 2022-07-12 * Return Stack Buffer Underflow / Return Stack Buffer Underflow / CVE-2022-29901, CVE-2022-28693 / INTEL-SA-00702 SystemZ may eventually want to support "thunk-extern" and "thunk"; both options are used by the Linux kernel's CONFIG_EXPOLINE. This functionality has been available in GCC since the 8.1 release, and was backported to the 7.3 release. Many thanks for folks that provided discrete review off list due to the embargoed nature of this hardware vulnerability. Many Bothans died to bring us this information. Link: https://www.youtube.com/watch?v=IF6HbCKQHK8 Link: https://github.com/llvm/llvm-project/issues/54404 Link: https://gcc.gnu.org/legacy-ml/gcc-patches/2018-01/msg01197.html Link: https://www.intel.com/content/www/us/en/developer/articles/technical/software-security-guidance/advisory-guidance/return-stack-buffer-underflow.html Link: https://arstechnica.com/information-technology/2022/07/intel-and-amd-cpus-vulnerable-to-a-new-speculative-execution-attack/?comments=1 Link: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=ce114c866860aa9eae3f50974efc68241186ba60 Link: https://www.intel.com/content/www/us/en/security-center/advisory/intel-sa-00702.html Link: https://www.intel.com/content/www/us/en/security-center/advisory/intel-sa-00707.html Reviewed By: aaron.ballman, craig.topper Differential Revision: https://reviews.llvm.org/D129572	2022-07-12 09:17:54 -07:00
Xiang1 Zhang	a45dd3d814	[X86] Support -mstack-protector-guard-symbol Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D129346	2022-07-12 10:17:00 +08:00
Xiang1 Zhang	643786213b	Revert "[X86] Support -mstack-protector-guard-symbol" This reverts commit `efbaad1c4a`. due to miss adding review info.	2022-07-12 10:14:32 +08:00
Xiang1 Zhang	efbaad1c4a	[X86] Support -mstack-protector-guard-symbol	2022-07-12 10:13:48 +08:00
Petr Hosek	0ed8d82095	[Driver] Don't use frame pointer on Fuchsia when optimizations are enabled This matches the standard behavior on other platforms. Differential Revision: https://reviews.llvm.org/D129512	2022-07-11 21:45:42 +00:00
Joseph Huber	22a01b860b	[LinkerWrapper] Forward `-mllvm` options to the linker wrapper This patch adds the ability to use `-mllvm` options in the linker wrapper when performing bitcode linking or the module compilation. This is done by passing in the LLVM argument to the clang-linker-wrapper tool. Inside the linker-wrapper tool we invoke the `CommandLine` parser solely for forwarding command line options to the `clang-linker-wrapper` to the LLVM tools that also use the `CommandLine` parser. The actual arguments to the linker wrapper are parsed using the `Opt` library instead. For example, in the following command the `CommandLine` parser will attempt to parse `abc`, while the `opt` parser takes `-mllvm <arg>` and ignores it so it is not passed to the linker arguments. ``` clang-linker-wrapper -mllvm -abc -- <linker-args> ``` As far as I can tell this is the easiest way to forward arguments to LLVM tool invocations. If there is a better way to pass these arguments (such as through the LTO config) let me know. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D129424	2022-07-09 21:18:19 -04:00
Joseph Huber	24849c9eb5	[Clang] Fix the wrong features being derivec in the offload packager The offload packager embeds the features in the offloading binary when performing LTO. This had an incorrect interaction with the `--cuda-feature` option because we weren't deriving the features from the CUDA toolchain arguments when it was being specified. This patch fixes this so the features are correctly overrideen when using this argument. However, this brings up a question of how best to handle conflicting target features. The user could compile many libraries with different features, in this case we do not know which one to pick. This was not previously a problem when we simply passed the features in from the CUDA installation at link-link because we just defaulted to whatever was current on the system. Reviewed By: ye-luo Differential Revision: https://reviews.llvm.org/D129393	2022-07-08 17:26:20 -04:00
Cole Kissane	ea61750c35	[NFC] Refactor llvm::zlib namespace * Refactor compression namespaces across the project, making way for a possible introduction of alternatives to zlib compression. Changes are as follows: * Relocate the `llvm::zlib` namespace to `llvm::compression::zlib`. Reviewed By: MaskRay, leonardchan, phosek Differential Revision: https://reviews.llvm.org/D128953	2022-07-08 11:19:07 -07:00
Joseph Huber	74a8fce6e8	[LinkerWrapper] Fix save-temps and argument name Summary: The previous path reworked some handling of temporary files which exposed some bugs related to capturing local state by reference in the callback labmda. Squashing this by copying in everything instead. There was also a problem where the argument name was changed for `--bitcode-library=` but clang still used `--target-library=`.	2022-07-08 11:38:33 -04:00
Joseph Huber	d2ead9e324	[LinkerWrapper][NFC] Rework command line argument handling in the linker wrapper Summary: This patch reworks the command line argument handling in the linker wrapper from using the LLVM `cl` interface to using the `Option` interface with TableGen. This has several benefits compared to the old method. We use arguments from the linker arguments in the linker wrapper, such as the libraries and input files, this allows us to properly parse these. Additionally we can now easily set up aliases to the linker wrapper arguments and pass them in the linker input directly. That is, pass an option like `cuda-path=` as `--offload-arg=cuda-path=` in the linker's inputs. This will allow us to handle offloading compilation in the linker itself some day. Finally, this is also a much cleaner interface for passing arguments to the individual device linking jobs.	2022-07-08 11:18:38 -04:00
Argyrios Kyrtzidis	93d6fdfc23	[Driver] Ignore the clang modules validation-related flags if clang modules are not enabled If clang modules are not enabled it becomes unnecessary to read the session timestamp file in order to pass `-fbuild-session-timestamp` to the `cc1` invocation. Differential Revision: https://reviews.llvm.org/D129030	2022-07-03 21:26:15 -07:00
Vitaly Buka	cdfa15da94	Revert "[clang] Introduce -fstrict-flex-arrays=<n> for stricter handling of flexible arrays" This reverts D126864 and related fixes. This reverts commit `572b08790a`. This reverts commit `886715af96`.	2022-06-27 14:03:09 -07:00
Kazu Hirata	3b7c3a654c	Revert "Don't use Optional::hasValue (NFC)" This reverts commit `aa8feeefd3`.	2022-06-25 11:56:50 -07:00
Kazu Hirata	aa8feeefd3	Don't use Optional::hasValue (NFC)	2022-06-25 11:55:57 -07:00
David Blaikie	4821508d4d	Revert "DebugInfo: Fully integrate ctor type homing into 'limited' debug info" Reverting to simplify some Google-internal rollout issues. Will recommit in a week or two. This reverts commit `517bbc64db`.	2022-06-24 17:07:47 +00:00
Yaxun (Sam) Liu	8ad4c6e4b1	[HIP] add -fhip-kernel-arg-name Add option -fhip-kernel-arg-name to emit kernel argument name metadata, which is needed for certain HIP applications. Reviewed by: Artem Belevich, Fangrui Song, Brian Sumner Differential Revision: https://reviews.llvm.org/D128022	2022-06-24 11:15:36 -04:00
serge-sans-paille	886715af96	[clang] Introduce -fstrict-flex-arrays=<n> for stricter handling of flexible arrays Some code [0] consider that trailing arrays are flexible, whatever their size. Support for these legacy code has been introduced in `f8f6324983` but it prevents evaluation of __builtin_object_size and __builtin_dynamic_object_size in some legit cases. Introduce -fstrict-flex-arrays=<n> to have stricter conformance when it is desirable. n = 0: current behavior, any trailing array member is a flexible array. The default. n = 1: any trailing array member of undefined, 0 or 1 size is a flexible array member n = 2: any trailing array member of undefined or 0 size is a flexible array member n = 3: any trailing array member of undefined size is a flexible array member (strict c99 conformance) Similar patch for gcc discuss here: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101836 [0] https://docs.freebsd.org/en/books/developers-handbook/sockets/#sockets-essential-functions	2022-06-24 16:13:29 +02:00
David Blaikie	517bbc64db	DebugInfo: Fully integrate ctor type homing into 'limited' debug info Simplify debug info back to just "limited" or "full" by rolling the ctor type homing fully into the "limited" debug info. Also fix a bug I found along the way that was causing ctor type homing to kick in even when something could be vtable homed (where vtable homing is stronger/more effective than ctor homing) - fixing at the same time as it keeps the tests (that were testing only "limited non ctor" homing and now test ctor homing) passing.	2022-06-23 20:15:00 +00:00
Xiang Li	77f72ac15b	[HLSL] Enable half type for hlsl. HLSL supports half type. When enable-16bit-types is not set, half will be treated as float. When enable-16bit-types is set, half will be treated like real 16bit float type and map to llvm half type. Also change CXXABI to Microsoft to match dxc behavior. The mangle name for half is "$f16@" when half is treat as native half type and "$halff@" when treat as float. In AST, half is still half. The special thing is done at clang codeGen, when NativeHalfType is false, half will translated into float. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D124790	2022-06-23 12:56:26 -07:00
Joseph Huber	6e6889288c	[Offloading] Embed the target features in the OffloadBinary The target features are necessary for correctly compiling most programs in LTO mode. Currently, these are derived in clang at link time and passed as an arguemnt to the linker wrapper. This is problematic because it requires knowing the required toolchain at link time, which should not be necessry. Instead, these features should be embedded into the offloading binary so we can unify them in the linker wrapper for LTO. This also required changing the offload packager to interpret multiple arguments as concatenation with a comma. This is so we can still use the `,` separator for the argument list. Depends on D127246 Reviewed By: tra Differential Revision: https://reviews.llvm.org/D127686	2022-06-23 13:15:01 -04:00
Kazu Hirata	06decd0b41	[clang] Use value_or instead of getValueOr (NFC)	2022-06-18 23:21:34 -07:00
Kazu Hirata	c5935af058	[Toolchains] Use llvm::is_contained (NFC)	2022-06-18 15:57:50 -07:00
Fangrui Song	c418d0f1be	[Driver] Simplify -fno-builtin- handling. NFC	2022-06-15 13:35:08 -07:00
Balazs Benics	9da697e1bc	Reland "[analyzer] Deprecate the unused 'analyzer-opt-analyze-nested-blocks' cc1 flag" It was previously reverted by `8406839d19`. --- This flag was introduced by `6818991d71` commit `6818991d71` Author: Ted Kremenek <kremenek@apple.com> Date: Mon Dec 7 22:06:12 2009 +0000 Add clang-cc option '-analyzer-opt-analyze-nested-blocks' to treat block literals as an entry point for analyzer checks. The last reference was removed by this commit: `5c32dfc5fb` commit `5c32dfc5fb` Author: Anna Zaks <ganna@apple.com> Date: Fri Dec 21 01:19:15 2012 +0000 [analyzer] Add blocks and ObjC messages to the call graph. This paves the road for constructing a better function dependency graph. If we analyze a function before the functions it calls and inlines, there is more opportunity for optimization. Note, we add call edges to the called methods that correspond to function definitions (declarations with bodies). Consequently, we should remove this dead flag. However, this arises a couple of burning questions. - Should the `cc1` frontend still accept this flag - to keep tools/users passing this flag directly to `cc1` (which is unsupported, unadvertised) working. - If we should remain backward compatible, how long? - How can we get rid of deprecated and obsolete flags at some point? Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126067	2022-06-14 10:22:37 +02:00
Balazs Benics	ffe7950ebc	Reland "[analyzer] Deprecate `-analyzer-store region` flag" I'm trying to remove unused options from the `Analyses.def` file, then merge the rest of the useful options into the `AnalyzerOptions.def`. Then make sure one can set these by an `-analyzer-config XXX=YYY` style flag. Then surface the `-analyzer-config` to the `clang` frontend; After all of this, we can pursue the tablegen approach described https://discourse.llvm.org/t/rfc-tablegen-clang-static-analyzer-engine-options-for-better-documentation/61488 In this patch, I'm proposing flag deprecations. We should support deprecated analyzer flags for exactly one release. In this case I'm planning to drop this flag in `clang-16`. In the clang frontend, now we won't pass this option to the cc1 frontend, rather emit a warning diagnostic reminding the users about this deprecated flag, which will be turned into error in clang-16. Unfortunately, I had to remove all the tests referring to this flag, causing a mass change. I've also added a test for checking this warning. I've seen that `scan-build` also uses this flag, but I think we should remove that part only after we turn this into a hard error. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126215	2022-06-14 09:20:41 +02:00
David Tenty	6a8673038b	Reland [clang][AIX] add option mdefault-visibility-export-mapping The option mdefault-visibility-export-mapping is created to allow mapping default visibility to an explicit shared library export (e.g. dllexport). Exactly how and if this is manifested is target dependent (since it depends on how they map dllexport in the IR). Three values are provided for the option: * none: the default and behavior without the option, no additional export linkage information is created. * explicit: add the export for entities with explict default visibility from the source, including RTTI * all: add the export for all entities with default visibility This option is useful for targets which do not export symbols as part of their usual default linkage behaviour (e.g. AIX), such targets traditionally specified such information in external files (e.g. export lists), but this mapping allows them to use the visibility information typically used for this purpose on other (e.g. ELF) platforms. This relands commit: `8c8a2679a2` with fixes for the compile time and assert problems that were reported by: * making shouldMapVisibilityToDLLExport inline and provide an early return in the case where no mapping is in effect (aka non-AIX platforms) * don't try to export RTTI types which we will give internal linkage to Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D126340	2022-06-13 13:43:46 -04:00
Jan Svoboda	a85670001b	[clang][driver] Fix compilation database dump with multiple architectures Command lines with multiple `-arch` arguments expand into multiple entries in the compilation database. However, the file writes are not appending, meaning subsequent writes end up overwriting the previous ones, resulting in garbled output. This patch fixes that by always appending to the file. rdar://90165004 Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D121997	2022-06-13 13:30:57 +02:00
Jez Ng	d4bcb45db7	[MC][re-land] Omit DWARF unwind info if compact unwind is present where eligible This reverts commit `d941d59783`. Differential Revision: https://reviews.llvm.org/D122258	2022-06-12 17:24:19 -04:00
Jez Ng	d941d59783	Revert "[MC] Omit DWARF unwind info if compact unwind is present where eligible" This reverts commit `ef501bf85d`.	2022-06-12 10:47:08 -04:00
Jez Ng	ef501bf85d	[MC] Omit DWARF unwind info if compact unwind is present where eligible Previously, omitting unnecessary DWARF unwinds was only done in two cases: * For Darwin + aarch64, if no DWARF unwind info is needed for all the functions in a TU, then the `__eh_frame` section would be omitted entirely. If any one function needed DWARF unwind, then MC would emit DWARF unwind entries for all the functions in the TU. * For watchOS, MC would omit DWARF unwind on a per-function basis, as long as compact unwind was available for that function. This diff makes it so that we omit DWARF unwind on a per-function basis for Darwin + aarch64 as well. In addition, we introduce the flag `--emit-dwarf-unwind=` which can toggle between `always`, `no-compact-unwind` (only emit DWARF when CU cannot be emitted for a given function), and the target platform `default`. `no-compact-unwind` is particularly useful for newer x86_64 platforms: we don't want to omit DWARF unwind for x86_64 in general due to possible backwards compat issues, but we should make it possible for people to opt into this behavior if they are only targeting newer platforms. Motivation: I'm working on adding support for `__eh_frame` to LLD, but I'm concerned that we would suffer a perf hit. Processing compact unwind is already expensive, and that's a simpler format than EH frames. Given that MC currently produces one EH frame entry for every compact unwind entry, I don't think processing them will be cheap. I tried to do something clever on LLD's end to drop the unnecessary EH frames at parse time, but this made the code significantly more complex. So I'm looking at fixing this at the MC level instead. Addendum: It turns out that there was a latent bug in the X86 backend when `OmitDwarfIfHaveCompactUnwind` is naively enabled, which is not too surprising given that this combination has not been heretofore used. For functions that have unwind info that cannot be encoded with CU, MC would end up dropping both the compact unwind entry (OK; existing behavior) as well as the DWARF entries (not OK). This diff fixes things so that we emit the DWARF entry, as well as a CU entry with encoding `UNWIND_X86_MODE_DWARF` -- this basically tells the unwinder to look for the DWARF entry. I'm not 100% sure the `UNWIND_X86_MODE_DWARF` CU entry is necessary, this was the simplest fix. ld64 seems to be able to handle both the absence and presence of this CU entry. Ultimately ld64 (and LLD) will synthesize `UNWIND_X86_MODE_DWARF` if it is absent, so there is no impact to the final binary size. Reviewed By: davide, lhames Differential Revision: https://reviews.llvm.org/D122258	2022-06-12 10:03:56 -04:00
Nico Weber	8406839d19	Revert "[analyzer] Deprecate `-analyzer-store region` flag" This reverts commit `d50d9946d1`. Broke check-clang, see comments on https://reviews.llvm.org/D126067 Also revert dependent change "[analyzer] Deprecate the unused 'analyzer-opt-analyze-nested-blocks' cc1 flag" This reverts commit `07b4a6d046`. Also revert "[analyzer] Fix buildbots after introducing a new frontend warning" This reverts commit `90374df15d`. (See https://reviews.llvm.org/rG90374df15ddc58d823ca42326a76f58e748f20eb)	2022-06-10 08:50:13 -04:00
Balazs Benics	07b4a6d046	[analyzer] Deprecate the unused 'analyzer-opt-analyze-nested-blocks' cc1 flag This flag was introduced by `6818991d71` commit `6818991d71` Author: Ted Kremenek <kremenek@apple.com> Date: Mon Dec 7 22:06:12 2009 +0000 Add clang-cc option '-analyzer-opt-analyze-nested-blocks' to treat block literals as an entry point for analyzer checks. The last reference was removed by this commit: `5c32dfc5fb` commit `5c32dfc5fb` Author: Anna Zaks <ganna@apple.com> Date: Fri Dec 21 01:19:15 2012 +0000 [analyzer] Add blocks and ObjC messages to the call graph. This paves the road for constructing a better function dependency graph. If we analyze a function before the functions it calls and inlines, there is more opportunity for optimization. Note, we add call edges to the called methods that correspond to function definitions (declarations with bodies). Consequently, we should remove this dead flag. However, this arises a couple of burning questions. - Should the `cc1` frontend still accept this flag - to keep tools/users passing this flag directly to `cc1` (which is unsupported, unadvertised) working. - If we should remain backward compatible, how long? - How can we get rid of deprecated and obsolete flags at some point? Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126067	2022-06-10 13:09:37 +02:00
Balazs Benics	d50d9946d1	[analyzer] Deprecate `-analyzer-store region` flag I'm trying to remove unused options from the `Analyses.def` file, then merge the rest of the useful options into the `AnalyzerOptions.def`. Then make sure one can set these by an `-analyzer-config XXX=YYY` style flag. Then surface the `-analyzer-config` to the `clang` frontend; After all of this, we can pursue the tablegen approach described https://discourse.llvm.org/t/rfc-tablegen-clang-static-analyzer-engine-options-for-better-documentation/61488 In this patch, I'm proposing flag deprecations. We should support deprecated analyzer flags for exactly one release. In this case I'm planning to drop this flag in `clang-16`. In the clang frontend, now we won't pass this option to the cc1 frontend, rather emit a warning diagnostic reminding the users about this deprecated flag, which will be turned into error in clang-16. Unfortunately, I had to remove all the tests referring to this flag, causing a mass change. I've also added a test for checking this warning. I've seen that `scan-build` also uses this flag, but I think we should remove that part only after we turn this into a hard error. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126215	2022-06-10 12:57:15 +02:00
Alex Brachet	fac39d14b1	[clang] Allow CLANG_MODULE_CACHE_PATH env var to override module caching behavior CLANG_MODULE_CACHE_PATH can be used to change where clang should put the module cache, or can be set to "" to disable caching entirely. Differential revision: https://reviews.llvm.org/D126678	2022-06-09 16:55:37 +00:00
Pengxuan Zheng	e3a6784ac9	[clang-cl] Add support for /kernel MSVC defines _KERNEL_MODE when /kernel is passed. Also, /kernel disables RTTI and C++ exception handling. https://docs.microsoft.com/en-us/cpp/build/reference/kernel-create-kernel-mode-binary?view=msvc-170 Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D126719	2022-06-07 06:42:35 -07:00
Hans Wennborg	d42fe9aa84	Revert "[clang][AIX] add option mdefault-visibility-export-mapping" This caused assertions, see comment on the code review: llvm/clang/lib/AST/Decl.cpp:1510: clang::LinkageInfo clang::LinkageComputer::getLVForDecl(const clang::NamedDecl , clang::LVComputationKind): Assertion `D->getCachedLinkage() == LV.getLinkage()' failed. > The option mdefault-visibility-export-mapping is created to allow > mapping default visibility to an explicit shared library export > (e.g. dllexport). Exactly how and if this is manifested is target > dependent (since it depends on how they map dllexport in the IR). > > Three values are provided for the option: > > none: the default and behavior without the option, no additional export linkage information is created. > * explicit: add the export for entities with explict default visibility from the source, including RTTI > * all: add the export for all entities with default visibility > > This option is useful for targets which do not export symbols as part of > their usual default linkage behaviour (e.g. AIX), such targets > traditionally specified such information in external files (e.g. export > lists), but this mapping allows them to use the visibility information > typically used for this purpose on other (e.g. ELF) platforms. > > Reviewed By: MaskRay > > Differential Revision: https://reviews.llvm.org/D126340 This reverts commit `8c8a2679a2`.	2022-06-02 15:09:39 +02:00
David Tenty	8c8a2679a2	[clang][AIX] add option mdefault-visibility-export-mapping The option mdefault-visibility-export-mapping is created to allow mapping default visibility to an explicit shared library export (e.g. dllexport). Exactly how and if this is manifested is target dependent (since it depends on how they map dllexport in the IR). Three values are provided for the option: * none: the default and behavior without the option, no additional export linkage information is created. * explicit: add the export for entities with explict default visibility from the source, including RTTI * all: add the export for all entities with default visibility This option is useful for targets which do not export symbols as part of their usual default linkage behaviour (e.g. AIX), such targets traditionally specified such information in external files (e.g. export lists), but this mapping allows them to use the visibility information typically used for this purpose on other (e.g. ELF) platforms. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D126340	2022-06-01 18:07:17 -04:00
Adrian Prantl	d951ca5439	Revert "[Driver] Enable to use C++20 standalne by -fcxx-modules" This reverts commit `a544710cd4`. See discussion in D120540. This breaks C++ Clang modules on Darwin and also more than a dozen tests in the LLDB testsuite. I think we need to be more careful to separate out the enabling of Clang C++ modules and C++20 modules. Either by having -fmodules-ts control the HaveModules flag, or by adding a way to explicitly turn them off.	2022-06-01 12:11:57 -07:00
Xiang Li	d3e4727907	[HLSL] add -D option for dxc mode. Create dxc_D as alias to option D which Define <macro> to <value> (or 1 if <value> omitted). Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D125338	2022-05-31 17:50:36 -07:00
Xiang Li	13e1a65327	[HLSL] Enable vector types for hlsl. Vector types in hlsl is using clang ext_vector_type. Declaration of vector types is in builtin header hlsl.h. hlsl.h will be included by default for hlsl shader. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D125052	2022-05-31 13:54:17 -07:00
Nico Weber	c4eb8035ed	Revert "[HLSL] Enable vector types for hlsl." This reverts commit `e576280380`. Breaks tests on mac/arm, see comment on https://reviews.llvm.org/D125052 Also revert follow-up "[gn build] Port e576280380d3" This reverts commit `1e01b1ec72`.	2022-05-30 14:11:07 -04:00
Xiang Li	e576280380	[HLSL] Enable vector types for hlsl. Vector types in hlsl is using clang ext_vector_type. Declaration of vector types is in builtin header hlsl.h. hlsl.h will be included by default for hlsl shader. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D125052	2022-05-30 09:05:29 -07:00
Chuanqi Xu	a544710cd4	[Driver] Enable to use C++20 standalne by -fcxx-modules This patch allows user to use C++20 module by -fcxx-modules. Previously, we could only use it under -std=c++20. Given that user could use C++20 coroutine standalonel by -fcoroutines-ts. It makes sense to offer an option to use C++20 modules without enabling C++20. Reviewed By: iains, MaskRay Differential Revision: https://reviews.llvm.org/D120540	2022-05-30 14:19:56 +08:00
Chuanqi Xu	42c3c70a9e	Revert "[Driver] Enable to use C++20 standalne by -fcxx-modules" This reverts commit `99eca83538`. Since it would cause clang-tools-extra fail.	2022-05-30 10:43:13 +08:00
Chuanqi Xu	99eca83538	[Driver] Enable to use C++20 standalne by -fcxx-modules This patch allows user to use C++20 module by -fcxx-modules. Previously, we could only use it under -std=c++20. Given that user could use C++20 coroutine standalonel by -fcoroutines-ts. It makes sense to offer an option to use C++20 modules without enabling C++20. Reviewed By: iains, MaskRay Differential Revision: https://reviews.llvm.org/D120540	2022-05-30 10:24:09 +08:00
Fangrui Song	3b4500014a	[Driver] Replace err_invalid_branch_protection with err_drv_unsupported_option_argument The convention is to use err_drv_unsupported_option_argument instead of adding a new diagnostic for every option.	2022-05-27 22:28:39 -07:00
Joseph Huber	f37101983f	[OpenMP] Add `-Xoffload-linker` to forward input to the device linker We use the clang-linker-wrapper to perform device linking of embedded offloading object files. This is done by generating those jobs inside of the linker-wrapper itself. This patch adds an argument in Clang and the linker-wrapper that allows users to forward input to the device linking phase. This can either be done for every device linker, or for a specific target triple. We use the `-Xoffload-linker <arg>` and the `-Xoffload-linker-<triple> <arg>` syntax to accomplish this. Reviewed By: markdewing, tra Differential Revision: https://reviews.llvm.org/D126226	2022-05-24 09:11:02 -04:00
Anastasia Stulova	d61ded1034	[OpenCL] Make -cl-ext a driver option. For generic targets such as SPIR-V clang sets all OpenCL extensions/features as supported by default. However concrete targets are unlikely to support all extensions features, which creates a problem when such generic SPIR-V binary is compiled for a specific target later on. To allow compile time diagnostics for unsupported features this flag is now being exposed in the clang driver. Differential Revision: https://reviews.llvm.org/D125243	2022-05-24 11:34:19 +01:00
Bill Wendling	6e00a34cdb	[AArch64] Add support for -fzero-call-used-regs Support the "-fzero-call-used-regs" option on AArch64. This involves much less specialized code than the X86 version. Most of the checks can be done with TableGen. Reviewed By: nickdesaulniers, MaskRay Differential Revision: https://reviews.llvm.org/D124836	2022-05-19 16:58:28 -07:00
Paul Walker	d640442518	[NFC] Fix a couple of whitespace issues.	2022-05-19 17:27:09 +00:00
Pengxuan Zheng	366e57de23	[clang-cl] Add /Zc:wchar_t- option Map /Zc:wchar_t- to the cc1 flag -fno-wchar which is already supported. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D125513	2022-05-17 09:40:30 -07:00
Egor Zhdan	2f04e703bf	[Clang] Add DriverKit support This is the second patch that upstreams the support for Apple's DriverKit. The first patch: https://reviews.llvm.org/D118046. Differential Revision: https://reviews.llvm.org/D121911	2022-05-13 20:34:57 +01:00
Joseph Huber	4638ae3a85	[OpenMP] Use the new OpenMP device static library when doing LTO The previous patches allowed us to create a static library containing all the device code. This patch uses that library to perform the device runtime linking late when performing LTO. This in addition to simplifying the libraries, allows us to transparently handle the runtime library as-needed without needing Clang to manually pass the necessary library in the linker wrapper job. Depends on D125315 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D125333	2022-05-13 14:38:55 -04:00
Joseph Huber	af757f8980	[OpenMP] Don't set device runtime debugging flags if using '-nogpulib' We use globals to configure debugging at compile-time for the device runtime. Because these are only used by the OpenMP runtime we shouldn't define them if we aren't using the device runtime. When a user passes in '-nogpulib' this indicates that we are not using the device runtime, so we should check for the precense of this flag and not emit these globals if used. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D125314	2022-05-13 14:38:43 -04:00
Joseph Huber	5189f634a1	[OpenMP] Don't include the device wrappers if -nostdinc is used OpenMP uses several wrapper hearders to provide the definitions of needed symbols contained in the host. However, some users may use the `-nostdinc` option to override these definitions themselves. The OpenMP wrapper headers are stored in the same location as the clang install. If the user passes `-nostdinc` then this include directory is never looked at by default which means that including these wrappers will always fail. These headers should instead be included manually if they are needed with a `-nostdinc` build. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D125265	2022-05-13 14:38:39 -04:00
Alan Zhao	6398f3f2e9	[clang] Add the flag -ffile-reproducible When Clang generates the path prefix (i.e. the path of the directory where the file is) when generating FILE, __builtin_FILE(), and std::source_location, Clang uses the platform-specific path separator character of the build environment where Clang _itself_ is built. This leads to inconsistencies in Chrome builds where Clang running on non-Windows environments uses the forward slash (/) path separator while Clang running on Windows builds uses the backslash (\) path separator. To fix this, we add a flag -ffile-reproducible (and its inverse, -fno-file-reproducible) to have Clang use the target's platform-specific file separator character. Additionally, the existing flags -fmacro-prefix-map and -ffile-prefix-map now both imply -ffile-reproducible. This can be overriden by setting -fno-file-reproducible. [0]: https://crbug.com/1310767 Differential revision: https://reviews.llvm.org/D122766	2022-05-11 23:04:36 +02:00
Xiang Li	6055837f6d	Revert "[HLSL] add -D option for dxc mode." This reverts commit `4dae38ebfb`. Differential Revision: https://reviews.llvm.org/D125414	2022-05-11 13:57:47 -07:00
Xiang Li	4dae38ebfb	[HLSL] add -D option for dxc mode. Create dxc_D as alias to option D which Define <macro> to <value> (or 1 if <value> omitted). Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D125338	2022-05-11 11:26:31 -07:00
Joseph Huber	26eb04268f	[Clang] Introduce clang-offload-packager tool to bundle device files In order to do offloading compilation we need to embed files into the host and create fatbainaries. Clang uses a special binary format to bundle several files along with their metadata into a single binary image. This is currently performed using the `-fembed-offload-binary` option. However this is not very extensibile since it requires changing the command flag every time we want to add something and makes optional arguments difficult. This patch introduces a new tool called `clang-offload-packager` that behaves similarly to CUDA's `fatbinary`. This tool takes several input files with metadata and embeds it into a single image that can then be embedded in the host. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D125165	2022-05-11 09:39:13 -04:00
Joseph Huber	0035f7154c	[CUDA] Create offloading entries when using the new driver The changes made in D123460 generalized the code generation for OpenMP's offloading entries. We can use the same scheme to register globals for CUDA code. This patch adds the code generation to create these offloading entries when compiling using the new offloading driver mode. The offloading entries are simple structs that contain the information necessary to register the global. The struct used is as follows: ``` Type struct __tgt_offload_entry { void addr; // Pointer to the offload entry info. // (function or global) char name; // Name of the function or global. size_t size; // Size of the entry info (0 if it a function). int32_t flags; int32_t reserved; }; ``` Currently CUDA handles RDC code generation by deferring the registration of globals in the current TU to a callback function containing the modules ID. Later all the module IDs will be used to register all of the globals at once. Rather than mimic this, offloading entries allow us to mimic the way OpenMP registers globals. That is, we create a simple global struct for each device global to be registered. These are placed at a special section `cuda_offloading_entires`. Because this section is a valid C-identifier, the linker will profide a `__start` and `__stop` pointer that we can use to iterate and register all globals at runtime. the registration requires a flag variable to indicate which registration function to use. I have assigned the flags somewhat arbitrarily, but these use the following values. Kernel: 0 Variable: 0 Managed: 1 Surface: 2 Texture: 3 Depends on D120272 Reviewed By: tra Differential Revision: https://reviews.llvm.org/D123471	2022-05-11 07:30:21 -04:00
Joseph Huber	045665423e	[OpenMP] Fix embedding offload code when there is no offloading toolchain Summary: We use the `--offload-new-driver` option to enable offload code embedding. The check for when to do this was flawed and was enabling it too early in the case of OpenMP, causing a segfault when dereferencing the offloading toolchain.	2022-05-10 13:33:20 -04:00
Joseph Huber	509b631f84	[OpenMP] Try to Infer target triples using the offloading architecture Currently we require the `-fopenmp-targets=` option to specify the triple to use for the offloading toolchains, and the `-Xopenmp-target=` option to specify architectures to a specific toolchain. The changes made in D124721 allowed us to use `--offload-arch=` to specify multiple target architectures. However, this can become combersome with many different architectures. This patch introduces functinality that attempts to deduce the target triple and architectures from the offloading action. Currently we will deduce known GPU architectures when only `-fopenmp` is specified. This required a bit of a hack to cache the deduced architectures, without this we would've just thrown an error when we tried to look up the architecture again when generating the job. Normally we require the user to manually specify the toolchain arguments, but here they would confict unless we overrode them. Depends on: D124721 Reviewed By: saiislam Differential Revision: https://reviews.llvm.org/D125050	2022-05-06 16:57:18 -04:00
python3kgae	3fa5eb4cfc	[HLSL] add -fcgl option flag. fcgl option will make compilation stop after clang codeGen and output the llvm ir. It is added to check clang codeGen output for HLSL. It will be translated into -S -emit-llvm and -disable-llvm-passes. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D124983	2022-05-06 11:42:15 -07:00
Sam McCall	04b4190489	[Driver] Make "upgrade" of -include to include-pch optional; disable in clangd If clang is passed "-include foo.h", it will rewrite to "-include-pch foo.h.pch" before passing it to cc1, if foo.h.pch exists. Existence is checked, but validity is not. This is probably a reasonable assumption for the compiler itself, but not for clang-based tools where the actual compiler may be a different version of clang, or even GCC. In the end, we lose our -include, we gain a -include-pch that can't be used, and the file often fails to parse. I would like to turn this off for all non-clang invocations (i.e. createInvocationFromCommandLine), but we have explicit tests of this behavior for libclang and I can't work out the implications of changing it. Instead this patch: - makes it optional in the driver, default on (no change) - makes it optional in createInvocationFromCommandLine, default on (no change) - changes driver to do IO through the VFS so it can be tested - tests the option - turns the option off in clangd where the problem was reported Subsequent patches should make libclang opt in explicitly and flip the default for all other tools. It's probably also time to extract an options struct for createInvocationFromCommandLine. Fixes https://github.com/clangd/clangd/issues/856 Fixes https://github.com/clangd/vscode-clangd/issues/324 Differential Revision: https://reviews.llvm.org/D124970	2022-05-05 16:47:17 +02:00
Aaron Ballman	786954db06	Accept -fno-knr-functions as a driver flag as well Due to a think-o, it was only being accepted as a -cc1 flag. This adds the proper forwarding from the driver to the frontend and adds test coverage for the option.	2022-04-30 13:37:55 -04:00
python3kgae	73417c5176	[HLSL][clang][Driver] Support validator version command line option. The DXIL validator version option(/validator-version) decide the validator version when compile hlsl. The format is major.minor like 1.0. In normal case, the value of validator version should be got from DXIL validator. Before we got DXIL validator ready for llvm/main, DXIL validator version option is added first to set validator version. It will affect code generation for DXIL, so it is treated as a code gen option. A new member std::string DxilValidatorVersion is added to clang::CodeGenOptions. Then CGHLSLRuntime is added to clang::CodeGenModule. It is used to translate clang::CodeGenOptions::DxilValidatorVersion into a ModuleFlag under key "dx.valver" at end of clang code generation. Reviewed By: beanz Differential Revision: https://reviews.llvm.org/D123884	2022-04-29 16:48:08 -07:00
David Candler	9e7c9967c3	Additionally set f32 mode with denormal-fp-math When the denormal-fp-math option is used, this should set the denormal handling mode for all floating point types. However, currently 32-bit float types can ignore this setting as there is a variant of the option, denormal-fp-math-f32, specifically for that type which takes priority when checking the mode based on type and remains at the default of IEEE. From the description, denormal-fp-math would be expected to set the mode for floats unless overridden by the f32 variant, and code in the front end only emits the f32 option if it is different to the general one, so setting just denormal-fp-math should be valid. This patch changes the denormal-fp-math option to also set the f32 mode. If denormal-fp-math-f32 is also specified, this is then overridden as expected, but if it is absent floats will be set to the mode specified by the former option, rather than remain on the default. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D122589	2022-04-29 15:06:32 +01:00
Joseph Huber	c5e5b54350	[CUDA] Add driver support for compiling CUDA with the new driver This patch adds the basic support for the clang driver to compile and link CUDA using the new offloading driver. This requires handling the CUDA offloading kind and embedding the generated files into the host. This will allow us to link OpenMP code with CUDA code in the linker wrapper. More support will be required to create functional CUDA / HIP binaries using this method. Depends on D120270 D120271 D120934 Reviewed By: tra Differential Revision: https://reviews.llvm.org/D120272	2022-04-29 09:14:44 -04:00
Joseph Huber	4e2b5a6693	[Clang] Make enabling the new driver more generic In preparation for allowing other offloading kinds to use the new driver a new opt-in flag `-foffload-new-driver` is added. This is distinct from the existing `-fopenmp-new-driver` because OpenMP will soon use the new driver by default while the others should not. Reviewed By: yaxunl, tra Differential Revision: https://reviews.llvm.org/D123325	2022-04-29 09:14:39 -04:00
Joseph Huber	ca6bbe0085	[OpenMP] Make clang argument handling for the new driver more generic In preparation for accepting other offloading kinds with the new driver, this patch makes the way we handle offloading actions more generic. A new field to get the associated device action's toolchain is used rather than manually iterating a list. This makes building the arguments easier and makes sure that we doin't rely on any implicit ordering. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D123313	2022-04-29 09:14:35 -04:00
Paul Robinson	17f3a92ee5	[PS5] Default to omit leaf frame pointer	2022-04-28 10:35:14 -07:00
Paul Robinson	db013b28c3	[PS5] Defaults for -fdeclspec, -ffunction/data-sections	2022-04-28 10:03:46 -07:00
Craig Topper	bd30d4be23	[Driver] Add f16 support to -mrecip parsing. This is a followup to D120158 which added an 'h' suffix to the backend handling. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D124551	2022-04-28 08:33:52 -07:00
Paul Robinson	451c5578b8	[PS5] Set default cpu to znver2, with no tuning	2022-04-28 07:50:37 -07:00
Paul Robinson	8854d1244c	[PS5] Disable exceptions by default	2022-04-28 07:16:16 -07:00
Joseph Huber	0a4a43cf13	[OpenMP] Properly handle the `-f/-fno` veraints for the new driver Summary: We provide the `-f(no-)openmp-new-driver` option to allow users to use the old or new driver. Previously this wasn't handled in the expected way and only `-fno-openmp-new-driver` was checked. This patch fixes that by using the `hasFlag` method as is standard.	2022-04-27 09:58:55 -04:00
Paul Robinson	b375882014	[PS5] Correctly test for analyzer defaults Commit `5c90eca` added some analyzer option checking, but a typo meant it was redundantly checking PS4 and not adding checking for PS5. With the test corrected, it identified the necessary driver updates, added in this commit.	2022-04-26 07:14:48 -07:00
Iain Sandoe	9dc9b21488	[C++20][Modules][Driver][HU 4/N] Add fdirectives-only mode for preprocessing output. When the -fdirectives-only option is used together with -E, the preprocessor output reflects evaluation of if/then/else directives. Thus it preserves macros that are still live after such processing. This output can be consumed by a second compilation to produce a header unit. We automatically invoke this (with -E) when we know that the job produces a header unit so that the preprocessed output reflects the macros that will be defined when the binary HU is emitted. Differential Revision: https://reviews.llvm.org/D121591	2022-04-23 14:43:07 +01:00
Byoungchan Lee	f69328049e	[cc1as] Add support for emitting the build version load command for -darwin-target-variant This patch extends cc1as to export the build version load command with LC_VERSION_MIN_MACOSX. This is especially important for Mac Catalyst as Mac Catalyst uses the MacOS's compiler rt built-ins. Differential Revision: https://reviews.llvm.org/D121868	2022-04-22 08:38:07 -04:00
Iain Sandoe	5c6a146498	[C++20][Modules][Driver][HU 1/N] Initial handling for -xc++-{system,user}-header. This adds file types and handling for three input types, representing a C++20 header unit source: 1. When provided with a complete pathname for the header. 2. For a header to be looked up (by the frontend) in the user search paths 3. For a header to be looked up in the system search paths. We also add a pre-processed file type (although that is a single type, regardless of the original input type). These types may be specified with -xc++-{user,system,header-unit}-header xxxx. These types allow us to disambiguate header unit jobs from PCH ones, and thus we handle these differently from other header jobs in two ways: 1. The job construction is altered to build a C++20 header unit (rather than a PCH file, as would be the case for other headers). 2. When the type is "user" or "system" we defer checking for the file until the front end is run, since we need to look up the header in the relevant paths which are not known at this point. Differential Revision: https://reviews.llvm.org/D121588	2022-04-22 09:24:29 +01:00
Joseph Huber	ae23be84cb	[OpenMP] Make the new offloading driver the default Previously an opt-in flag `-fopenmp-new-driver` was used to enable the new offloading driver. After passing tests for a few months it should be sufficiently mature to flip the switch and make it the default. The new offloading driver is now enabled if there is OpenMP and OpenMP offloading present and the new `-fno-openmp-new-driver` is not present. The new offloading driver has three main benefits over the old method: - Static library support - Device-side LTO - Unified clang driver stages Depends on D122683 Differential Revision: https://reviews.llvm.org/D122831	2022-04-18 15:05:09 -04:00
Joseph Huber	984a0dc386	[OpenMP] Use new offloading binary when embedding offloading images The previous patch introduced the offloading binary format so we can store some metada along with the binary image. This patch introduces using this inside the linker wrapper and Clang instead of the previous method that embedded the metadata in the section name. Differential Revision: https://reviews.llvm.org/D122683	2022-04-15 20:35:26 -04:00
Paul Robinson	7726ad04e2	[PS5] Add basic PS5 driver behavior This adds a PS5-specific ToolChain subclass, which defines some basic PS5 driver behavior. Future patches will add more target-specific driver behavior.	2022-04-14 12:45:33 -07:00
Fangrui Song	bfafa105aa	[Driver] Simplify some hasFlag patterns with addOptInFlag/addOptOutFlag. NFC	2022-04-13 22:00:44 -07:00
Fangrui Song	ab8abeaf48	[Driver] Change CLANG_ENABLE_OPAQUE_POINTERS_INTERNAL to affect driver default instead of cc1 default The current cc1 CLANG_ENABLE_OPAQUE_POINTERS=on default difference is not ideal in that people contribute %clang_cc1 tests may assume the default ON behavior, which will cause failures on systems set to OFF. cc1 option default dependent on CMake options should be used prudently (generally avoided). We prefer to limit target differences to Driver. Change the CLANG_ENABLE_OPAQUE_POINTERS_INTERNAL mechanism introduced in D123122 to use a driver default instead. This is similar to the mechanism used for the -flegacy-pass-manager transition to new PM transition. Reviewed By: #opaque-pointers, rsmith, aeubanks Differential Revision: https://reviews.llvm.org/D123744	2022-04-13 16:58:00 -07:00
Nikita Popov	2978d02681	[Clang] Remove support for legacy pass manager This removes the -flegacy-pass-manager and -fno-experimental-new-pass-manager options, and the corresponding support code in BackendUtil. The -fno-legacy-pass-manager and -fexperimental-new-pass-manager options are retained as no-ops. Differential Revision: https://reviews.llvm.org/D123609	2022-04-13 10:21:42 +02:00
Fangrui Song	fe02896a79	[Driver] -fno-optimize-sibling-calls: use the same spelling for its -cc1 counterpart And remove a -no-opaque-pointers	2022-04-11 22:21:24 -07:00
Fangrui Song	aefa4b60ce	[Driver] Simplify hasFlag pattern with addOptInFlag/addOptOutFlag helpers Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D123468	2022-04-11 12:29:25 -07:00
Fangrui Song	8e1530ba43	[Driver] Simplify OPT_fcolor_diagnostics claim Mostly NFC, but the diagnostic is changed to the more appropriate err_drv_invalid_argument_to_option.	2022-04-10 01:21:31 -07:00
Fangrui Song	30b1c1f23d	[Driver] Simplify -f[no-]diagnostics-color handling. NFC Make them aliases for -f[no-]color-diagnostics.	2022-04-10 01:07:44 -07:00
Fangrui Song	ee7fb36ba0	[Driver] Fix -f[no-]inline to override -f[no-]inline-functions/-finline-hint-functions Fix two cases to match GCC: * -fno-inline -finline => (no cc1 option) * -fno-inline -finline-functions => -fno-inline	2022-04-10 00:15:12 -07:00
Connor Kuehl	7aa8c38a9e	[randstruct] Add randomize structure layout support The Randstruct feature is a compile-time hardening technique that randomizes the field layout for designated structures of a code base. Admittedly, this is mostly useful for closed-source releases of code, since the randomization seed would need to be available for public and open source applications. Why implement it? This patch set enhances Clang’s feature parity with that of GCC which already has the Randstruct feature. It's used by the Linux kernel in certain structures to help thwart attacks that depend on structure layouts in memory. This patch set is a from-scratch reimplementation of the Randstruct feature that was originally ported to GCC. The patches for the GCC implementation can be found here: https://www.openwall.com/lists/kernel-hardening/2017/04/06/14 Link: https://lists.llvm.org/pipermail/cfe-dev/2019-March/061607.html Co-authored-by: Cole Nixon <nixontcole@gmail.com> Co-authored-by: Connor Kuehl <cipkuehl@gmail.com> Co-authored-by: James Foster <jafosterja@gmail.com> Co-authored-by: Jeff Takahashi <jeffrey.takahashi@gmail.com> Co-authored-by: Jordan Cantrell <jordan.cantrell@mail.com> Co-authored-by: Nikk Forbus <nicholas.forbus@gmail.com> Co-authored-by: Tim Pugh <nwtpugh@gmail.com> Co-authored-by: Bill Wendling <isanbard@gmail.com> Signed-off-by: Bill Wendling <isanbard@gmail.com> Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D121556	2022-04-09 13:15:36 -07:00
Fangrui Song	a58d0af058	Revert D121556 "[randstruct] Add randomize structure layout support" This reverts commit `3f0587d0c6`. Not all tests pass after a few rounds of fixes. I spot one failure that std::shuffle (potentially different results with different STL implementations) was misused and replaced it with llvm::shuffle, but there appears to be another failure in a Windows build. The latest failure is reported on https://reviews.llvm.org/D121556#3440383	2022-04-08 18:37:26 -07:00
Connor Kuehl	3f0587d0c6	[randstruct] Add randomize structure layout support The Randstruct feature is a compile-time hardening technique that randomizes the field layout for designated structures of a code base. Admittedly, this is mostly useful for closed-source releases of code, since the randomization seed would need to be available for public and open source applications. Why implement it? This patch set enhances Clang’s feature parity with that of GCC which already has the Randstruct feature. It's used by the Linux kernel in certain structures to help thwart attacks that depend on structure layouts in memory. This patch set is a from-scratch reimplementation of the Randstruct feature that was originally ported to GCC. The patches for the GCC implementation can be found here: https://www.openwall.com/lists/kernel-hardening/2017/04/06/14 Link: https://lists.llvm.org/pipermail/cfe-dev/2019-March/061607.html Co-authored-by: Cole Nixon <nixontcole@gmail.com> Co-authored-by: Connor Kuehl <cipkuehl@gmail.com> Co-authored-by: James Foster <jafosterja@gmail.com> Co-authored-by: Jeff Takahashi <jeffrey.takahashi@gmail.com> Co-authored-by: Jordan Cantrell <jordan.cantrell@mail.com> Co-authored-by: Nikk Forbus <nicholas.forbus@gmail.com> Co-authored-by: Tim Pugh <nwtpugh@gmail.com> Co-authored-by: Bill Wendling <isanbard@gmail.com> Signed-off-by: Bill Wendling <isanbard@gmail.com> Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D121556	2022-04-08 12:48:30 -07:00
Zi Xuan Wu	97e496054a	[Clang][CSKY] Add the CSKY target and compiler driver Add CSKY target toolchains to support csky in linux and elf environment. It can leverage the basic universal Linux toolchain for linux environment, and only add some compile or link parameters. For elf environment, add a CSKYToolChain to support compile and link. Also add some parameters into basic codebase of clang driver. Differential Revision: https://reviews.llvm.org/D121445	2022-04-06 11:37:37 +08:00
Yaxun (Sam) Liu	09a5eae0d1	[clang-offload-bundler] add -input/-output options Currently, clang-offload-bundler has -inputs and -outputs options that accept values with comma as the delimiter. This causes issues with file paths containing commas, which are valid file paths on Linux. This add two new options -input and -output, which accept one single file, and allow multiple instances. This allows arbitrary file paths. The old -inputs and -outputs options will be kept for backward compatibility, but are not allowed to be used with -input and -output options for simplicity. In the future, -inputs and -outputs options will be phasing out. RFC: https://discourse.llvm.org/t/rfc-adding-input-and-output-options-to-clang-offload-bundler/60049 Patch by: Siu Chi Chan Reviewed by: Yaxun Liu Differential Revision: https://reviews.llvm.org/D120662	2022-04-05 11:13:01 -04:00
Fangrui Song	85bd90cb71	[Driver] Move legacy -f[no-]unit-at-a-time to clang_ignored_gcc_optimization_f_Group Move to clang_ignored_gcc_optimization_f_Group like other ignored options. This decreases code size a bit: ~400 bytes on x86-64.	2022-03-30 23:20:49 -07:00
Fangrui Song	c37accf0a2	[Option] Avoid using the default argument for the 3-argument hasFlag. NFC The default argument true is error-prone: I think many would think the default is false.	2022-03-26 00:57:06 -07:00
Paul Robinson	6aa0397758	Remove dead code in driver parsing -gsimple-template-names= options While -g[no-]simple-template-names is a driver option, the fancier -gsimple-template-names={simple,mangled} option is cc1-only, so code to handle it in the driver is dead. Differential Revision: https://reviews.llvm.org/D122503	2022-03-25 13:23:24 -07:00
Daniel Grumberg	5ef2ec7e4e	[clang][extract-api] Suppprt for the module name property in SymbolGraph Adds `--product-name=` flag to the clang driver. This gets forwarded to cc1 only when we are performing a ExtractAPI Action. This is used to populate the `name` field of the module object in the generated SymbolGraph. Differential Revision: https://reviews.llvm.org/D122141	2022-03-23 16:34:08 +00:00
Daniel Grumberg	edbb99a7ed	Ensure -extract-api handles multiple headers correctly clang -extract-api should accept multiple headers and forward them to a single CC1 instance. This change introduces a new ExtractAPIJobAction. Currently API Extraction is done during the Precompile phase as this is the current phase that matches the requirements the most. Adding a new phase would need to change some logic in how phases are scheduled. If the headers scheduled for API extraction are of different types the driver emits a diagnostic. Differential Revision: https://reviews.llvm.org/D121936	2022-03-21 21:04:47 +00:00
Joseph Huber	a3248e4b28	[CUDA] Add getTargetFeatures for the NVPTX toolchain The NVPTX toolchain uses target features to determine the PTX version to use. However this isn't exposed externally like most other toolchain specific target features are. Add this functionaliy in preparation for using it in for OpenMP offloading. Reviewed By: jdoerfert, tra Differential Revision: https://reviews.llvm.org/D122089	2022-03-21 16:32:36 -04:00
Joseph Huber	670438e55d	[OpenMP][Fix] Add offloading kind to AMDGPU libraries Summary: A previous patch added the offloading kind to the triple format we used. I forgot to update the line where we add the AMDGPU libraries.	2022-03-14 21:18:19 -04:00
Joseph Huber	9f89769cd7	[Clang] Add offload kind to embedded offload object This patch adds the offload kind to the embedded section name in preparation for offloading to different kinda like CUDA or HIP. Depends on D120288 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D120271	2022-03-14 20:08:27 -04:00
Yuanfang Chen	eddd94c27d	Reland "[clang][debug] port clang-cl /JMC flag to ELF" This relands commit `7313474319`. It failed on Windows/Mac because `-fjmc` is only checked for ELF targets. Check the flag unconditionally instead and issue a warning for non-ELF targets.	2022-03-07 21:55:41 -08:00
Yuanfang Chen	f46fa4de4a	Revert "[clang][debug] port clang-cl /JMC flag to ELF" This reverts commit `7313474319`. Break bots: http://45.33.8.238/win/54551/step_7.txt http://45.33.8.238/macm1/29590/step_7.txt	2022-03-07 12:40:43 -08:00
Yuanfang Chen	7313474319	[clang][debug] port clang-cl /JMC flag to ELF The motivation is to enable the MSVC-style JMC instrumentation usable by a ELF-based debugger. Since there is no prior experience implementing JMC feature for ELF-based debugger, it might be better to just reuse existing MSVC-style JMC instrumentation. For debuggers that support both ELF&COFF (like lldb), the JMC implementation might be shared between ELF&COFF. If this is found to inadequate, it is pretty low-cost switching to alternatives. Implementation: - The '-fjmc' is already a driver and cc1 flag. Wire it up for ELF in the driver. - Refactor the JMC instrumentation pass a little bit. - The ELF handling is different from MSVC in two places: * the flag section name is ".just.my.code" instead of ".msvcjmc" * the way default function is provided: MSVC uses /alternatename; ELF uses weak function. Based on D118428. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D119910	2022-03-07 10:16:24 -08:00
Paul Robinson	7b85f0f32f	[PS4] isPS4 and isPS4CPU are not meaningfully different	2022-03-03 11:36:59 -05:00
Egor Zhdan	3cdc1c155b	[Clang] Add `-funstable` flag to enable unstable and experimental features This new flag enables `__has_feature(cxx_unstable)` that would replace libc++ macros for individual unstable/experimental features, e.g. `_LIBCPP_HAS_NO_INCOMPLETE_RANGES` or `_LIBCPP_HAS_NO_INCOMPLETE_FORMAT`. This would make it easier and more convenient to opt-in into all libc++ unstable features at once. Differential Revision: https://reviews.llvm.org/D120160	2022-03-01 12:35:20 +00:00
Yaxun (Sam) Liu	9d899d8f01	[HIP] Support `-fgpu-default-stream` Introduce -fgpu-default-stream={legacy\|per-thread} option to support per-thread default stream for HIP runtime. When -fgpu-default-stream=per-thread, HIP kernels are launched through hipLaunchKernel_spt instead of hipLaunchKernel. Also HIP_API_PER_THREAD_DEFAULT_STREAM=1 is defined by the preprocessor to enable other per-thread stream API's. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D120298	2022-02-23 22:28:29 -05:00
Zahira Ammarguellat	1592d88aa7	Add support for floating-point option `ffp-eval-method` and for `pragma clang fp eval_method`. Differential Revision: https://reviews.llvm.org/D109239	2022-02-23 15:00:18 -08:00
Joseph Huber	2b97b16f29	[OpenMP] Add option to make offloading mandatory Currently when we generate OpenMP offloading code we always make fallback code for the CPU. This is necessary for implementing features like conditional offloading and ensuring that unhandled pragmas don't result in missing symbols. However, this is problematic for a few cases. For offloading tests we can silently fail to the host without realizing that offloading failed. Additionally, this makes it impossible to provide interoperabiility to other offloading schemes like HIP or CUDA because those methods do not provide any such host fallback guaruntee. this patch adds the `-fopenmp-offload-mandatory` flag to prevent generating the fallback symbol on the CPU and instead replaces the function with a dummy global and the failed branch with 'unreachable'. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D120353	2022-02-23 16:45:36 -05:00
Joseph Huber	0870a4f59a	[OpenMP] Add flag for disabling thread state in runtime The runtime uses thread state values to indicate when we use an ICV or are in nested parallelism. This is done for OpenMP correctness, but it not needed in the majority of cases. The new flag added is `-fopenmp-assume-no-thread-state`. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D120106	2022-02-18 08:35:05 -05:00
Florian Hahn	09193f20a1	Revert "Add support for floating-point option `ffp-eval-method` and for" This reverts commit `32b73bc6ab`. This breaks builds on macOS in some configurations, because __FLT_EVAL_METHOD__ is set to an unexpected value. E.g. https://green.lab.llvm.org/green/job/clang-stage1-RA/28282/consoleFull#129538464349ba4694-19c4-4d7e-bec5-911270d8a58c More details available in the review thread https://reviews.llvm.org/D109239	2022-02-18 11:04:00 +00:00
Zahira Ammarguellat	32b73bc6ab	Add support for floating-point option `ffp-eval-method` and for `pragma clang fp eval_method`. https://reviews.llvm.org/D109239	2022-02-17 08:59:21 -08:00
Joseph Huber	64ecdc1cb1	[OpenMP] Pass AMDGPU math libraries into the linker wrapper This patch passes in the AMDPGU math libraries to the linker wrapper. The wrapper already handles linking OpenMP bitcode libraries via the `--target-library` option. This should be sufficient to link in math libraries for the accompanying architecture. Fixes #53526. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D119841	2022-02-16 16:40:40 -05:00
Joseph Huber	5781839f7a	Revert "[OpenMP] Pass AMDGPU math libraries into the linker wrapper" This hits an assertion in the linker wrapper. Revert for now, will fix later. This reverts commit `61fb260d9d`.	2022-02-16 11:54:44 -05:00
Joseph Huber	61fb260d9d	[OpenMP] Pass AMDGPU math libraries into the linker wrapper This patch passes in the AMDPGU math libraries to the linker wrapper. The wrapper already handles linking OpenMP bitcode libraries via the `--target-library` option. This should be sufficient to link in math libraries for the accompanying architecture. Fixes #53526. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D119841	2022-02-16 11:39:11 -05:00
Adrian Prantl	0604d86c07	Darwin: introduce a global override for debug prefix map entries. This patch adds a new Darwin clang driver environment variable in the spirit of RC_DEBUG_OPTIONS, called RC_DEBUG_PREFIX_MAP, which allows a meta build tool to add one additional -fdebug-prefix-map entry without the knowledge of the build system. rdar://85224675 Differential Revision: https://reviews.llvm.org/D119850	2022-02-16 08:36:26 -08:00
Nico Weber	125abb61f7	Revert "Add support for floating-point option `ffp-eval-method` and for" This reverts commit `4bafe65c2b`. Breaks at least Misc/warning-flags.c, see comments on https://reviews.llvm.org/D109239	2022-02-15 22:02:25 -05:00
Zahira Ammarguellat	4bafe65c2b	Add support for floating-point option `ffp-eval-method` and for `pragma clang fp eval_method`.	2022-02-15 13:59:27 -08:00
Joseph Huber	24ecafb413	[OpenMP] Add support for CPU offloading in new driver This patch adds support for linking CPU offloading applications in the linker wrapper. We generate the necessary linking job using the host linker's path and library arguments. This may not be true for more complex offloading schemes, but this is sufficient for now. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D119613	2022-02-15 15:05:30 -05:00
Joseph Huber	a0e8077d28	[OpenMP][NFC] Simplify identifying the device bitcode library Now that the old device runtime has been deleted there is only a single target that differs by the triple and the architecture. Simplify the scheme for identifying the library but directly using the triple. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D119638	2022-02-12 14:55:47 -05:00
Yuanfang Chen	f927021410	Reland "[clang-cl] Support the /JMC flag" This relands commit `b380a31de0`. Restrict the tests to Windows only since the flag symbol hash depends on system-dependent path normalization.	2022-02-10 15:16:17 -08:00
Yuanfang Chen	b380a31de0	Revert "[clang-cl] Support the /JMC flag" This reverts commit `bd3a1de683`. Break bots: https://luci-milo.appspot.com/ui/p/fuchsia/builders/toolchain.ci/clang-windows-x64/b8822587673277278177/overview	2022-02-10 14:17:37 -08:00
Yuanfang Chen	bd3a1de683	[clang-cl] Support the /JMC flag The introduction and some examples are on this page: https://devblogs.microsoft.com/cppblog/announcing-jmc-stepping-in-visual-studio/ The `/JMC` flag enables these instrumentations: - Insert at the beginning of every function immediately after the prologue with a call to `void __fastcall __CheckForDebuggerJustMyCode(unsigned char *JMC_flag)`. The argument for `__CheckForDebuggerJustMyCode` is the address of a boolean global variable (the global variable is initialized to 1) with the name convention `__<hash>_<filename>`. All such global variables are placed in the `.msvcjmc` section. - The `<hash>` part of `__<hash>_<filename>` has a one-to-one mapping with a directory path. MSVC uses some unknown hashing function. Here I used DJB. - Add a dummy/empty COMDAT function `__JustMyCode_Default`. - Add `/alternatename:__CheckForDebuggerJustMyCode=__JustMyCode_Default` link option via ".drectve" section. This is to prevent failure in case `__CheckForDebuggerJustMyCode` is not provided during linking. Implementation: All the instrumentations are implemented in an IR codegen pass. The pass is placed immediately before CodeGenPrepare pass. This is to not interfere with mid-end optimizations and make the instrumentation target-independent (I'm still working on an ELF port in a separate patch). Reviewed By: hans Differential Revision: https://reviews.llvm.org/D118428	2022-02-10 10:26:30 -08:00
Yuanfang Chen	b96106af3f	[AArch64][ARM] add -Wunaligned-access only for clang Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D119301	2022-02-10 10:26:30 -08:00
Yaxun (Sam) Liu	1d97cb1f6e	[HIP] Emit amdgpu_code_object_version module flag code object version determines ABI, therefore should not be mixed. This patch emits amdgpu_code_object_version module flag in LLVM IR based on code object version (default 4). The amdgpu_code_object_version value is code object version times 100. LLVM IR with different amdgpu_code_object_version module flag cannot be linked. The -cc1 option -mcode-object-version=none is for ROCm device library use only, which supports multiple ABI. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D119026	2022-02-08 21:58:40 -05:00
Bill Wendling	deaf22bc0e	[X86] Implement -fzero-call-used-regs option The "-fzero-call-used-regs" option tells the compiler to zero out certain registers before the function returns. It's also available as a function attribute: zero_call_used_regs. The two upper categories are: - "used": Zero out used registers. - "all": Zero out all registers, whether used or not. The individual options are: - "skip": Don't zero out any registers. This is the default. - "used": Zero out all used registers. - "used-arg": Zero out used registers that are used for arguments. - "used-gpr": Zero out used registers that are GPRs. - "used-gpr-arg": Zero out used GPRs that are used as arguments. - "all": Zero out all registers. - "all-arg": Zero out all registers used for arguments. - "all-gpr": Zero out all GPRs. - "all-gpr-arg": Zero out all GPRs used for arguments. This is used to help mitigate Return-Oriented Programming exploits. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D110869	2022-02-08 17:42:54 -08:00
Amilendra Kodithuwakku	424e850f1e	[clang][ARM] Re-word PACBTI warning. The original warning added in D115501 when pacbti is used with an incompatible architecture was not exactly correct because it was not really ignored and can affect codegen. Therefore reword to say that the pacbti option is incompatible with the given architecture. Reviewed By: chill Differential Revision: https://reviews.llvm.org/D119166	2022-02-08 19:13:02 +00:00
Mark Murray	3d7662142d	[ARM] Undeprecate complex IT blocks AArch32/Armv8A introduced the performance deprecation of certain patterns of IT instructions. After some debate internal to ARM, this is now being reverted; i.e. no IT instruction patterns are performance deprecated anymore, as the perfomance degredation is not significant enough. This reverts the following: "ARMv8-A deprecates some uses of the T32 IT instruction. All uses of IT that apply to instructions other than a single subsequent 16-bit instruction from a restricted set are deprecated, as are explicit references to the PC within that single 16-bit instruction. This permits the non-deprecated forms of IT and subsequent instructions to be treated as a single 32-bit conditional instruction." The deprecation no longer applies, but the behaviour may be controlled by the -arm-restrict-it and -arm-no-restrict-it command-line options, with the latter being the default. No warnings about complex IT blocks will be generated. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D118044	2022-02-07 15:47:53 +00:00
Joseph Huber	034adaf5be	[OpenMP] Completely remove old device runtime This patch completely removes the old OpenMP device runtime. Previously, the old runtime had the prefix `libomptarget-new-` and the old runtime was simply called `libomptarget-`. This patch makes the formerly new runtime the only runtime available. The entire project has been deleted, and all references to the `libomptarget-new` runtime has been replaced with `libomptarget-`. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D118934	2022-02-04 15:31:33 -05:00
Joseph Huber	8cc4ca95b0	[OpenMP] Add Cuda path to linker wrapper tool The linker wrapper tool uses the 'nvlink' and 'ptxas' binaries to link and assemble device files. Previously we searched for this using the binaries in the user's path. This didn't work in cases where the user passed in a specific Cuda path to Clang. This patch changes the linker wrapper to accept an argument for the Cuda path we can get from Clang. This should fix #53573. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D118944	2022-02-03 20:39:18 -05:00
Joseph Huber	9375f1563e	[OpenMP] Cleanup the Linker Wrapper Summary: Various changes and cleanup for the Linker Wrapper tool.	2022-01-31 23:11:42 -05:00
Joseph Huber	bf499c58af	[OpenMP] Implement save temps functionality in linker wrapper Summary: This patch implements the `-save-temps` flag for the linker wrapper. This allows the user to inspect the intermeditary outpout that the linker wrapper creates.	2022-01-31 23:11:42 -05:00
Joseph Huber	cb7cfaec71	[OpenMP] Add extra flag handling to linker wrapper This patch adds support for a few extra flags in the linker wrapper, such as debugging flags, verbose output, and passing arguments to ptxas. We also now forward pass remarks to the LLVM backend so they will show up in the LTO passes. Depends on D117049 Differential Revision: https://reviews.llvm.org/D117156	2022-01-31 23:11:41 -05:00
Joseph Huber	f28c3153ee	[OpenMP] Add support for embedding bitcode images in wrapper tool Summary; This patch adds support for embedding device images in the linker wrapper tool. This will be used for performing JIT functionality in the future. Depends on D117048 Differential Revision: https://reviews.llvm.org/D117049	2022-01-31 23:11:41 -05:00
Joseph Huber	3762111aa9	[OpenMP] Link the bitcode library late for device LTO Summary: This patch adds support for linking the OpenMP device bitcode library late when doing LTO. This simply passes it in as an additional device file when doing the final device linking phase with LTO. This has the advantage that we don't link it multiple times, and the device references do not get inlined and prevent us from doing needed OpenMP optimizations when we have visiblity of the whole module. Fix some failings where the implicit conversion of an Error to an Expected triggered the deleted copy constructor. Depends on D116675 Differential revision: https://reviews.llvm.org/D117048	2022-01-31 23:11:41 -05:00
Joseph Huber	c732c3df74	[OpenMP] Initial Implementation of LTO and bitcode linking in linker wrapper This patch implements the fist support for handling LTO in the offloading pipeline. The flag `-foffload-lto` is used to control if bitcode is embedded into the device. If bitcode is found in the device, the extracted files will be sent to the LTO pipeline to be linked and sent to the backend. This implementation does not separately link the device bitcode libraries yet. Depends on D116675 Differential Revision: https://reviews.llvm.org/D116975	2022-01-31 23:11:41 -05:00
Joseph Huber	95c8f74640	[Clang] Introduce Clang Linker Wrapper Tool This patch introduces a linker wrapper tool that allows us to preprocess files before they are sent to the linker. This adds a dummy action and job to the driver stage that builds the linker command as usual and then replaces the command line with the wrapper tool. Depends on D116543 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D116544	2022-01-31 15:56:04 -05:00
Joseph Huber	12ae095bbb	[OpenMP] Embed device files into the host IR This patch adds support for embedding the device object files into the host IR to create a fat binary. Each offloading file will be inserted into a section with the following naming format `.llvm.offloading.<triple>.<arch>.<filename>`. Depends on D116542 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D116543	2022-01-31 15:56:02 -05:00
Joseph Huber	2f9ace9e9a	[OpenMP] Introduce new flag to change offloading driver pipeline This patch introduces the `-fopenmp-new-driver` option which instructs the compiler to use a new driver scheme for producing offloading code. In this scheme we create a complete offloading object file and then pass it as input to the host compilation phase. This will allow us to embed the object code in the backend phase. This is the start of a series of commits to rework the OpenMP offloading driver pipeline. The goal of this is to simplify the steps required for creating an offloading program. This patch changes the driver's configuration to simply pass the device file back to the host as an input so it can be embedded as an LLVM IR global during the backend, then simply passes that object file to the linker. This driver implementation will currently create the following phases, ``` $ clang input.c -fopenmp -fopenmp-targets=nvptx64 -fopenmp-new-driver -ccc-print-phases +- 0: input, "input.c", c, (host-openmp) +- 1: preprocessor, {0}, cpp-output, (host-openmp) +- 2: compiler, {1}, ir, (host-openmp) \| \| +- 3: input, "input.c", c, (device-openmp) \| \| +- 4: preprocessor, {3}, cpp-output, (device-openmp) \| \|- 5: compiler, {4}, ir, (device-openmp) \| +- 6: offload, "host-openmp (x86_64-unknown-linux-gnu)" {2}, "device-openmp (nvptx64)" {5}, ir \| +- 7: backend, {6}, assembler, (device-openmp) \|- 8: assembler, {7}, object, (device-openmp) +- 9: offload, "host-openmp (x86_64-unknown-linux-gnu)" {2}, "device-openmp (nvptx64)" {8}, ir +- 10: backend, {9}, assembler, (host-openmp) +- 11: assembler, {10}, object, (host-openmp) 12: clang-linker-wrapper, {11}, image, (host-openmp) ``` Which will map to the following bindings ``` # "x86_64-unknown-linux-gnu" - "clang", inputs: ["input.c"], output: "/tmp/input-bae62e.bc" # "nvptx64" - "clang", inputs: ["input.c", "/tmp/input-bae62e.bc"], output: "/tmp/input-76784e.s" # "nvptx64" - "NVPTX::Assembler", inputs: ["/tmp/input-76784e.s"], output: "/tmp/input-8f29db.o" # "x86_64-unknown-linux-gnu" - "clang", inputs: ["/tmp/input-bae62e.bc", "/tmp/input-8f29db.o"], output: "/tmp/input-545450.o" # "x86_64-unknown-linux-gnu" - "Offload::Linker", inputs: ["/tmp/input-545450.o"], output: "a.out" ``` Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D116541	2022-01-31 15:55:58 -05:00
Ben Shi	ac3894cf1e	[Clang] Move XCore specific options from Clang.cpp to XCore.cpp Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D118535	2022-01-30 02:24:35 +00:00
Amilendra Kodithuwakku	1f08b08674	[clang][ARM] Emit warnings when PACBTI-M is used with unsupported architectures Branch protection in M-class is supported by - Armv8.1-M.Main - Armv8-M.Main - Armv7-M Attempting to enable this for other architectures, either by command-line (e.g -mbranch-protection=bti) or by target attribute in source code (e.g. __attribute__((target("branch-protection=..."))) ) will generate a warning. In both cases function attributes related to branch protection will not be emitted. Regardless of the warning, module level attributes related to branch protection will be emitted when it is enabled via the command-line. The following people also contributed to this patch: - Victor Campos Reviewed By: chill Differential Revision: https://reviews.llvm.org/D115501	2022-01-28 09:59:58 +00:00
Fangrui Song	35d15222c0	[Driver] Remove obsoleted -gz=zlib-gnu GCC added -gz=zlib-gnu in 2014 for -gz meaning change (.zdebug => SHF_COMPRESSED) and the legacy zlib-gnu hasn't gain adoption. According to Debian Code Search (`gz=zlib-gnu`), no project uses -gz=zlib-gnu (valgrind has a configure to use -gz=zlib). Any possible -gz=zlib-gnu user can switch to -gz smoothly (supported by integrated assemblers for many years; binutils 2.26). Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D117744	2022-01-26 13:26:51 -08:00

1 2 3 4 5 ...

1150 Commits