llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	3a79f1caa9	[Driver][test] Restore %clang -cc1 in test/Driver This partially reverts commit `1609a5d771` (the test/Driver part). We want to discourage %clang_cc1 and clang -cc1 in test/Driver. The clang -cc1 uses in hlsl/offload/etc are not good examples.	2022-09-29 12:21:57 -07:00
Michał Górny	1609a5d771	[clang] [test] Use %clang_cc1 substitution consistently Use the `%clang_cc1` substitution consistently across the test suite, replacing inline `%clang -cc1` invocations, except for one Preprocessor test where this is causing breakage. This is necessary to ensure that additional parameters passed via `%clang` do not interfere with `-cc1` that must always be passed as the first command-line argument. Remove the additional substitution blocking `%clang_cc1` use in Driver tests. It has been added in 2013 and was supposed to prevent tests calling `clang -cc1` from being added to Driver. The state of the test suite proves that it did not succeed at all. Differential Revision: https://reviews.llvm.org/D134880	2022-09-29 20:59:00 +02:00
Joseph Huber	080022d8ed	[LinkerWrapper] Embed OffloadBinaries for OpenMP offloading images The OpenMP offloading runtine currently uses an array of linked offloading images. One downside to this is that we cannot know the architecture or triple associated with the given image. In this patch, instead of embedding the image itself, we embed an offloading binary instead. This binary is simply a binary format that wraps around the original linked image to provide some additional metadata. This will allow us to support offloading to multiple architecture, or performing future JIT compilation inside of the runtime, more clearly. Additionally, these can be placed at a special section such that the supported architectures can be identified using objdump with the support from D126904. This needs to be stored in a new section name `.llvm.offloading.images` because the `.llvm.offloading` section implicitly uses the `SHF_EXCLUDE` flag and will always be stripped. This patch does not contain the necessary code to parse these in libomptarget. Depends on D127246 Reviewed By: saiislam Differential Revision: https://reviews.llvm.org/D127304	2022-07-21 13:20:01 -04:00
Joseph Huber	fe6a391357	[Clang] Fix tests failing due to invalid syntax for host triple Summary: We use the `--host-triple=` argument to manually set the target triple. This was changed to include the `=` previously but was not included in these additional test cases, causing it for fail on some unsupported systems.	2022-07-11 21:31:56 -04:00
Joseph Huber	ce091eb3b9	[HIP] Add support for handling HIP in the linker wrapper This patch adds the necessary changes required to bundle and wrap HIP files. The bundling is done using `clang-offload-bundler` currently to mimic `fatbinary` and the wrapping is done using very similar runtime calls to CUDA. This still does not support managed / surface / texture variables, that would require some additional information in the entry. One difference in the codegeneration with AMD is that I don't check if the handle is null before destructing it, I'm not sure if that's required. With this we should be able to support HIP with the new driver. Depends on D128850 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D128914	2022-07-11 15:49:23 -04:00
Joseph Huber	d2ead9e324	[LinkerWrapper][NFC] Rework command line argument handling in the linker wrapper Summary: This patch reworks the command line argument handling in the linker wrapper from using the LLVM `cl` interface to using the `Option` interface with TableGen. This has several benefits compared to the old method. We use arguments from the linker arguments in the linker wrapper, such as the libraries and input files, this allows us to properly parse these. Additionally we can now easily set up aliases to the linker wrapper arguments and pass them in the linker input directly. That is, pass an option like `cuda-path=` as `--offload-arg=cuda-path=` in the linker's inputs. This will allow us to handle offloading compilation in the linker itself some day. Finally, this is also a much cleaner interface for passing arguments to the individual device linking jobs.	2022-07-08 11:18:38 -04:00
Joseph Huber	34fc1db9a8	[LinkerWrapper] Change wrapping to include jumps for other variables Summary: We don't currently support other variable types, like managed or surface. This patch simply adds code that checks the flags and does nothing. This prevents us from registering a surface as a variable as we do now. In the future, registering these will require adding the flags to the entry struct.	2022-06-29 14:48:39 -04:00
Nikita Popov	41d5033eb1	[IR] Enable opaque pointers by default This enabled opaque pointers by default in LLVM. The effect of this is twofold: * If IR that contains neither explicit ptr nor %T* types is passed to tools, we will now use opaque pointer mode, unless -opaque-pointers=0 has been explicitly passed. * Users of LLVM as a library will now default to opaque pointers. It is possible to opt-out by calling setOpaquePointers(false) on LLVMContext. A cmake option to toggle this default will not be provided. Frontends or other tools that want to (temporarily) keep using typed pointers should disable opaque pointers via LLVMContext. Differential Revision: https://reviews.llvm.org/D126689	2022-06-02 09:40:56 +02:00
Joseph Huber	26eb04268f	[Clang] Introduce clang-offload-packager tool to bundle device files In order to do offloading compilation we need to embed files into the host and create fatbainaries. Clang uses a special binary format to bundle several files along with their metadata into a single binary image. This is currently performed using the `-fembed-offload-binary` option. However this is not very extensibile since it requires changing the command flag every time we want to add something and makes optional arguments difficult. This patch introduces a new tool called `clang-offload-packager` that behaves similarly to CUDA's `fatbinary`. This tool takes several input files with metadata and embeds it into a single image that can then be embedded in the host. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D125165	2022-05-11 09:39:13 -04:00
Joseph Huber	f49d576a88	[CUDA] Add wrapper code generation for registering CUDA images This patch adds the necessary code generation to create the wrapper code that registers all the globals in CUDA. We create the necessary functions and iterate through the list of `__start_cuda_offloading_entries` to find which globals must be registered. This is very similar to the code generation done currently in Clang for non-rdc builds, but here we are registering a fully linked fatbinary and finding the globals via the above sections. With this we should be able to fully support basic RDC / LTO building of CUDA code. It's also worth noting that this does not include the necessary PTX to JIT the image, so to use this support the offloading architecture must match the system's architecture. Depends on D123810 Reviewed By: tra Differential Revision: https://reviews.llvm.org/D123812	2022-05-11 07:30:25 -04:00
Joseph Huber	ee74abaad7	[OpenMP] Add triple to the linker wrapper job Summary: I forgot to add the triple to the linker wrapper job, so we were still generating code for the unintended platforms.	2022-04-20 08:23:43 -04:00
Joseph Huber	1dfe0273fd	[OpenMP] Add explicit triple to linker wrapper test Summary: Some platforms like Mach-O require different handling of section names. This is not supported on Mac-OS or Windows yet so we shouldn't be testing the compilation there. Add an explicit triple to the tests.	2022-04-20 07:24:51 -04:00
Joseph Huber	8c64928887	[OpenMP] Add necessary registered targets for linker wrapper test Summary: The linker wrapper needs to use the registered backend to perform LTO. This was causing problems on the buildbots that didn't support it.	2022-04-19 18:48:58 -04:00
Joseph Huber	260c5df2d5	[OpenMP] Add better testing for the linker wrapper The linker wrapper is used to perform linking and wrapping of embedded device object files. Currently its internals are not able to be tested easily. This patch adds the `--dry-run` and `--print-wrapped-module` options to investigate the link jobs that will be run along with the wrapped code that will be created to register the binaries. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D124039	2022-04-19 18:37:09 -04:00

14 Commits