llvm-project

Commit Graph

Author	SHA1	Message	Date
Kazu Hirata	5891420e68	[clang] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-03 11:54:46 -08:00
Joseph Huber	bfd69ad596	[LinkerWrapper] Ignore OFK_None kinds for building registration code Summary: The linker wrapper uses this metadata to determine which registration code to emit, e.g. CUDA, HIP or OpenMP. If we encounter an OFK_None we should just ignore it.	2022-11-29 09:46:51 -06:00
Zequan Wu	84be92d26f	Reland "[LTO][COFF] Use bitcode file names in lto native object file names." This reverts commit `3410808294` with fixes.	2022-11-22 13:46:42 -08:00
Roman Lebedev	3410808294	Revert "Reland "[LTO][COFF] Use bitcode file names in lto native object file names."" Breaks build of LLVMgold here: ``` /repositories/llvm-project/llvm/tools/gold/gold-plugin.cpp:1108:19: error: no matching function for call to 'localCache' Cache = check(localCache("ThinLTO", "Thin", options::cache_dir, AddBuffer)); ^~~~~~~~~~ /repositories/llvm-project/llvm/include/llvm/Support/Caching.h:72:21: note: candidate function not viable: no known conversion from '(lambda at /repositories/llvm-project/llvm/tools/gold/gold-plugin.cpp:1102:20)' to 'llvm::AddBufferFn' (aka 'function<void (unsigned int, const llvm::Twine &, std::unique_ptr<MemoryBuffer>)>') for 4th argument Expected<FileCache> localCache( ^ /repositories/llvm-project/llvm/tools/gold/gold-plugin.cpp:1110:18: error: no viable conversion from '(lambda at /repositories/llvm-project/llvm/tools/gold/gold-plugin.cpp:1094:20)' to 'llvm::AddStreamFn' (aka 'function<Expected<std::unique_ptr<CachedFileStream>> (unsigned int, const llvm::Twine &)>') check(Lto->run(AddStream, Cache)); ^~~~~~~~~ /usr/bin/../lib/gcc/x86_64-linux-gnu/12/../../../../include/c++/12/bits/std_function.h:375:7: note: candidate constructor not viable: no known conversion from '(lambda at /repositories/llvm-project/llvm/tools/gold/gold-plugin.cpp:1094:20)' to 'std::nullptr_t' for 1st argument function(nullptr_t) noexcept ^ /usr/bin/../lib/gcc/x86_64-linux-gnu/12/../../../../include/c++/12/bits/std_function.h:386:7: note: candidate constructor not viable: no known conversion from '(lambda at /repositories/llvm-project/llvm/tools/gold/gold-plugin.cpp:1094:20)' to 'const std::function<llvm::Expected<std::unique_ptr<llvm::CachedFileStream>> (unsigned int, const llvm::Twine &)> &' for 1st argument function(const function& __x) ^ /usr/bin/../lib/gcc/x86_64-linux-gnu/12/../../../../include/c++/12/bits/std_function.h:404:7: note: candidate constructor not viable: no known conversion from '(lambda at /repositories/llvm-project/llvm/tools/gold/gold-plugin.cpp:1094:20)' to 'std::function<llvm::Expected<std::unique_ptr<llvm::CachedFileStream>> (unsigned int, const llvm::Twine &)> &&' for 1st argument function(function&& __x) noexcept ^ /usr/bin/../lib/gcc/x86_64-linux-gnu/12/../../../../include/c++/12/bits/std_function.h:435:2: note: candidate template ignored: requirement '_Callable<(lambda at /repositories/llvm-project/llvm/tools/gold/gold-plugin.cpp:1094:20) &, (lambda at /repositories/llvm-project/llvm/tools/gold/gold-plugin.cpp:1094:20), std::__invoke_result<(lambda at /repositories/llvm-project/llvm/tools/gold/gold-plugin.cpp:1094:20) &, unsigned int, const llvm::Twine &>>::value' was not satisfied [with _Functor = (lambda at /repositories/llvm-project/llvm/tools/gold/gold-plugin.cpp:1094:20) &] function(_Functor&& __f) ^ /repositories/llvm-project/llvm/include/llvm/LTO/LTO.h:278:25: note: passing argument to parameter 'AddStream' here Error run(AddStreamFn AddStream, FileCache Cache = nullptr); ^ ``` This reverts commit `387620aa8c`.	2022-11-23 00:40:12 +03:00
Zequan Wu	387620aa8c	Reland "[LTO][COFF] Use bitcode file names in lto native object file names." This reverts commit `eef5405f74`.	2022-11-22 11:26:18 -08:00
Zequan Wu	eef5405f74	Revert "[LTO][COFF] Use bitcode file names in lto native object file names." This reverts commit `531ed6d5aa`.	2022-11-22 10:55:05 -08:00
Zequan Wu	531ed6d5aa	[LTO][COFF] Use bitcode file names in lto native object file names. Currently the lto native object files have names like main.exe.lto.1.obj. In PDB, those names are used as names for each compiland. Microsoft’s tool SizeBench uses those names to present to users the size of each object files. So, names like main.exe.lto.1.obj is not user friendly. This patch makes the lto native object file names more readable by using the bitcode file names as part of the file names. For example, if the input bitcode file has path like "path/to/foo.obj", its corresponding lto native object file path would be "path/to/main.exe.lto.foo.obj". Since the lto native object file name only bothers PDB, this patch only changes the lld-linker's behavior. Reviewed By: tejohnson, MaskRay, #lld-macho Differential Revision: https://reviews.llvm.org/D137217	2022-11-22 10:19:58 -08:00
Fangrui Song	88aac16324	[LinkerWrapper] Fix -Wpessimizing-move	2022-11-11 11:53:06 -08:00
Joseph Huber	0f7e863154	[LinkerWrapper] Perform device linking steps in parallel This patch changes the device linking steps to be performed in parallel when multiple offloading architectures are being used. We use the LLVM parallelism support to accomplish this by simply doing each inidividual device linking job in a single thread. This change required re-parsing the input arguments as these arguments have internal state that would not be properly shared between the threads otherwise. By default, the parallelism uses all threads availible. But this can be controlled with the `--wrapper-jobs=` option. This was required in a few tests to ensure the ordering was still deterministic. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D136701	2022-11-11 13:46:33 -06:00
Joseph Huber	3384f05a2c	[llvm-objdump][Offload] Use common offload extraction method A previous patch introduced a common function used to extract offloading binaries from an image. Therefore we no longer need to duplicate the functionality in the `llvm-objdump` implementation. Functionally, this removes the old warning behaviour when given malformed input. This has been changed to a hard error, which is effectively the same. This required a slight tweak in the linker wrapper to filter out the user passing shared objects directly. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D136796	2022-11-03 16:19:13 -05:00
Joseph Huber	b9ee2acc9c	[LinkerWrapper] report on missing libraries The linker wrapper does its own library searching for static archives that can contain device code. The device linking phases happen before the host linking phases so that we can generate the necessary registration code and link it in with the rest of the code. Previously, If a library containing needed device code was not found the execution would continue silently until it failed with undefined symbols. This patch allows the linker wrapper to perform its own check beforehand to catch these errors. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D137180	2022-11-02 15:28:09 -05:00
Joseph Huber	f50a7c7a26	[LinkerWrapper] Fix optimized debugging builds for NVPTX LTO The ptxas assembler does not allow the `-g` flag along with optimizations. Normally this is degraded to line info in the driver, but when using LTO we did not have this step and the linker wrapper was not correctly degrading the option. Note that this will not work if the user does not pass `-g` again to the linker invocation. That will require setting some flags in the binary to indicate that debugging was used when building. This fixes #57990 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D134660	2022-09-27 10:49:17 -05:00
Joseph Huber	5dbc7cf7ca	[Object] Refactor code for extracting offload binaries We currently extract offload binaries inside of the linker wrapper. Other tools may wish to do the same extraction operation. This patch simply factors out this handling into the `OffloadBinary.h` interface. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D132689	2022-09-06 08:55:16 -05:00
Joseph Huber	ca6d70b32e	[LinkerWrapper][NFC] Clean-up error handling Summary: We should return and propagate these errors instead of reporting them immediately. Also use llvm::copy instead.	2022-08-18 15:01:01 -04:00
Kazu Hirata	6d9cd9199a	Use llvm::all_of (NFC)	2022-08-14 16:25:36 -07:00
Joseph Huber	b71b22e48d	[LinkerWrapper] Use C++17 structured bindings Summary: This patch updates the linker wrapper to use C++17 structured bindings now that they are availible.	2022-08-08 15:07:03 -04:00
Nicolai Hähnle	3e874bcf06	ClangLinkerWrapper: explicitly #include <atomic> This code relied on implicitly having std::atomic available via the ManagedStatic.h header. Differential Revision: https://reviews.llvm.org/D130574	2022-07-27 14:57:34 +02:00
Joseph Huber	080022d8ed	[LinkerWrapper] Embed OffloadBinaries for OpenMP offloading images The OpenMP offloading runtine currently uses an array of linked offloading images. One downside to this is that we cannot know the architecture or triple associated with the given image. In this patch, instead of embedding the image itself, we embed an offloading binary instead. This binary is simply a binary format that wraps around the original linked image to provide some additional metadata. This will allow us to support offloading to multiple architecture, or performing future JIT compilation inside of the runtime, more clearly. Additionally, these can be placed at a special section such that the supported architectures can be identified using objdump with the support from D126904. This needs to be stored in a new section name `.llvm.offloading.images` because the `.llvm.offloading` section implicitly uses the `SHF_EXCLUDE` flag and will always be stripped. This patch does not contain the necessary code to parse these in libomptarget. Depends on D127246 Reviewed By: saiislam Differential Revision: https://reviews.llvm.org/D127304	2022-07-21 13:20:01 -04:00
Joseph Huber	47b0aa5e4b	[LinkerWrapper] Rework passing args to the LLVM backend	2022-07-18 12:44:15 -04:00
Joseph Huber	1a5c1ee5b8	[LinkerWrapper] Fix deleted constructor error on older compilers Summary: The previous patch moved some functoinality into a new function and returned it. The vector contained move-only members. Newer compilers should figure this out and I didn't notice any problems, but other ones have problems. Explicitly move this vector to hopefully solve the issue.	2022-07-13 15:45:35 -04:00
Joseph Huber	c70db40ceb	[LinkerWrapper] Support time tracing in the linker wrapper Summary: This patch adds the new `--wrapper-time-trace=` option to write a time tracing JSON file indicating where time was spent in the linker wrapper. We also reformat and group some of the existing code to make constraining the scope easier for time tracing. We use the `--wrapper` prefix to set this apart from the time tracing that lld may use.	2022-07-13 15:38:39 -04:00
Joseph Huber	20d253e3bf	[LinkerWrapper] Fix linker-wrapper not working with host-LTO	2022-07-13 12:32:02 -04:00
Joseph Huber	a3cbb158a2	[LinkerWrapper] Tweak save-temps output name Summary: A previous patch added the Task to the output filename when doing `save-temps` the majority of cases there is only a single task so we only add the task explicitly to differentiate it from the first one.	2022-07-12 19:42:04 -04:00
Joseph Huber	1586b59528	[LinkerWrapper] Clean-up unused definitions Summary: There are a few definitions that are unused or unnecessary after previous changes, clean them up.	2022-07-12 11:46:46 -04:00
Joseph Huber	86a49a4f4f	[LinkerWrapper] Make ThinLTO work inside the linker wrapper Summary: Previous assumptions held that the LTO stage would only have a single output. This is incorrect when using thinLTO which outputs multiple files. Additionally there were some bugs with how we hanlded input that cause problems when performing thinLTO. This patch addresses these issues. The performance of Thin-LTO is currently pretty bad. But I am content to leave it that way as long as it compiles.	2022-07-12 11:46:46 -04:00
Joseph Huber	ce091eb3b9	[HIP] Add support for handling HIP in the linker wrapper This patch adds the necessary changes required to bundle and wrap HIP files. The bundling is done using `clang-offload-bundler` currently to mimic `fatbinary` and the wrapping is done using very similar runtime calls to CUDA. This still does not support managed / surface / texture variables, that would require some additional information in the entry. One difference in the codegeneration with AMD is that I don't check if the handle is null before destructing it, I'm not sure if that's required. With this we should be able to support HIP with the new driver. Depends on D128850 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D128914	2022-07-11 15:49:23 -04:00
Joseph Huber	dbd3ade17b	[LinkerWrapper] Fix errors not exiting inside of the LTO pipeline The LTO pipeline handles its errors using the diagnostics handler callback function. We were not checking the results of these errors and not properly returning an error code in the linker wrapper when errors occured inside of the LTO pipeline. This patch adds a simple boolean flag to indicate if the LTO backend failed to any reason and quit. Reviewed By: ye-luo Differential Revision: https://reviews.llvm.org/D129423	2022-07-09 11:29:04 -04:00
Joseph Huber	d36b96afb2	[LinkerWrapper] Fix use of string savers and correctly pass bitcode libraries This patch removes some uses of string savers that are no-longer needed. We also create a new string saver when linking bitcode files. It seems that occasionally the symbol string references can go out of scope when they are added to the LTO input so we need to save these names that are used for symbol resolution. Additionally, a previous patch added new logic for handling bitcode libraries, but failed to actually add them to the input. This bug has been fixed. Fixes #56445 Reviewed By: ye-luo Differential Revision: https://reviews.llvm.org/D129383	2022-07-08 17:22:35 -04:00
Joseph Huber	74a8fce6e8	[LinkerWrapper] Fix save-temps and argument name Summary: The previous path reworked some handling of temporary files which exposed some bugs related to capturing local state by reference in the callback labmda. Squashing this by copying in everything instead. There was also a problem where the argument name was changed for `--bitcode-library=` but clang still used `--target-library=`.	2022-07-08 11:38:33 -04:00
Joseph Huber	e0de264f63	[LinkerWrapper][NFC] Move error handling to a common function Summary: This patch merges all the error handling functions to a single function call so we don't define the same lambda many times.	2022-07-08 11:18:38 -04:00
Joseph Huber	d2ead9e324	[LinkerWrapper][NFC] Rework command line argument handling in the linker wrapper Summary: This patch reworks the command line argument handling in the linker wrapper from using the LLVM `cl` interface to using the `Option` interface with TableGen. This has several benefits compared to the old method. We use arguments from the linker arguments in the linker wrapper, such as the libraries and input files, this allows us to properly parse these. Additionally we can now easily set up aliases to the linker wrapper arguments and pass them in the linker input directly. That is, pass an option like `cuda-path=` as `--offload-arg=cuda-path=` in the linker's inputs. This will allow us to handle offloading compilation in the linker itself some day. Finally, this is also a much cleaner interface for passing arguments to the individual device linking jobs.	2022-07-08 11:18:38 -04:00
Joseph Huber	42e10354d4	[LinkerWrapper] Identify offloading sections using ELF type Summary: A previous patch added a new ELF section type for LLVM offloading. We should use this when extracting the offloading sections rather than checking the string. This pach also removes the implicit support for COFF and MACH-O because we don't support those currently and should not be included.	2022-07-07 12:38:00 -04:00
Joseph Huber	ed801ad5e5	[Clang] Use metadata to make identifying embedded objects easier Currently we use the `embedBufferInModule` function to store binary strings containing device offloading data inside the host object to create a fatbinary. In the case of LTO, we need to extract this object from the LLVM-IR. This patch adds a metadata node for the embedded objects containing the embedded pointers and the sections they were stored at. This should create a cleaner interface for identifying these values. In the future it may be worthwhile to also encode an `ID` in the metadata corresponding to the object's special section type if relevant. This would allow us to extract the data from an object file and LLVM-IR using the same ID. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D129033	2022-07-07 12:20:25 -04:00
Joseph Huber	0bb1bf1b17	[LinkerWrapper] Add AMDGPU specific options to the LLD invocation We use LLD to perform AMDGPU linking. This linker accepts some arguments through the `-plugin-opt` facilities. These options match what `Clang` will output when given the same input. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D128923	2022-07-05 13:43:51 -04:00
Joseph Huber	1dcbe03c32	[Binary] Further improve malformed input handling for the OffloadBinary Summary: This patch adds some new sanity checks to make sure that the sizes of the offsets are within the bounds of the file or what is expected by the binary. This also improves the error handling of the version structure to be built into the binary itself so we can change it easier.	2022-06-24 09:57:44 -04:00
Joseph Huber	6e6889288c	[Offloading] Embed the target features in the OffloadBinary The target features are necessary for correctly compiling most programs in LTO mode. Currently, these are derived in clang at link time and passed as an arguemnt to the linker wrapper. This is problematic because it requires knowing the required toolchain at link time, which should not be necessry. Instead, these features should be embedded into the offloading binary so we can unify them in the linker wrapper for LTO. This also required changing the offload packager to interpret multiple arguments as concatenation with a comma. This is so we can still use the `,` separator for the argument list. Depends on D127246 Reviewed By: tra Differential Revision: https://reviews.llvm.org/D127686	2022-06-23 13:15:01 -04:00
Joseph Huber	7597988729	[LinkerWrapper][NFC] Change interface to use a StringRef to TempFiles Summary: Currently we use temporary files to write the intermediate results to. However, these are stored as regular strings and we do a few unnecessary copies and conversions of them. This patch simply replaces these strings with a reference to the filename stored in the list of temporary files. The temporary files will stay alive during the whole linking phase and have stable pointers, so we should be able to cheaply pass references to them rather than copying them every time.	2022-06-22 13:16:37 -04:00
Joseph Huber	a9fd8b9113	[LinkerWrapper] Fix calls to deleted Error constructor on older compilers Summary: A recent patch added some new code paths to the linker wrapper. Older compilers seem to have problems with returning errors wrapped in an Excepted type without explicitly moving them. This caused failures in some of the buildbots. This patch fixes that.	2022-06-22 09:39:23 -04:00
Joseph Huber	958a885050	[LinkerWrapper] Rework the linker wrapper and use owning binaries The linker wrapper currently eagerly extracts all identified offloading binaries to a file. This isn't ideal because we will soon open these files again to examine their symbols for LTO and other things. Additionally, we may not use every extracted file in the case of static libraries. This would be very noisy in the case of static libraries that may contain code for several targets not participating in the current link. Recent changes allow us to treat an Offloading binary as a standard binary class. So that allows us to use an OwningBinary to model the file. Now we keep it in memory and only write it once we know which files will be participating in the final link job. This also reworks a lot of the structure around how we handle this by removing the old DeviceFile class. The main benefit from this is that the following doesn't output 32+ files and instead will only output a single temp file for the linked module. ``` $ clang input.c -fopenmp --offload-arch=sm_70 -foffload-lto -save-temps ``` Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D127246	2022-06-22 09:24:10 -04:00
Fangrui Song	95a134254a	Remove unneeded cl::ZeroOrMore for cl::opt/cl::list options	2022-06-05 01:07:51 -07:00
Fangrui Song	d0d1c416cb	Remove unneeded cl::ZeroOrMore for cl::list options	2022-06-04 23:51:13 -07:00
Fangrui Song	734c223445	[clang-link-wrapper] Remove unneeded cl::ZeroOrMore for cl::opt options. NFC Similar to `557efc9a8b`	2022-06-03 22:02:11 -07:00
Joseph Huber	3723868d9e	[OpenMP] Fix file arguments for embedding bitcode in the linker wrapper Summary: The linker wrapper supports embedding bitcode images instead of linked device images to facilitate JIT in the device runtime. However, we were incorrectly passing in the file twice when this option was set. This patch makes sure we only use the intermediate result of the LTO pass and don't add the final output to the full job. In the future we will want to add both of these andle handle that accoridngly to allow the runtime to either use the AoT compiled version or JIT compile the bitcode version if availible.	2022-05-24 13:45:52 -04:00
Joseph Huber	f37101983f	[OpenMP] Add `-Xoffload-linker` to forward input to the device linker We use the clang-linker-wrapper to perform device linking of embedded offloading object files. This is done by generating those jobs inside of the linker-wrapper itself. This patch adds an argument in Clang and the linker-wrapper that allows users to forward input to the device linking phase. This can either be done for every device linker, or for a specific target triple. We use the `-Xoffload-linker <arg>` and the `-Xoffload-linker-<triple> <arg>` syntax to accomplish this. Reviewed By: markdewing, tra Differential Revision: https://reviews.llvm.org/D126226	2022-05-24 09:11:02 -04:00
Joseph Huber	8a0fb965f6	[LinkerWrapper] Group static libraries in their own buffer Summary: Static libraries need to be handled differently from regular inpout files, namely they are loaded lazily. Previously we used a flag to indicate a file camm from a static library. This patch simplifies this by simply keeping a different array that contains the static libraries so we don't need to parse them out again.	2022-05-12 20:45:49 -04:00
Joseph Huber	1bfa88d0c5	[LinkerWrapper] Remove stripping features from the linker wrapper Summary: The linker wrapper previously had functionality to strip the sections manually. We don't use this at all because this is much better done by the linker via the `SHF_EXCLUDE` flag. This patch simply removes the support for thi sfeature to simplify the code.	2022-05-12 20:45:49 -04:00
Joseph Huber	42a1fb5ca5	[LinkerWrapper][Fix} Fix bad alignment from extracted archive members Summary: We use embedded binaries to extract offloading device code from the host fatbinary. This uses a binary format whose necessary alignment is eight bytes. The alignment is included within the ELF section type so the data extracted from the ELF should always be aligned at that amount. However, if this file was extraqcted from a static archive, it was being sent as an offset in the archive file which did not have the same alignment guaruntees as the ELF file. This was causing errors in the UB-sanitizer build as it would occasionally try to access a misaligned address. To fix this, I simply copy the memory directly to a new buffer which is guarnteed to have worst-case alignment of 16 in the case that it's not properly aligned.	2022-05-11 16:56:41 -04:00
Joseph Huber	f49d576a88	[CUDA] Add wrapper code generation for registering CUDA images This patch adds the necessary code generation to create the wrapper code that registers all the globals in CUDA. We create the necessary functions and iterate through the list of `__start_cuda_offloading_entries` to find which globals must be registered. This is very similar to the code generation done currently in Clang for non-rdc builds, but here we are registering a fully linked fatbinary and finding the globals via the above sections. With this we should be able to fully support basic RDC / LTO building of CUDA code. It's also worth noting that this does not include the necessary PTX to JIT the image, so to use this support the offloading architecture must match the system's architecture. Depends on D123810 Reviewed By: tra Differential Revision: https://reviews.llvm.org/D123812	2022-05-11 07:30:25 -04:00
Joseph Huber	e7858a9fab	[Cuda] Add initial support for wrapping CUDA images in the new driver. This patch adds the initial support for wrapping CUDA images. This requires changing some of the logic for how we bundle images. We now need to copy the image for all kinds that are active for the architecture. Then we need to run a separate wrapping job if the Kind is Cuda. For cuda wrapping we need to use the `fatbinary` program from the CUDA SDK to bundle all the binaries together. This is then passed to a new function to perfom the actual module code generation that will be implemented in a later patch. Depends on D120273 D123471 Reviewed By: tra Differential Revision: https://reviews.llvm.org/D123810	2022-05-11 07:30:23 -04:00
Joseph Huber	e12905b4d5	[OpenMP] Add basic support for properly handling static libraries Currently we handle static libraries like any other object in the linker wrapper. However, this does not preserve the sematnics that dictate static libraries should be lazily loaded as the symbols are needed. This allows us to ignore linking in architectures that are not used by the main application being compiled. This patch adds the basic support for detecting if a file came from a static library, and only including it in the link job if it's used by other object files. This patch only adds the basic support, to be more correct we should check the symbols and only inclue the library if the link job contains symbols that are needed. Ideally we could just put this on the linker itself, but nvlink doesn't seem to support `.a` files. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D125092	2022-05-06 11:20:58 -04:00

1 2

90 Commits