llvm-project

Commit Graph

Author	SHA1	Message	Date
Joe Loser	5e96cea1db	[llvm] Use std::size instead of llvm::array_lengthof LLVM contains a helpful function for getting the size of a C-style array: `llvm::array_lengthof`. This is useful prior to C++17, but not as helpful for C++17 or later: `std::size` already has support for C-style arrays. Change call sites to use `std::size` instead. Differential Revision: https://reviews.llvm.org/D133429	2022-09-08 09:01:53 -06:00
Fangrui Song	fbf5e87219	[MC] Support writing ELFCOMPRESS_ZSTD compressed debug info sections and add --compress-debug-sections=zstd to llvm-mc for testing. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D130724	2022-09-08 01:00:06 -07:00
Nikita Popov	0444b40ed3	Revert "[Support] Add llvm::compression::{getReasonIfUnsupported,compress,decompress}" This reverts commit `19dc3cff0f`. This reverts commit `5b19a1f8e8`. This reverts commit `9397648ac8`. This reverts commit `10842b4475`. Breaks the GCC build, as reported here: https://reviews.llvm.org/D130506#3776415	2022-09-08 09:33:12 +02:00
Fangrui Song	9397648ac8	[MC] Support writing ELFCOMPRESS_ZSTD compressed debug info sections and add --compress-debug-sections=zstd to llvm-mc for testing. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D130724	2022-09-08 00:03:40 -07:00
Marco Elver	343700358f	[AsmPrinter] Emit PCs into requested PCSections Interpret MD_pcsections in AsmPrinter emitting the requested metadata to the associated sections. Functions and normal instructions are handled. Differential Revision: https://reviews.llvm.org/D130879	2022-09-07 11:36:02 +02:00
Markus Böck	f049b2c3fc	[MC] Emit Stackmaps before debug info This patch is essentially an alternative to https://reviews.llvm.org/D75836 and was mentioned by @lhames in a comment. The gist of the issue is that Mach-O has restrictions on which kind of sections are allowed after debug info has been emitted, which is also properly asserted within LLVM. Problem is that stack maps are currently emitted as one of the last sections in each target-specific AsmPrinter so far, which would cause the assertion to trigger. The current approach of special casing for the `__LLVM_STACKMAPS` section is not viable either, as downstream users can overwrite the stackmap format using plugins, which may want to use different sections. This patch fixes the issue by emitting the stack map earlier, right before debug info is emitted. The way this is implemented is by taking the choice when to emit the StackMap away from the target AsmPrinter and doing so in the base class. The only disadvantage of this approach is that the `StackMaps` member is now part of the base class, even for targets that do not support them. This is functionaly not a problem however, as emitting an empty `StackMaps` is a no-op. Differential Revision: https://reviews.llvm.org/D132708	2022-09-06 20:20:56 +02:00
Kazu Hirata	fedc59734a	[llvm] Use range-based for loops (NFC)	2022-09-03 11:17:40 -07:00
Sam Clegg	92920c4fe3	[MC][WebAssembly] Allow accurate errors in doBeforeLabelEmit Although we only currently have one error produced in this function I am working on changes right now that add some more. This change makes the error location more accurate. Differential Revision: https://reviews.llvm.org/D133016	2022-09-01 01:26:33 -07:00
Sam Clegg	c5c4ba37b1	[WebAssembly][MC] Avoid the need for .size directives for functions Warn if `.size` is specified for a function symbol. The size of a function symbol is determined solely by its content. I noticed this simplification was possible while debugging #57427, but this change doesn't fix that specific issue. Differential Revision: https://reviews.llvm.org/D132929	2022-08-31 14:28:56 -07:00
Daniel Bertalan	389e0a81a1	[lld-macho] Support synthesizing __TEXT,__init_offsets This section stores 32-bit `__TEXT` segment offsets of initializer functions, and is used instead of `__mod_init_func` when chained fixups are enabled. Storing the offsets lets us avoid emitting fixups for the initializers. Differential Revision: https://reviews.llvm.org/D132947	2022-08-31 10:13:45 +02:00
Kazu Hirata	0e9d37ff95	[llvm] Qualify auto in range-based for loops (NFC)	2022-08-28 23:29:00 -07:00
Adhemerval Zanella	abcb3d58d9	[MC][ELF] Mark GNU ABI if ifunc are used Similar to D107861. Some tools required the GNU ABI mark to output the symbol is a IFUNC type correctly (for instance binutils readelf). Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D131162	2022-08-25 18:19:10 -03:00
Stephen Long	525af9f8eb	[MC] Omit fill value if it's zero when emitting code alignment Previously, we were generating zeroes when generating code alignments for AArch64, but now we should omit the value and let the assembler choose to generate nops or zeroes. Reviewed By: efriedma, MaskRay Differential Revision: https://reviews.llvm.org/D132508	2022-08-25 10:07:33 -07:00
Matthias Braun	443d9a46a8	Revert "MC: make section classification a bit more thorough" This reverts commit `73a9dfcee2`. We started to hit assertions when compiling & assembling separately. See comments in https://reviews.llvm.org/D131270	2022-08-24 17:57:45 -07:00
Sami Tolvanen	cff5bef948	KCFI sanitizer The KCFI sanitizer, enabled with `-fsanitize=kcfi`, implements a forward-edge control flow integrity scheme for indirect calls. It uses a !kcfi_type metadata node to attach a type identifier for each function and injects verification code before indirect calls. Unlike the current CFI schemes implemented in LLVM, KCFI does not require LTO, does not alter function references to point to a jump table, and never breaks function address equality. KCFI is intended to be used in low-level code, such as operating system kernels, where the existing schemes can cause undue complications because of the aforementioned properties. However, unlike the existing schemes, KCFI is limited to validating only function pointers and is not compatible with executable-only memory. KCFI does not provide runtime support, but always traps when a type mismatch is encountered. Users of the scheme are expected to handle the trap. With `-fsanitize=kcfi`, Clang emits a `kcfi` operand bundle to indirect calls, and LLVM lowers this to a known architecture-specific sequence of instructions for each callsite to make runtime patching easier for users who require this functionality. A KCFI type identifier is a 32-bit constant produced by taking the lower half of xxHash64 from a C++ mangled typename. If a program contains indirect calls to assembly functions, they must be manually annotated with the expected type identifiers to prevent errors. To make this easier, Clang generates a weak SHN_ABS `__kcfi_typeid_<function>` symbol for each address-taken function declaration, which can be used to annotate functions in assembly as long as at least one C translation unit linked into the program takes the function address. For example on AArch64, we might have the following code: ``` .c: int f(void); int (*p)(void) = f; p(); .s: .4byte __kcfi_typeid_f .global f f: ... ``` Note that X86 uses a different preamble format for compatibility with Linux kernel tooling. See the comments in `X86AsmPrinter::emitKCFITypeId` for details. As users of KCFI may need to locate trap locations for binary validation and error handling, LLVM can additionally emit the locations of traps to a `.kcfi_traps` section. Similarly to other sanitizers, KCFI checking can be disabled for a function with a `no_sanitize("kcfi")` function attribute. Relands `67504c9549` with a fix for 32-bit builds. Reviewed By: nickdesaulniers, kees, joaomoreira, MaskRay Differential Revision: https://reviews.llvm.org/D119296	2022-08-24 22:41:38 +00:00
Sami Tolvanen	a79060e275	Revert "KCFI sanitizer" This reverts commit `67504c9549` as using PointerEmbeddedInt to store 32 bits breaks 32-bit arm builds.	2022-08-24 19:30:13 +00:00
Sami Tolvanen	67504c9549	KCFI sanitizer The KCFI sanitizer, enabled with `-fsanitize=kcfi`, implements a forward-edge control flow integrity scheme for indirect calls. It uses a !kcfi_type metadata node to attach a type identifier for each function and injects verification code before indirect calls. Unlike the current CFI schemes implemented in LLVM, KCFI does not require LTO, does not alter function references to point to a jump table, and never breaks function address equality. KCFI is intended to be used in low-level code, such as operating system kernels, where the existing schemes can cause undue complications because of the aforementioned properties. However, unlike the existing schemes, KCFI is limited to validating only function pointers and is not compatible with executable-only memory. KCFI does not provide runtime support, but always traps when a type mismatch is encountered. Users of the scheme are expected to handle the trap. With `-fsanitize=kcfi`, Clang emits a `kcfi` operand bundle to indirect calls, and LLVM lowers this to a known architecture-specific sequence of instructions for each callsite to make runtime patching easier for users who require this functionality. A KCFI type identifier is a 32-bit constant produced by taking the lower half of xxHash64 from a C++ mangled typename. If a program contains indirect calls to assembly functions, they must be manually annotated with the expected type identifiers to prevent errors. To make this easier, Clang generates a weak SHN_ABS `__kcfi_typeid_<function>` symbol for each address-taken function declaration, which can be used to annotate functions in assembly as long as at least one C translation unit linked into the program takes the function address. For example on AArch64, we might have the following code: ``` .c: int f(void); int (*p)(void) = f; p(); .s: .4byte __kcfi_typeid_f .global f f: ... ``` Note that X86 uses a different preamble format for compatibility with Linux kernel tooling. See the comments in `X86AsmPrinter::emitKCFITypeId` for details. As users of KCFI may need to locate trap locations for binary validation and error handling, LLVM can additionally emit the locations of traps to a `.kcfi_traps` section. Similarly to other sanitizers, KCFI checking can be disabled for a function with a `no_sanitize("kcfi")` function attribute. Reviewed By: nickdesaulniers, kees, joaomoreira, MaskRay Differential Revision: https://reviews.llvm.org/D119296	2022-08-24 18:52:42 +00:00
Saleem Abdulrasool	73a9dfcee2	MC: make section classification a bit more thorough This does NOT change the emitted section flags in any way. This only impacts the internal classification of sections. Extend the section classification in LLVM for ELF targets. This has one important change: we now classify sections as text by default rather than readonly. This matches the behaviour for GAS better. Ensure that any section that has a writable attribute set is not treated as readonly. We also special case any section named `.debug_` which is reserved for DWARF as metadata. In the case none of the attributes are set (or because no attributes were provided), consult the section name for classification. We match the well known names and classify the section accordingly. Any remaining section is now classified as text. This change allows us to classify sections in the MC layer more precisely which is needed for subsequent changes for handling target specific behaviour. Differential Revision: https://reviews.llvm.org/D131270 Reviewed By: @echristo	2022-08-19 16:45:41 +00:00
Rainer Orth	d9993484ee	[Sparc] Don't use SunStyleELFSectionSwitchSyntax As discussed in D85414 <https://reviews.llvm.org/D85414>, two tests currently `FAIL` on Sparc since that backend uses the Sun assembler syntax for the `.section` directive, controlled by `SunStyleELFSectionSwitchSyntax`. Instead of adapting the affected tests, this patch changes that default. The internal assembler still accepts both forms as input, only the output syntax is affected. Current support for the Sun syntax is cursory at best: the built-in assembler cannot even assemble some of the directives emitted by GCC, and the set supported by the Solaris assembler is even larger: SPARC Assembly Language Reference Manual, 3.4 Pseudo-Op Attributes <https://docs.oracle.com/cd/E37838_01/html/E61063/gmabi.html#scrolltoc>. A few Sparc test cases need to be adjusted. At the same time, the patch fixes the failures from D85414 <https://reviews.llvm.org/D85414>. Tested on `sparcv9-sun-solaris2.11`. Differential Revision: https://reviews.llvm.org/D85415	2022-08-17 12:59:29 +02:00
Fangrui Song	d797c2ffdb	[DebugInfo] -fdebug-prefix-map: handle '#line "file"' for asm source `getContext().setMCLineTableRootFile` (from D62074) sets `RootFile.Name` to `FirstCppHashFilename`. `RootFile.Name` is not processed by -fdebug-prefix-map and will go to DW_TAG_compile_unit's DT_AT_name and DW_TAG_label's DW_AT_decl_file. Remap `RootFile.Name`. Fix another issue reported by https://github.com/llvm/llvm-project/issues/56609 Reviewed By: #debug-info, dblaikie, raj.khem Differential Revision: https://reviews.llvm.org/D131848	2022-08-14 20:58:23 -07:00
Kazu Hirata	109df7f9a4	[llvm] Qualify auto in range-based for loops (NFC) Identified with readability-qualified-auto.	2022-08-13 12:55:42 -07:00
Fangrui Song	f62e60fb23	[MCDwarf] Respect -fdebug-prefix-map= for generated assembly debug info (DWARF v5) For generated assembly debug info, MCDwarfLineTableHeader::CompilationDir is an unmapped path set in MCContext::setGenDwarfRootFile. Remap it. A relative destination path of -fdebug-prefix-map= exposes a llvm-dwarfdump bug which joins relative DW_AT_comp_dir and directories[0]. Fix https://github.com/llvm/llvm-project/issues/56609 Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D131749	2022-08-12 12:52:36 -07:00
Fangrui Song	b0c4cd35df	[MCDwarf] Use emplace to avoid move assignment. NFC	2022-08-12 05:05:49 +00:00
Eric Astor	94fae7a581	[ms] [llvm-ml] Add support for nested PROC/ENDP pairs This is believed to match behavior by ML.EXE and ML64.EXE. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D131522	2022-08-11 14:19:02 +00:00
Ilya Biryukov	7c80c4d677	[MC] NFC. Avoid redundant copies when constructing StructFieldInfo Follow-up after D131595, see comments in the review thread. The intention of having two constructors was to minimize the copies of `vector`, but a lack of `std::move` on the call site caused the wrong constructor to be called. Switched to a single constructor that accepts a value. Accepting by value allows to have a single constructor and still decide to copy or move on the call site.	2022-08-11 11:53:24 +02:00
esmeyi	7a70e6e224	[XCOFF] ignore the cold attribute. Summary: AIX XCOFF doesn't support the cold feature. While it shouldn't be a function error when XCOFF catching the cold attribute. As with the behavior of other formats, we just ignore the attribute for now. Reviewed By: DiggerLin Differential Revision: https://reviews.llvm.org/D131473	2022-08-11 01:13:05 -04:00
Eric Astor	bc9617899c	[ms] [llvm-ml] Handle OPTION PROLOGUE/EPILOGUE:NONE Since we don't yet implement PROC's PROLOGUE and EPILOGUE support, we can safely ignore the option that disables them. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D131524	2022-08-10 17:52:15 +00:00
Ilya Biryukov	e3b5254d71	[MC] Avoid calling vector<FieldInfo> members before FieldInfo is defined The C++ Standard requires a complete type T when using any members of `vector<T>`, see https://eel.is/c++draft/vector#overview-4. This only breaks with latest libc++ in C++20 mode and does not show up in common configurations. We have an internal experimental configuration that discovered this. Reviewed By: alexfh Differential Revision: https://reviews.llvm.org/D131595	2022-08-10 19:20:09 +02:00
Kazu Hirata	558caf6d2b	[MCParser] Fix a warning This patch fixes: llvm/lib/MC/MCParser/COFFMasmParser.cpp:333:28: error: comparison of integers of different signs: 'unsigned int' and 'int' [-Werror,-Wsign-compare]	2022-08-10 09:56:44 -07:00
Eric Astor	4b566933cf	[ms] [llvm-ml] Add support for the (many) optional SEGMENT parameters Add support for many parameters to the SEGMENT directive Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D131523	2022-08-10 15:55:23 +00:00
Martin Storsjö	0c52ab3968	[MC] [Win64EH] Fix the calculation of the end of epilogs Exclude the terminating end opcode from the epilog - it doesn't correspond to an actual instruction that is included in the epilog itself (within the .seh_startepilogue/.seh_endepilogue range). In most (all?) cases, an epilog is followed by a matching terminating instruction though (a ret or a branch to a tail call), but it's not strictly within the .seh_startepilogue/.seh_endepilogue range. This fixes a number of failed asserts in cases where the codegen has incorrectly reoredered SEH opcodes so they don't match up exactly with their instructions. However this still just avoids failing the assertion; the root cause of generating unexpected epilogs is still present (and fixing that is a less obvious issue). Differential Revision: https://reviews.llvm.org/D131393	2022-08-08 23:03:17 +03:00
Fangrui Song	de9d80c1c5	[llvm] LLVM_FALLTHROUGH => [[fallthrough]]. NFC With C++17 there is no Clang pedantic warning or MSVC C5051.	2022-08-08 11:24:15 -07:00
Kazu Hirata	e20d210eef	[llvm] Qualify auto (NFC) Identified with readability-qualified-auto.	2022-08-07 23:55:27 -07:00
Kazu Hirata	a2d4501718	[llvm] Fix comment typos (NFC)	2022-08-07 00:16:14 -07:00
Zhaoshi Zheng	99e50e5838	[WinEH][ARM64] Split Unwind Info for Fucntions Larger than 1MB Create function segments and emit unwind info of them. A segment must be less than 1MB and no prolog or epilog is splitted between two segments. This patch should generate correct, though not optimal, unwind info for large functions. Currently it only generate pacted info (.pdata) only for functions that are less than 1MB (single-segment functions). This is NFC from before this patch. The next step is to enable (.pdata) only unwind info for the first segment or segments that have neither prolog or epilog in a multi-segment function. Another future work item is to further split segments that require more than 255 code words or have more than 65535 epilogs. Reference: https://docs.microsoft.com/en-us/cpp/build/arm64-exception-handling#function-fragments Differential Revision: https://reviews.llvm.org/D130049	2022-08-05 11:46:41 -07:00
Kazu Hirata	bf6021709a	Use drop_begin (NFC)	2022-07-31 15:17:09 -07:00
Stanislav Mekhanoshin	0562cf442f	Allow data prefetch into non-default address space I am playing with the LoopDataPrefetch pass and found out that it bails to work with a pointer in a non-zero address space. This patch adds the target callback to check if an address space is to be considered for prefetching. Default implementation still only allows address space 0, so this is NFCI. This does not currently affect any known targets, but seems to be generally useful for the future. Differential Revision: https://reviews.llvm.org/D129795	2022-07-27 10:01:26 -07:00
Ying Yi	8d46cb343f	Emit a simple StackSizesSection on PS4. Differential Revision: https://reviews.llvm.org/D130495	2022-07-27 09:39:24 +01:00
Simon Tatham	55f1fbf005	[MC,llvm-objdump,ARM] Target-dependent disassembly resync policy. Currently, when llvm-objdump is disassembling a code section and encounters a point where no instruction can be decoded, it uses the same policy on all targets: consume one byte of the section, emit it as "<unknown>", and try disassembling from the next byte position. On an architecture where instructions are always 4 bytes long and 4-byte aligned, this makes no sense at all. If a 4-byte word cannot be decoded as an instruction, then the next place that a valid instruction could //possibly// be found is 4 bytes further on. Disassembling from a misaligned address can't possibly produce anything that the code generator intended, or that the CPU would even attempt to execute. This patch introduces a new MCDisassembler virtual method called `suggestBytesToSkip`, which allows each target to choose its own resynchronization policy. For Arm (as opposed to Thumb) and AArch64, I've filled in the new method to return a fixed width of 4. Thumb is a more interesting case, because the criterion for identifying 2-byte and 4-byte instruction encodings is very simple, and doesn't require the particular instruction to be recognized. So `suggestBytesToSkip` is also passed an ArrayRef of the bytes in question, so that it can take that into account. The new test case shows Thumb disassembly skipping over two unrecognized instructions, and identifying one as 2-byte and one as 4-byte. For targets other than Arm and AArch64, this is NFC: the base class implementation of `suggestBytesToSkip` still returns 1, so that the existing behavior is unchanged. Other targets can fill in their own implementations as they see fit; I haven't attempted to choose a new behavior for each one myself. I've updated all the call sites of `MCDisassembler::getInstruction` in llvm-objdump, and also one in sancov, which was the only other place I spotted the same idiom of `if (Size == 0) Size = 1` after a call to `getInstruction`. Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D130357	2022-07-26 09:35:30 +01:00
Kazu Hirata	3f3930a451	Remove redundaunt virtual specifiers (NFC) Identified with tidy-modernize-use-override.	2022-07-25 23:00:59 -07:00
Kazu Hirata	b5188591a0	[llvm] Remove redundaunt virtual specifiers (NFC) Identified with modernize-use-override.	2022-07-24 21:50:35 -07:00
Fangrui Song	7feab85df8	[MC] Remove unused renameELFSection	2022-07-24 01:23:07 -07:00
Fangrui Song	6977ff4006	[MC] Delete dead zlib-gnu code and simplify writeSectionData	2022-07-24 01:17:34 -07:00
Dmitri Gribenko	aba43035bd	Use llvm::sort instead of std::sort where possible llvm::sort is beneficial even when we use the iterator-based overload, since it can optionally shuffle the elements (to detect non-determinism). However llvm::sort is not usable everywhere, for example, in compiler-rt. Reviewed By: nhaehnle Differential Revision: https://reviews.llvm.org/D130406	2022-07-23 15:19:05 +02:00
esmeyi	b1847ff068	[XCOFF] write the aux header when the visibility is specified in XCOFF32. The n_type field in the symbol table entry has two interpretations in XCOFF32, and a single interpretation in XCOFF64. The new interpretation is used in XCOFF32 if the value of the o_vstamp field in the auxiliary header is 2. In XCOFF64 and the new XCOFF32 interpretation, the n_type field is used for the symbol type and visibility. The patch writes the aux header with an o_vstamp field value of 2 when the visibility is specified in XCOFF32 to make the new XCOFF32 interpretation used. Reviewed By: DiggerLin, jhenderson Differential Revision: https://reviews.llvm.org/D128148	2022-07-20 07:09:34 -04:00
Jez Ng	2e2737cdf9	[MC][MachO] Change addrsig format + ensure its size is properly set There were two problems with the previous setup: 1. We weren't setting its size, which caused problems when `__llvm_addrsig` wasn't the last section. In particular, `__debug_line` (if created) is generated and placed after `__llvm_addrsig`, and would result in an invalid object file w/ overlapping sections being emitted. 2. The symbol indices could be invalidated if e.g. `llvm-strip` ran on the object file. See discussion [here][1]. To fix both these issues, we use symbol relocations instead of encoding symbol indices directly in the section contents. The section itself doesn't contain any data. That sidesteps the layout problem in addition to solving the second issue. The corresponding LLD change to read in this new format: {D128938}. It will fix the icf-safe.ll test failure on this diff. [1]: https://discourse.llvm.org/t/problems-with-mach-o-address-significance-table-generation/63392/ Reviewed By: #lld-macho, alx32 Differential Revision: https://reviews.llvm.org/D127637	2022-07-19 21:22:23 -04:00
Tim Besard	c71d77876f	[MC] Avoid UAF in WinCOFFObjectWriter with weak symbols. When using weak symbols, the WinCOFFObjectWriter keeps a list (`WeakDefaults`) that's used to make names unique. This list should be reset when the object writer is reset, because otherwise reuse of the object writer can result in freed symbols being accessed. With some added output, this becomes clear when using `llc` in `--run-twice` mode: ``` $ ./llc --compile-twice -mtriple=x86_64-pc-win32 trivial.ll -filetype=obj DefineSymbol::WeakDefaults - .weak.foo.default - .weak.bar.default DefineSymbol::WeakDefaults - .weak.foo.default - áÑJÄ³⌂ p§┼Ø┐☺ - .debug_macinfo.dw - .weak.bar.default ``` This does not seem to leak into the output object file though, so I couldn't come up with a test. I added one that just does `--run-twice` (and verified that it does access freed memory), which should result in detecting the invalid memory accesses when running under ASAN. Observed in a Julia PR where we started using weak symbols: https://github.com/JuliaLang/julia/pull/45649 Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D129840	2022-07-16 13:24:08 +03:00
Kazu Hirata	246bf08db3	Use drop_begin (NFC)	2022-07-15 23:24:59 -07:00
David Blaikie	d63ec445ca	Remove zlibgnu support in llvm-mc The feature's been removed from most other tools in LLVM at this point.	2022-07-14 22:38:59 +00:00
Kazu Hirata	611ffcf4e4	[llvm] Use value instead of getValue (NFC)	2022-07-13 23:11:56 -07:00
Fangrui Song	e690137dde	[Support] Change compression::zlib::{compress,uncompress} to use uint8_t * It's more natural to use uint8_t * (std::byte needs C++17 and llvm has too much uint8_t ) and most callers use uint8_t instead of char *. The functions are recently moved into `llvm::compression::zlib::`, so downstream projects need to make adaption anyway.	2022-07-13 16:26:54 -07:00
spupyrev	eecd41aa09	Revert "Rebase: [Facebook] [MC] Introduce NeverAlign fragment type" This reverts commit `6d0528636a`.	2022-07-11 09:50:47 -07:00
Rafael Auler	6d0528636a	Rebase: [Facebook] [MC] Introduce NeverAlign fragment type Summary: Introduce NeverAlign fragment type. The intended usage of this fragment is to insert it before a pair of macro-op fusion eligible instructions. NeverAlign fragment ensures that the next fragment (first instruction in the pair) does not end at a given alignment boundary by emitting a minimal size nop if necessary. In effect, it ensures that a pair of macro-fusible instructions is not split by a given alignment boundary, which is a precondition for macro-op fusion in modern Intel Cores (64B = cache line size, see Intel Architecture Optimization Reference Manual, 2.3.2.1 Legacy Decode Pipeline: Macro-Fusion). This patch introduces functionality used by BOLT when emitting code with MacroFusion alignment already in place. The use case is different from BoundaryAlign and instruction bundling: - BoundaryAlign can be extended to perform the desired alignment for the first instruction in the macro-op fusion pair (D101817). However, this approach has higher overhead due to reliance on relaxation as BoundaryAlign requires in the general case - see https://reviews.llvm.org/D97982#2710638. - Instruction bundling: the intent of NeverAlign fragment is to prevent the first instruction in a pair ending at a given alignment boundary, by inserting at most one minimum size nop. It's OK if either instruction crosses the cache line. Padding both instructions using bundles to not cross the alignment boundary would result in excessive padding. There's no straightforward way to request instruction bundling to avoid a given end alignment for the first instruction in the bundle. LLVM: https://reviews.llvm.org/D97982 Manual rebase conflict history: https://phabricator.intern.facebook.com/D30142613 Test Plan: sandcastle Reviewers: #llvm-bolt Subscribers: phabricatorlinter Differential Revision: https://phabricator.intern.facebook.com/D31361547	2022-07-11 09:31:52 -07:00
Cole Kissane	ea61750c35	[NFC] Refactor llvm::zlib namespace * Refactor compression namespaces across the project, making way for a possible introduction of alternatives to zlib compression. Changes are as follows: * Relocate the `llvm::zlib` namespace to `llvm::compression::zlib`. Reviewed By: MaskRay, leonardchan, phosek Differential Revision: https://reviews.llvm.org/D128953	2022-07-08 11:19:07 -07:00
Eric Astor	33f2a00023	[ms] [llvm-ml] Add support for the remaining binary named operators Finish adding support for the remaining binary named operators in expression context: XOR, SHL, and SHR. Differential Revision: https://reviews.llvm.org/D129299	2022-07-07 21:45:23 -04:00
Joseph Huber	1d2ce4da84	[Object] Add ELF section type for offloading objects Currently we use the `.llvm.offloading` section to store device-side objects inside the host, creating a fat binary. The contents of these sections is currently determined by the name of the section while it should ideally be determined by its type. This patch adds the new `SHT_LLVM_OFFLOADING` section type to the ELF section types. Which should make it easier to identify this specific data format. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D129052	2022-07-07 12:20:30 -04:00
Eric Astor	7d1a295484	[ms] [llvm-ml] Add support for anonymous labels (`@@`, `@B`, `@F`) ml.exe and ml64.exe support the use of anonymous labels, with @B and @F referring to the previous and next anonymous label respectively. We add similar support to llvm-ml, with the exception that we are unable to emit an error message for an @F expression not followed by a @@ label. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D128944	2022-07-07 10:29:10 -04:00
esmeyi	83456f2b4f	[NFC][XCOFF] remove an unused global variable.	2022-06-30 06:55:49 -04:00
Keegan Saunders	fd6b9074ca	[MC] Skip lower-case integer suffixes `mov x0, 1024u` is permitted in binutils but rejected by the integrated assembler. Support the case. This is especially important when using the C pre-processor with the assembler: some shared code between C and assembler may use lower-cased suffices. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D128871	2022-06-29 20:55:05 -07:00
Chen Zheng	60a85726ed	[XCOFF] make sure same number of paddings are added for DWARF sections We are going to change alignment for DWARF sections. This patch is to fix functionality issue if the alignment is not the same with DefaultSectionAlign defined in XCOFFObjectWriter.cpp. Currently no test for this patch as for now alignment for DWARF sections and other sections are always the same. This patch will be tested when patch changing DWARF section is merged in. Reviewed By: Esme Differential Revision: https://reviews.llvm.org/D116092	2022-06-29 04:12:35 +00:00
Rahman Lavaee	0aa6df6575	[Propeller] Encode address offsets of basic blocks relative to the end of the previous basic blocks. This is a resurrection of D106421 with the change that it keeps backward-compatibility. This means decoding the previous version of `LLVM_BB_ADDR_MAP` will work. This is required as the profile mapping tool is not released with LLVM (AutoFDO). As suggested by @jhenderson we rename the original section type value to `SHT_LLVM_BB_ADDR_MAP_V0` and assign a new value to the `SHT_LLVM_BB_ADDR_MAP` section type. The new encoding adds a version byte to each function entry to specify the encoding version for that function. This patch also adds a feature byte to be used with more flexibility in the future. An use-case example for the feature field is encoding multi-section functions more concisely using a different format. Conceptually, the new encoding emits basic block offsets and sizes as label differences between each two consecutive basic block begin and end label. When decoding, offsets must be aggregated along with basic block sizes to calculate the final offsets of basic blocks relative to the function address. This encoding uses smaller values compared to the existing one (offsets relative to function symbol). Smaller values tend to occupy fewer bytes in ULEB128 encoding. As a result, we get about 17% total reduction in the size of the bb-address-map section (from about 11MB to 9MB for the clang PGO binary). The extra two bytes (version and feature fields) incur a small 3% size overhead to the `LLVM_BB_ADDR_MAP` section size. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D121346	2022-06-28 07:42:54 -07:00
Egor Zhdan	be4b40d5bc	[MC] Allow annotating custom sections as zerofill This is already possible for e.g. `cstring_literals`, but the entry for zerofill was unnamed. rdar://90336380 Differential Revision: https://reviews.llvm.org/D128654	2022-06-28 15:08:47 +01:00
Kazu Hirata	a81b64a1fb	[llvm] Use Optional::has_value instead of Optional::hasValue (NFC) This patch replaces x.hasValue() with x.has_value() where x is not contextually convertible to bool.	2022-06-26 16:10:42 -07:00
Kazu Hirata	a7938c74f1	[llvm] Don't use Optional::hasValue (NFC) This patch replaces Optional::hasValue with the implicit cast to bool in conditionals only.	2022-06-25 21:42:52 -07:00
Kazu Hirata	3b7c3a654c	Revert "Don't use Optional::hasValue (NFC)" This reverts commit `aa8feeefd3`.	2022-06-25 11:56:50 -07:00
Kazu Hirata	aa8feeefd3	Don't use Optional::hasValue (NFC)	2022-06-25 11:55:57 -07:00
esmeyi	d29e986ed5	[XCOFF] write the real source file name in C_FILE symbol. The symbol table starts with all the C_FILE symbols. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D126623	2022-06-22 06:23:36 -04:00
Kazu Hirata	7a47ee51a1	[llvm] Don't use Optional::getValue (NFC)	2022-06-20 22:45:45 -07:00
Kazu Hirata	0916d96d12	Don't use Optional::hasValue (NFC)	2022-06-20 20:17:57 -07:00
Kazu Hirata	e0e687a615	[llvm] Don't use Optional::hasValue (NFC)	2022-06-20 10:38:12 -07:00
Kazu Hirata	129b531c9c	[llvm] Use value_or instead of getValueOr (NFC)	2022-06-18 23:07:11 -07:00
Kazu Hirata	a5258e5b27	Revert "[MCParser] Use default member initialization (NFC)" This reverts commit `68090a014c`. The patch seems to cause a build error on ppc64le: https://lab.llvm.org/buildbot#builders/121/builds/20536	2022-06-18 21:02:09 -07:00
Kazu Hirata	68090a014c	[MCParser] Use default member initialization (NFC) Identified with modernize-use-default-member-init.	2022-06-18 20:54:56 -07:00
Chris Bieneman	0f6ab346fd	bzero->memset - Fix Windows bots How does Windows not have bzero? How did I break this while working on a Windows-specific component of LLVM? The world may never know...	2022-06-17 21:41:58 -05:00
Chris Bieneman	3adc908b26	[DirectX][MC] Add MC support for DXContainer DXContainer files resemble traditional object files in that they are comprised of parts which resemble sections. Adding DXContainer as an object file format in the MC layer will allow emitting DXContainer objects through the normal object emission pipeline. Differential Revision: https://reviews.llvm.org/D127165	2022-06-17 21:19:32 -05:00
Guillaume Chatelet	412c788ab0	[NFC][Alignment] Use Align in MCAlignFragment	2022-06-15 12:31:00 +00:00
Jez Ng	d4bcb45db7	[MC][re-land] Omit DWARF unwind info if compact unwind is present where eligible This reverts commit `d941d59783`. Differential Revision: https://reviews.llvm.org/D122258	2022-06-12 17:24:19 -04:00
Jez Ng	d941d59783	Revert "[MC] Omit DWARF unwind info if compact unwind is present where eligible" This reverts commit `ef501bf85d`.	2022-06-12 10:47:08 -04:00
Jez Ng	ef501bf85d	[MC] Omit DWARF unwind info if compact unwind is present where eligible Previously, omitting unnecessary DWARF unwinds was only done in two cases: * For Darwin + aarch64, if no DWARF unwind info is needed for all the functions in a TU, then the `__eh_frame` section would be omitted entirely. If any one function needed DWARF unwind, then MC would emit DWARF unwind entries for all the functions in the TU. * For watchOS, MC would omit DWARF unwind on a per-function basis, as long as compact unwind was available for that function. This diff makes it so that we omit DWARF unwind on a per-function basis for Darwin + aarch64 as well. In addition, we introduce the flag `--emit-dwarf-unwind=` which can toggle between `always`, `no-compact-unwind` (only emit DWARF when CU cannot be emitted for a given function), and the target platform `default`. `no-compact-unwind` is particularly useful for newer x86_64 platforms: we don't want to omit DWARF unwind for x86_64 in general due to possible backwards compat issues, but we should make it possible for people to opt into this behavior if they are only targeting newer platforms. Motivation: I'm working on adding support for `__eh_frame` to LLD, but I'm concerned that we would suffer a perf hit. Processing compact unwind is already expensive, and that's a simpler format than EH frames. Given that MC currently produces one EH frame entry for every compact unwind entry, I don't think processing them will be cheap. I tried to do something clever on LLD's end to drop the unnecessary EH frames at parse time, but this made the code significantly more complex. So I'm looking at fixing this at the MC level instead. Addendum: It turns out that there was a latent bug in the X86 backend when `OmitDwarfIfHaveCompactUnwind` is naively enabled, which is not too surprising given that this combination has not been heretofore used. For functions that have unwind info that cannot be encoded with CU, MC would end up dropping both the compact unwind entry (OK; existing behavior) as well as the DWARF entries (not OK). This diff fixes things so that we emit the DWARF entry, as well as a CU entry with encoding `UNWIND_X86_MODE_DWARF` -- this basically tells the unwinder to look for the DWARF entry. I'm not 100% sure the `UNWIND_X86_MODE_DWARF` CU entry is necessary, this was the simplest fix. ld64 seems to be able to handle both the absence and presence of this CU entry. Ultimately ld64 (and LLD) will synthesize `UNWIND_X86_MODE_DWARF` if it is absent, so there is no impact to the final binary size. Reviewed By: davide, lhames Differential Revision: https://reviews.llvm.org/D122258	2022-06-12 10:03:56 -04:00
Fangrui Song	adf4142f76	[MC] De-capitalize SwitchSection. NFC Add SwitchSection to return switchSection. The API will be removed soon.	2022-06-10 22:50:55 -07:00
Florian Mayer	0593ce5f0b	[MC] Add 'G' to augmentation string for MTE instrumented functions This was agreed on in https://lists.llvm.org/pipermail/llvm-dev/2020-May/141345.html The thread proposed two options * add a character to augmentation string and handle in libuwind * use a separate personality function. It was determined that this is the simpler and better option. This is part of ARM's Aarch64 ABI: https://github.com/ARM-software/abi-aa/blob/main/aadwarf64/aadwarf64.rst#id22 The next step after this is teaching libunwind to untag when this augmentation character is set. Reviewed By: MaskRay, eugenis Differential Revision: https://reviews.llvm.org/D127007	2022-06-08 12:36:32 -07:00
Hongtao Yu	ab34ab2b87	[PseudoProbe] Use callee name as callsite identfier for MCDecodedPseudoProbeInlineTree. The callsite identifier used in pseudo probe encoding and decoding is consisted of a function name and the callsite probe id. For encoding, i.e., `MCPseudoProbeInlineTree`, the function name is callee function name. However for decoding, i.e., `MCDecodedPseudoProbeInlineTree`, the caller function name is used actually. This results in multiple callees that are inlined at the same callsite, likely via indirect call promotion, sharing the same decoded inline frame. While it is not a problem for profile generation, it confuses probe re-encoding in Bolt. In Bolt, we decode pseudo probes first and build `MCDecodedPseudoProbeInlineTree`. The decoded tree is used for final re-encoding. Here comes the problem. Two inlinees from the same callsite share the same decoded inline frame. During re-encoding, the frame name (whatever inlinee comes first) will be used and encoded in the bolted binary. This will cause wrong inline contexts in the profile generated on the bolted binary. The fix is a no-op to pre-bolt profile generation. Some of the bolt tests are not yet upstreamed, thus I'm not adding a bolt test here. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D126434	2022-06-08 10:54:40 -07:00
Fangrui Song	15d82c62dc	[MC] De-capitalize MCStreamer functions Follow-up to `c031378ce0` . The class is mostly consistent now.	2022-06-07 00:31:02 -07:00
luxufan	a7b154aa17	[MC][ARM] Reuse symbol value in constant pool Fix https://github.com/llvm/llvm-project/issues/55816 Before this patch, MCConstantExpr were reused, but MCSymbolExpr were not. To reuse symbol value, this patch added a DenseMap to record the symbol value. Differential Revision: https://reviews.llvm.org/D127113	2022-06-07 13:39:52 +08:00
Martin Storsjö	98dc3e86fd	[ARM] [MinGW] Default to WinEH exception handling instead of Dwarf Switching this target to WinEH also seems to affect the `-windows-itanium` target. Differential Revision: https://reviews.llvm.org/D126870	2022-06-06 23:27:19 +03:00
Fangrui Song	77e300ffdf	[MC] Change EndOfStatement "unexpected tokens in .xxx directive " to "expected newline"	2022-06-05 15:11:01 -07:00
serge-sans-paille	4d73c46ccf	[MCParser] Set default alignment value when meeting invalid align Upon invalid alignment value, still set a default valid alignment value to avoid hitting later asserts. Fix #55273 Differential Revision: https://reviews.llvm.org/D125688	2022-06-02 16:22:35 +02:00
Martin Storsjö	298e9cac92	[MC] [Win64EH] Check that the SEH unwind opcodes match the actual instructions It's a fairly common issue that the generating code incorrectly marks instructions as narrow or wide; check that the instruction lengths add up to the expected value, and error out if it doesn't. This allows catching code generation bugs. Also check that prologs and epilogs are properly terminated, to catch other code generation issues. Differential Revision: https://reviews.llvm.org/D125647	2022-06-01 11:25:49 +03:00
Martin Storsjö	d4022ff331	[MC] [Win64EH] Optimize the ARM unwind info Use the packed unwind info format if possible; otherwise try to create a packed epilog. Differential Revision: https://reviews.llvm.org/D125646	2022-06-01 11:25:49 +03:00
Martin Storsjö	6b75a3523f	[ARM] [MC] Add support for writing ARM WinEH unwind info This includes .seh_* directives for generating it from assembly. It is designed fairly similarly to the ARM64 handling. For .seh_handler directives, such as ".seh_handler __C_specific_handler, @except" (which is supported on x86_64 and aarch64 so far), the "@except" bit doesn't work in ARM assembly, as '@' is used as a comment character (on all current platforms). Allow using '%' instead of '@' for this purpose. This convention is used by GAS in similar contexts already, e.g. [1]: Note on targets where the @ character is the start of a comment (eg ARM) then another character is used instead. For example the ARM port uses the % character. In practice, this unfortunately means that all such .seh_handler directives will need ifdefs for ARM. Contrary to ARM64, on ARM, it's quite common that we can't evaluate e.g. the function length at this point, due to instructions whose length is finalized later. (Also, inline jump tables end with a ".p2align 1".) If unable to to evaluate the function length immediately, emit it as an MCExpr instead. If we'd implement splitting the unwind info for a function (which isn't implemented for ARM64 yet either), we wouldn't know whether we need to split it though. Avoid calling getFrameIndexOffset() on an unset FuncInfo.UnwindHelpFrameIdx, to avoid triggering asserts in the preexisting testcase CodeGen/ARM/Windows/wineh-basic.ll. (Once MSVC exception handling is fully implemented, those changes can be reverted.) [1] https://sourceware.org/binutils/docs/as/Section.html#Section Differential Revision: https://reviews.llvm.org/D125645	2022-06-01 11:25:48 +03:00
Martin Storsjö	e71b07e468	[MC] [Win64EH] Wrap the epilog instructions in a struct. NFC. For ARM SEH, the epilogs will need a little more associated data than just the plain list of opcodes. This is a preparatory refactoring for D125645. Differential Revision: https://reviews.llvm.org/D125879	2022-06-01 11:25:48 +03:00
Fangrui Song	9ee15bba47	[MC] Lower case the first letter of EmitCOFF* EmitWin* EmitCV*. NFC	2022-05-26 00:14:08 -07:00
Maksim Panchenko	bed9efed71	[MCDisassembler] Disambiguate Size parameter in tryAddingSymbolicOperand() MCSymbolizer::tryAddingSymbolicOperand() overloaded the Size parameter to specify either the instruction size or the operand size depending on the architecture. However, for proper symbolic disassembly on X86, we need to know both sizes, as an instruction can have two operands, and the instruction size cannot be reliably calculated based on the operand offset and its size. Hence, split Size into OpSize and InstSize. For X86, the new interface allows to fix a couple of issues: * Correctly adjust the value of PC-relative operands. * Set operand size to zero when the operand is specified implicitly. Differential Revision: https://reviews.llvm.org/D126101	2022-05-25 13:44:32 -07:00
Yusra Syeda	5ac411aea8	[SystemZ][z/OS] Add the PPA1 to SystemZAsmPrinter Differential Revision: https://reviews.llvm.org/D125725	2022-05-18 14:13:17 -04:00
Martin Storsjö	924defada9	[MC] [Win64EH] Don't produce packed ARM64 unwind info with homed parameters There's an inconsistency regarding the epilogs of packed ARM64 unwind info with homed parameters; according to the documentation (and according to common sense), the epilog wouldn't have a series of nop instructions matching the stp x0-x7 in the prolog - however in practice, RtlVirtualUnwind still seems to behave as if the epilog does have the mirrored nops from the prolog. In practice, MSVC doesn't seem to produce packed unwind info with homed parameters, which might be why this inconsistency hasn't been noticed. Thus, to play it safe, avoid creating such packed unwind info with homed parameters. (LLVM's current behaviour matches the current runtime behaviour of RtlVirtualUnwind, but if it later is bug fixed to match the documentation, such unwind information would be incorrect.) See https://github.com/llvm/llvm-project/issues/54879 for further discussion on the matter. Differential Revision: https://reviews.llvm.org/D125876	2022-05-18 20:34:59 +03:00
Martin Storsjö	091a55c16a	[MC] [Win64EH] Remove an unused parameter to ARM64EmitUnwindCode. NFC. Differential Revision: https://reviews.llvm.org/D125878	2022-05-18 20:33:17 +03:00
esmeyi	8d6e2c3e3d	[XCOFF] support writing sections, relocations and symbols for XCOFF64. This is the second patch to enable the XCOFF64 object writer. Reviewed By: jhenderson, shchenz Differential Revision: https://reviews.llvm.org/D122287	2022-05-17 04:27:47 -04:00
Martin Storsjö	64a3c63e01	[MC] [Win64EH] Check for matches between epilogs and the prolog on ARM64 This allows sharing opcodes between prolog and epilog even when there is more than one epilog. I didn't make any handcrafted special MC level testcases for this (yet at least), but it does seem to have the expected effect on two existing CodeGen level testcases. Differential Revision: https://reviews.llvm.org/D125619	2022-05-17 00:41:39 +03:00
Martin Storsjö	cabefea2ec	[MC] [Win64EH] Try writing an ARM64 "packed epilog" even if the epilog doesn't share opcodes with the prolog The "packed epilog" form only implies that the epilog is located exactly at the end of the function (so the location of the epilog is implicit from the epilog opcodes), but it doesn't have to share opcodes with the prolog - as long as the total number of opcode bytes and the offset to the epilog fit within the bitfields. This avoids writing a 4 byte epilog scope in many cases. (I haven't measured how much this shrinks actual xdata sections in practice though.) Differential Revision: https://reviews.llvm.org/D125536	2022-05-17 00:41:39 +03:00
Alan Zhao	8a1b5f2680	[llvm-ml] Add support for extern proc EXTERN PROC isn't really well documented in MSVC, so after poking around it seems as if it's just a regular extern symbol. Interestingly enough, under MSVC the following is allowed: extern foo:proc mov eax, foo MSVC will output: mov eax, 0 while llvm-ml will currently output: mov eax, dword ptr [foo] (since foo is an extern) Arguably, llvm-ml's output makes more sense, even though it's inconsistent with MSVC ml. However, since moving an extern proc symbol to a register doesn't really make sense in the first place, we'll treat it as undefined behavior for now. Reviewed By: epastor Differential Revision: https://reviews.llvm.org/D125582	2022-05-13 17:20:55 -04:00

1 2 3 4 5 ...

4891 Commits