llvm-project

Commit Graph

Author	SHA1	Message	Date
Jacob Lambert	7470244475	[AMDGPU] Add agpr_count to metadata and AsmParser gfx90a allows the number of ACC registers (AGPRs) to be set independently to the VGPR registers. For both HSA and PAL metadata, we now include an "agpr_count" key to report the number of AGPRs set for supported devices (gfx90a, gfx908, as determined by hasMAIInsts()). This is collected from SIProgramInfo.NumAccVGPR for both HSA and PAL. The AsmParser also now recognizes ".kernel.agpr_count" for supported devices. Differential Revision: https://reviews.llvm.org/D116140	2022-02-16 15:17:23 -08:00
Kristof Beyls	520a925272	Fix 2 RestructuredText warnings.	2022-02-16 14:16:52 +01:00
Simon Moll	03e83cc8eb	[VP] vp.fptosi cast intrinsic and docs Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D119535	2022-02-15 18:17:19 +01:00
zhijian	0135aa7b98	[llvm-nm] add a new option -X to specify the type of object file llvm-nm should examine Summary: Added a new option "-X" to specify which type of object file should be examined. For example: 1. "llvm-nm -X64 archive.a" only deals with the 64-bit object files in archive.a and ignores all 32-bit object files in archive.a. 2. "llvm-nm -X32 xcoffobj32.o xcoffobj64.o" only deals with the 32-bit object file "xcoffobj32.o"; the 64-bit object file "xcoffobj64.o" is ignored. Reviewers: James Henderson, Fangrui Song Differential Revision: https://reviews.llvm.org/D118193	2022-02-15 09:43:31 -05:00
David Spickett	8d4d0f7d1a	[lldb] Remove memory region non-address change from release notes This is now on 14.x as af19ae529271f9ae96927662d7d876489115fb26 so it is not new to 15.	2022-02-15 11:36:55 +00:00
Markus Böck	db8ae2fef1	[llvm][doc] Update comments and documentation of custom stackmap formats in GC Since https://reviews.llvm.org/D53892 it is possible to emit a custom stackmap by overwriting the emitStackMaps method of GCMetadataPrinter. That way even AOT compilers can generate a more efficient and more suitable format for their needs. This patch updates documentation and stale comments in source code. In particular it removes the issue from the issue list in the Statepoints documentation and adjusts comments in GCStrategy. Differential Revision: https://reviews.llvm.org/D119660	2022-02-15 12:17:19 +01:00
Ahmed Bougacha	c703f852c9	[IR] Define "ptrauth" operand bundle. This introduces a new "ptrauth" operand bundle to be used in call/invoke. At the IR level, it's semantically equivalent to an @llvm.ptrauth.auth followed by an indirect call, but it additionally provides additional hardening, by preventing the intermediate raw pointer from being exposed. This mostly adds the IR definition, verifier checks, and support in a couple of general helper functions. Clang IRGen and backend support will come separately. Note that we'll eventually want to support this bundle in indirectbr as well, for similar reasons. indirectbr currently doesn't support bundles at all, and the IR data structures need to be updated to allow that. Differential Revision: https://reviews.llvm.org/D113685	2022-02-14 11:27:35 -08:00
Momchil Velikov	6398903ac8	Extend the `uwtable` attribute with unwind table kind We have the `clang -cc1` command-line option `-funwind-tables=1\|2` and the codegen option `VALUE_CODEGENOPT(UnwindTables, 2, 0) ///< Unwind tables (1) or asynchronous unwind tables (2)`. However, this is encoded in LLVM IR by the presence or the absence of the `uwtable` attribute, i.e. we lose the information whether we want to generate just some unwind tables or asynchronous unwind tables. Asynchronous unwind tables take more space in the runtime image, I'd estimate something like 80-90% more, as the difference is adding roughly the same number of CFI directives as for prologues, only a bit simpler (e.g. `.cfi_offset reg, off` vs. `.cfi_restore reg`). Or even more, if you consider tail duplication of epilogue blocks. Asynchronous unwind tables could also restrict code generation to having only a finite number of frame pointer adjustments (an example of not having a finite number of `SP` adjustments is on AArch64 when untagging the stack (MTE) in some cases the compiler can modify `SP` in a loop). Having the CFI precise up to an instruction generally also means one cannot bundle together CFI instructions once the prologue is done, they need to be interspersed with ordinary instructions, which means extra `DW_CFA_advance_loc` commands, further increasing the unwind tables size. That is to say, async unwind tables impose a non-negligible overhead, yet for the most common use cases (like C++ exceptions), they are not even needed. This patch extends the `uwtable` attribute with an optional value: - `uwtable` (default to `async`) - `uwtable(sync)`, synchronous unwind tables - `uwtable(async)`, asynchronous (instruction precise) unwind tables Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D114543	2022-02-14 14:35:02 +00:00
Nikita Popov	5a43a278f7	[Docs] Update OpaquePointers transition state (NFC) We're at a point where working optimized binaries can be produced in opaque pointer mode.	2022-02-14 12:55:58 +01:00
Markus Böck	e101eb5c7b	[llvm][doc] Add Aarch64 to list of architectures supporting statepoints Fixes https://github.com/llvm/llvm-project/issues/53655 Differential Revision: https://reviews.llvm.org/D119659	2022-02-13 20:35:15 +01:00
YASHASVI KHATAVKAR	f9f78a2c40	Fix build broken by missing empty line in SourceLevelDebugging.rst	2022-02-11 15:19:07 -05:00
YASHASVI KHATAVKAR	70fdbf35de	Adding DiBuilder interface for assumed length strings	2022-02-11 14:40:02 -05:00
Julien Pages	dcb2da13f1	[AMDGPU] Add a new intrinsic to control fp_trunc rounding mode Add a new llvm.fptrunc.round intrinsic to precisely control the rounding mode when converting from f32 to f16. Differential Revision: https://reviews.llvm.org/D110579	2022-02-11 12:08:23 -05:00
Louis Dionne	6a7f6e9404	[docs] Fix missing space in the GettingStarted documentation	2022-02-11 09:17:37 -05:00
Arthur Eubanks	2fa87ab524	[docs] Replace `opt -analyze` with better alternatives. `opt -analyze` is legacy PM-specific. Show better ways of doing the same thing, generally with some sort of `-passes=print<foo>`. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D119486	2022-02-10 15:38:31 -08:00
YASHASVI KHATAVKAR	93d1a623ce	Reverting an entire stack of changes causing build failures	2022-02-10 17:58:22 -05:00
YASHASVI KHATAVKAR	ac15cd7af6	Modified SourceLevelDebugging.rst to include information about memory location exp	2022-02-10 15:24:51 -05:00
Louis Dionne	4ae83bb2b1	Update all LLVM documentation mentioning runtimes in LLVM_ENABLE_PROJECTS We are moving away from building the runtimes with LLVM_ENABLE_PROJECTS, however the documentation was largely outdated. This commit updates all the documentation I could find to use LLVM_ENABLE_RUNTIMES instead of LLVM_ENABLE_PROJECTS for building runtimes. Note that in the near future, libcxx, libcxxabi and libunwind will stop supporting being built with LLVM_ENABLE_PROJECTS altogether. I don't know what the plans are for other runtimes like libc, openmp and compiler-rt, so I didn't make any changes to the documentation that would imply something for those projects. Once this lands, I will also cherry-pick this on the release/14.x branch to make sure that LLVM's documentation is up-to-date and reflects what we intend to support in the future. Differential Revision: https://reviews.llvm.org/D119351	2022-02-10 15:05:23 -05:00
David Spickett	2937b28218	Reland "[lldb] Remove non address bits when looking up memory regions" This reverts commit `0df522969a`. Additional checks are added to fix the detection of the last memory region in GetMemoryRegions or repeating the "memory region" command when the target has non-address bits. Normally you keep reading from address 0, looking up each region's end address until you get LLDB_INVALID_ADDR as the region end address. (0xffffffffffffffff) This is what the remote will return once you go beyond the last mapped region: [0x0000fffffffdf000-0x0001000000000000) rw- [stack] [0x0001000000000000-0xffffffffffffffff) --- Problem is that when we "fix" the lookup address, we remove some bits from it. On an AArch64 system we have 48 bit virtual addresses, so when we fix the end address of the [stack] region the result is 0. So we loop back to the start. [0x0000fffffffdf000-0x0001000000000000) rw- [stack] [0x0000000000000000-0x0000000000400000) --- To fix this I added an additional check for the last range. If the end address of the region is different once you apply FixDataAddress, we are at the last region. Since the end of the last region will be the last valid mappable address, plus 1. That 1 will be removed by the ABI plugin. The only side effect is that on systems with non-address bits, you won't get that last catch all unmapped region from the max virtual address up to 0xf...f. [0x0000fffff8000000-0x0000fffffffdf000) --- [0x0000fffffffdf000-0x0001000000000000) rw- [stack] <ends here> Though in some way this is more correct because that region is not just unmapped, it's not mappable at all. No extra testing is needed because this is already covered by TestMemoryRegion.py, I simply forgot to run it on system that had both top byte ignore and pointer authentication. This change has been tested on a qemu VM with top byte ignore, memory tagging and pointer authentication enabled. Reviewed By: omjavaid Differential Revision: https://reviews.llvm.org/D115508	2022-02-10 10:42:49 +00:00
Lu Weining	42fd2bfc90	[LoongArch 1/6] Add triples loongarch{32,64} for the upcoming LoongArch target This is the first patch to incrementally add an MC layer for LoongArch to LLVM. This patch also adds unit testcases for these new triples. RFC for adding this new backend: https://lists.llvm.org/pipermail/llvm-dev/2021-December/154371.html Differential revision: https://reviews.llvm.org/D115857	2022-02-10 10:23:34 +00:00
Daniel Thornburgh	694f384553	[Debuginfod] Flag-determine debuginfod lookups in llvm-symbolizer. This change adds a pair of flags controlling whether llvm-symbolizer attempts debuginfod lookups. Lookups are attempted if --debuginfod is passed and disabled if --no-debuginfod is passed. The default behavior is made more nuanced: debuginfod lookups are now only attempted if an HTTP client is compiled in and at least one backing debuginfod URL was configured via environment variable. Previously, debuginfod lookups would always be attempted, even if there were no chance that they could succeed. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D118665	2022-02-09 22:20:54 +00:00
Craig Topper	60745fb16f	[VP] llvm.vp.fneg intrinsic and LangRef Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D119262	2022-02-09 07:54:36 -08:00
Daniel Thornburgh	dcd4950d42	[Symbolizer] Add Build ID flag to llvm-symbolizer. This adds a --build-id=<hex build ID> flag to llvm-symbolizer. If --obj is unspecified, this will attempt to look up the provided build ID using whatever mechanisms are available to the Symbolizer (typically, debuginfod). The semantics are then as if the found binary were given using the --obj flag. Reviewed By: jhenderson, phosek Differential Revision: https://reviews.llvm.org/D118633	2022-02-08 23:08:18 +00:00
Lancelot Six	046017291f	[AMDGPU][NFC] AMDGPUUsage.rst: fix wording.	2022-02-07 20:06:17 -05:00
Craig Topper	cef177d186	[VP] llvm.vp.fma intrinsic and LangRef Differential Revision: https://reviews.llvm.org/D119185	2022-02-07 15:53:27 -08:00
Keith Smiley	4c12a75e69	[llvm-libtool-darwin] Add -warnings_as_errors libtool can currently produce 2 warnings: 1. No symbols were in the object file 2. An object file with the same basename was specified multiple times The first warning here is often harmless and may just mean you have some translation units with no symbols for the target you're building for. The second warning can lead to real issues like those mentioned in https://reviews.llvm.org/D113130 where ODR violations can slip in. This introduces a new -warnings_as_errors flag that can be used by build systems that want to verify they never hit these warnings. For example with bazel the libtool caller first uniques names to make sure the duplicate base name case is not possible, but if that doesn't work as expected, having it fail would be preferred. It's also worth noting that llvm-libtool-darwin works around an issue that cctools libtool experiences related to debug info and duplicate basenames, the workaround is described here: `30baa5d2a4/llvm/lib/Object/ArchiveWriter.cpp (L424-L465)` And it avoids this bug: `f0cbbb1c37/DuplicateBasenameIssue` Differential Revision: https://reviews.llvm.org/D118931	2022-02-07 14:39:21 -08:00
Mark Murray	3d7662142d	[ARM] Undeprecate complex IT blocks AArch32/Armv8A introduced the performance deprecation of certain patterns of IT instructions. After some debate internal to ARM, this is now being reverted; i.e. no IT instruction patterns are performance deprecated anymore, as the performance degradation is not significant enough. This reverts the following: "ARMv8-A deprecates some uses of the T32 IT instruction. All uses of IT that apply to instructions other than a single subsequent 16-bit instruction from a restricted set are deprecated, as are explicit references to the PC within that single 16-bit instruction. This permits the non-deprecated forms of IT and subsequent instructions to be treated as a single 32-bit conditional instruction." The deprecation no longer applies, but the behaviour may be controlled by the -arm-restrict-it and -arm-no-restrict-it command-line options, with the latter being the default. No warnings about complex IT blocks will be generated. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D118044	2022-02-07 15:47:53 +00:00
Volodymyr Turanskyy	c127ba25fb	Add LLVM Embedded Toolchains call to the table of sync ups. LLVM Embedded Toolchains working group regular sync up calls to start in early March, adding details to the table of sync ups for general reference. Differential Revision: https://reviews.llvm.org/D118884	2022-02-07 16:38:42 +01:00
Dmitry Preobrazhensky	95a52b376a	[AMDGPU][GFX9][DOC][NFC] Corrected description of registers available via getreg/setreg This is to reflect changes introduced by https://reviews.llvm.org/D118860.	2022-02-04 17:55:32 +03:00
Nikita Popov	e990e591c9	[LangRef] Require elementtype attribute for gc.statepoint intrinsic The gc.statepoint intrinsic currently determines the target function type based on the pointer element type of the argument. In order to support opaque pointers, require that the argument is annotated with an elementtype attribute. Here's an example of the change: ; Before: %safepoint_token = tail call token (i64, i32, i1 (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_i1f(i64 0, i32 0, i1 () @return_i1, i32 0, i32 0, i32 0, i32 0) ; After: %safepoint_token = tail call token (i64, i32, i1 (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_i1f(i64 0, i32 0, i1 () elementtype(i1 ()) @return_i1, i32 0, i32 0, i32 0, i32 0) ; After with opaque pointers: %safepoint_token = tail call token (i64, i32, i1 ()*, i32, i32, ...) @llvm.experimental.gc.statepoint.p0(i64 0, i32 0, ptr elementtype(i1 ()) @return_i1, i32 0, i32 0, i32 0, i32 0) Differential Revision: https://reviews.llvm.org/D117890	2022-02-04 09:47:31 +01:00
Changpeng Fang	022c8d4a3f	AMDGPU [NFC]: Fix a few typos in docs AMDGPUUsage.rst Summary: Fix a few typos in docs AMDGPUUsage.rst Differential Revision: https://reviews.llvm.org/D118272	2022-02-02 14:22:52 -08:00
Lancelot SIX	73ed118eda	[Docs][NFC] Contributing.rst: fix wording Fix a sentence containing two consecutive 'and'.	2022-02-02 13:49:03 +01:00
Tom Stellard	a2601c9887	Bump the trunk major version to 15	2022-02-01 23:54:52 -08:00
Tom Stellard	e80c52986e	[docs] Remove hard-coded version numbers from sphinx configs This updates all the non-runtime project release notes to use the version number from CMake instead of the hard-coded version numbers in conf.py. It also hides warnings about pre-releases when the git suffix is dropped from the LLVM version in CMake. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D112181	2022-02-01 23:14:12 -08:00
Tanya Lattner	1b12e92c80	Update status on migration again. Add note about issues with reply by email from emails pre-migration.	2022-02-01 22:25:31 -08:00
Tanya Lattner	bbc5b62e85	Add new status of the move to Discourse.	2022-02-01 18:30:46 -08:00
Tanya Lattner	e36afc6511	Update discourse migration status.	2022-02-01 18:09:31 -08:00
Tanya Lattner	769d634789	Update status of move.	2022-02-01 10:45:40 -08:00
Fangrui Song	30e8f83c84	[GlobalOpt] Don't replace alias with aliasee if either alias/aliasee may be preemptible Generalize D99629 for ELF. A default visibility non-local symbol is preemptible in a -shared link. `isInterposable` is an insufficient condition. Moreover, a non-preemptible alias may be referenced in a sub constant expression which intends to lower to a PC-relative relocation. Replacing the alias with a preemptible aliasee may introduce a linker error. Respect dso_preemptable and suppress optimization to fix the above issues. With the change, `alias = 345` will not be rewritten to use aliasee in a `-fpic` compile. ``` int aliasee; extern int alias __attribute__((alias("aliasee"), visibility("hidden"))); void foo() { alias = 345; } // intended to access the local copy ``` While here, refine the condition for the alias as well. For some binary formats like COFF, `isInterposable` is a sufficient condition. But I think canonicalization for the changed case has little advantage, so I don't bother to add the `Triple(M.getTargetTriple()).isOSBinFormatELF()` or `getPICLevel/getPIELevel` complexity. For instrumentations, it's recommended not to create aliases that refer to globals that have a weak linkage or is preemptible. However, the following is supported and the IR needs to handle such cases. ``` int aliasee __attribute__((weak)); extern int alias __attribute__((alias("aliasee"))); ``` There are other places where GlobalAlias isInterposable usage may need to be fixed. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D107249	2022-02-01 10:41:16 -08:00
Fangrui Song	dd6e7e0d57	[llvm-ar] Add --thin for creating a thin archive In GNU ar (since 2008), the modifier 'T' means creating a thin archive. In many other ar implementations (FreeBSD, macOS, elfutils, etc), -T means "allow filename truncation of extracted files", as specified by X/Open System Interface. For portability, 'T' with thin archive semantics should be avoided. See https://sourceware.org/bugzilla/show_bug.cgi?id=28759 binutils 2.38 will deprecate 'T' (without diagnostic) and add --thin. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D116979	2022-02-01 09:56:50 -08:00
Tanya Lattner	acef496b5e	Add status of migration.	2022-01-31 19:03:29 -08:00
Changpeng Fang	1194b9cdda	AMDGPU {NFC}: Add code object v5 support and generate metadata for implicit kernel args Summary: Add code object v5 support (default is still v4) Generate metadata for implicit kernel args for the new ABI Set the metadata version to be 1.2 Reviewers: t-tye, b-sumner, arsenm, and bcahoon Fixes: SWDEV-307188, SWDEV-307189 Differential Revision: https://reviews.llvm.org/D118272	2022-01-31 18:07:47 -08:00
Daniel McIntosh	0ee7a2c304	[docs] Update Prolog/Epilog Code Insertion docs to show it's still incomplete Compact Unwind is a subsection, but that was lost in rGff9feeb520a32d076c3095468208ae116c428285 Reviewed By: void Differential Revision: https://reviews.llvm.org/D118499	2022-01-31 15:25:46 -05:00
Jeff Bailey	f86844da49	Remove reference to LLVMLibC as the doc has moved. https://reviews.llvm.org/D117436 caused a build failure due to this error. Tested: ninja docs-llvm-libc builds Reviewed By: abrachet Differential Revision: https://reviews.llvm.org/D118537	2022-01-29 23:39:03 +00:00
Simon Pilgrim	058c5dfc78	Raise the minimum Visual Studio version to VS2019 As raised here: https://lists.llvm.org/pipermail/llvm-dev/2021-November/153881.html Now that VS2022 is on general release, LLVM is expected to build on VS2017, VS2019 and VS2022, which is proving hazardous to maintain due to changes in behaviour including preprocessor and constexpr changes. Plus of the few developers that work with VS, many have already moved to VS2019/22. This patch proposes to raise the minimum supported version to VS2019 (16.x) - I've made the hard limit 16.0 or later, with the soft limit VS2019 16.7 - older versions of VS2019 are "allowed" (at your own risk) via the LLVM_FORCE_USE_OLD_TOOLCHAIN cmake flag. Differential Revision: https://reviews.llvm.org/D114639	2022-01-29 10:56:41 +00:00
Jeff Bailey	4465c29906	Move LLVM Proposal to doc directory, create index The LLVM Libc project is no longer just a proposal and should have a webpage tracking the status of the project. This change puts the pieces into the right place so that the webpage can be created. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D117436	2022-01-29 00:29:31 +00:00
Ahmed Bougacha	634ca7349d	[ObjCARC] Require the function argument in the clang.arc.attachedcall bundle. Currently, the clang.arc.attachedcall bundle takes an optional function argument. Depending on whether the argument is present, calls with this bundle have the following semantics: - on x86, with the argument present, the call is lowered to: call _target mov rax, rdi call _objc_retainAutoreleasedReturnValue - on AArch64, without the argument, the call is lowered to: bl _target mov x29, x29 and the objc runtime call is expected to be emitted separately. That's because, on x86, the objc runtime checks for both the mov and the call on x86, and treats the combination as the ARC autorelease elision marker. But on AArch64, it only checks for the dedicated NOP marker, as that's historically been sufficiently unique. Thanks to that, the runtime call wasn't required to be adjacent to the NOP marker, so it wasn't emitted as part of the bundle sequence. This patch unifies both architectures: on AArch64, we now emit all 3 instructions for the bundle. This guarantees that the runtime call is adjacent to the marker in the sequence, and that's information the runtime can use to further optimize this. This helps simplify some of the handling, in particular BundledRetainClaimRVs, which no longer needs to know whether the bundle is sufficient or not: it now always should be. Note that this does not include an AutoUpgrade for the nullary bundles, as they are only produced in ObjCContract as part of the obj/asm emission pipeline, and are not expected to be in bitcode. Differential Revision: https://reviews.llvm.org/D118214	2022-01-28 12:41:45 -08:00
Ellis Hoag	11d3074267	[InstrProf] Add single byte coverage mode Use the llvm flag `-pgo-function-entry-coverage` to create single byte "counters" to track functions coverage. This mode has significantly less size overhead in both code and data because * We mark a function as "covered" with a store instead of an increment which generally requires fewer assembly instructions * We use a single byte per function rather than 8 bytes per block The trade off of course is that this mode only tells you if a function has been covered. This is useful, for example, to detect dead code. When combined with debug info correlation [0] we are able to create an instrumented Clang binary that is only 150M (the vanilla Clang binary is 143M). That is an overhead of 7M (4.9%) compared to the default instrumentation (without value profiling) which has an overhead of 31M (21.7%). [0] https://groups.google.com/g/llvm-dev/c/r03Z6JoN7d4 Reviewed By: kyulee Differential Revision: https://reviews.llvm.org/D116180	2022-01-27 17:38:55 -08:00
Tanya Lattner	586759cee5	Add email addresses to create a topic via email in a specific category.	2022-01-26 23:22:04 -08:00
Aaron Ballman	f3e22946e5	Update the Bug Life Cycle docs for the switch to GitHub issues This updates the Bug Life Cycle docs now that we've switched to GitHub issues. The intent is to retain the same general process we used to use for triaging bugs under Bugzilla, but with the facilities we have available in GitHub.	2022-01-26 15:55:36 -05:00
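
The last-region check described in commit 2937b28218 above can be illustrated with a small toy model. This is only a sketch under stated assumptions, not the lldb implementation: `fix_data_address`, `lookup`, `walk_regions` and the `REGIONS` table are hypothetical names invented here, the 48-bit mask follows the AArch64 example in the commit message, and the memory map is simplified (the middle unmapped range is made up so that the regions are contiguous).

```python
# Toy model of the memory region walk from commit 2937b28218.
# All names below are illustrative assumptions, not lldb APIs.

INVALID_ADDRESS = 0xFFFFFFFFFFFFFFFF  # the 0xffffffffffffffff end marker from the commit message
ADDRESS_BITS = 48                     # AArch64 virtual address width used in the commit's example


def fix_data_address(addr):
    """Stand-in for the ABI plugin's FixDataAddress: strip non-address bits."""
    return addr & ((1 << ADDRESS_BITS) - 1)


# (start, end, permissions, name); a simplified map modelled on the commit message.
REGIONS = [
    (0x0000000000000000, 0x0000000000400000, "---", ""),
    (0x0000000000400000, 0x0000FFFFFFFDF000, "---", ""),         # invented filler range
    (0x0000FFFFFFFDF000, 0x0001000000000000, "rw-", "[stack]"),
    (0x0001000000000000, INVALID_ADDRESS, "---", ""),            # catch-all unmappable range
]


def lookup(addr):
    """Return the region containing addr, as the remote would."""
    for region in REGIONS:
        if region[0] <= addr < region[1]:
            return region
    raise ValueError(f"no region contains {addr:#x}")


def walk_regions():
    """Walk regions from address 0, stopping at the last mappable region."""
    found = []
    addr = 0
    while True:
        region = lookup(fix_data_address(addr))
        found.append(region)
        end = region[1]
        if end == INVALID_ADDRESS:
            break  # classic termination: the remote reported the final region
        if fix_data_address(end) != end:
            break  # added check: fixing the end address changed it, so this is the last region
        addr = end
    return found


if __name__ == "__main__":
    for start, end, perms, name in walk_regions():
        print(f"[{start:#018x}-{end:#018x}) {perms} {name}")
```

Without the second `break`, the walk would look up the fixed end address of the `[stack]` region, which is 0 in this model, and start over from the first region. With it, the walk ends at `[stack]` and never reports the catch-all range, matching the side effect the commit message describes.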

9155 Commits