llvm-project

Commit Graph

Author	SHA1	Message	Date
Bardia Mahjour	a7e2c26939	[LV] Epilogue Vectorization with Optimal Control Flow (Recommit) This is yet another attempt at providing support for epilogue vectorization following discussions raised in RFC http://llvm.1065342.n5.nabble.com/llvm-dev-Proposal-RFC-Epilog-loop-vectorization-tt106322.html#none and reviews D30247 and D88819. Similar to D88819, this patch achieves epilogue vectorization by executing a single vplan twice: once on the main loop and a second time on the epilogue loop (using a different VF). However it's able to handle more loops, and generates more optimal control flow for cases where the trip count is too small to execute any code in vector form. Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D89566	2020-12-02 10:09:56 -05:00
David Sherwood	71bd59f0cb	[SVE] Add support for scalable vectors with vectorize.scalable.enable loop attribute In this patch I have added support for a new loop hint called vectorize.scalable.enable that says whether we should enable scalable vectorization or not. If a user wants to instruct the compiler to vectorize a loop with scalable vectors they can now do this as follows: br i1 %exitcond, label %for.end, label %for.body, !llvm.loop !2 ... !2 = !{!2, !3, !4} !3 = !{!"llvm.loop.vectorize.width", i32 8} !4 = !{!"llvm.loop.vectorize.scalable.enable", i1 true} Setting the hint to false simply reverts the behaviour back to the default, using fixed width vectors. Differential Revision: https://reviews.llvm.org/D88962	2020-12-02 13:23:43 +00:00
Tony	ac1b2ae9dc	[NFC][AMDGPU] Fix broken link to ClangOffloadBundler in AMDGPUUsage	2020-12-02 03:04:28 +00:00
Tony	04424c69bc	[NFC][AMDGPU] AMDGPU code object V4 ABI documentation - Documentation for AMDGPU code object V4. - Documentation clarification for code object V2 and V3. - Documentation for the clang-offload-bundler. - Numerous other documentation clarifications. Change-Id: I338b327cc9e75da6c987b7e081b496402a5a020e Differential Revision: https://reviews.llvm.org/D92434	2020-12-01 23:31:04 +00:00
Bardia Mahjour	c94af03f7f	Revert "[LV] Epilogue Vectorization with Optimal Control Flow" This reverts commit `9c5504adce`. Reverting to investigate build failure in http://lab.llvm.org:8011/#/builders/98/builds/1461/steps/9	2020-12-01 12:50:36 -05:00
Bardia Mahjour	9c5504adce	[LV] Epilogue Vectorization with Optimal Control Flow This is yet another attempt at providing support for epilogue vectorization following discussions raised in RFC http://llvm.1065342.n5.nabble.com/llvm-dev-Proposal-RFC-Epilog-loop-vectorization-tt106322.html#none and reviews D30247 and D88819. Similar to D88819, this patch achieves epilogue vectorization by executing a single vplan twice: once on the main loop and a second time on the epilogue loop (using a different VF). However it's able to handle more loops, and generates more optimal control flow for cases where the trip count is too small to execute any code in vector form. Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D89566	2020-12-01 12:04:29 -05:00
Amy Huang	efd1ec0dec	Recommit "[llvm-symbolizer] Switch to using native symbolizer by default on Windows" This reverts commit `1b63177a56`.	2020-11-30 17:36:12 -08:00
Juneyoung Lee	8e504615e9	[LangRef] missing link, minor fix	2020-11-30 23:09:36 +09:00
David Spickett	c2ead57ccf	[llvm-objdump] Document --mattr=help in --help output This does the same as `--mcpu=help` but was only documented in the user guide. * Added a test for both options. * Corrected the single dash in `-mcpu=help` text. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D92305	2020-11-30 12:52:54 +00:00
Juneyoung Lee	1856e22eeb	[LangRef] minor fixes to poison examples and well-defined values section (NFC)	2020-11-29 20:51:25 +09:00
Juneyoung Lee	2e32c49d97	[LangRef] Add poison constant This patch adds a description about the newly added poison constant to LangRef. Differential Revision: https://reviews.llvm.org/D92162	2020-11-27 10:29:52 +09:00
Marek Kurdej	d8ffb1f6a7	[llvm-profgen] [docs] Fix invalid header. Add to ToC. NFC.	2020-11-26 10:45:05 +01:00
Amy Huang	1b63177a56	Revert "[llvm-symbolizer] Switch to using native symbolizer by default on Windows" Breaks some asan tests on the buildbot. This reverts commit `c74b427cb2`.	2020-11-23 16:29:45 -08:00
Amy Huang	c74b427cb2	[llvm-symbolizer] Switch to using native symbolizer by default on Windows llvm-symbolizer used to use the DIA SDK for symbolization on Windows; this patch switches to using native symbolization, which was implemented recently. Users can still make the symbolizer use DIA by adding the `-dia` flag in the LLVM_SYMBOLIZER_OPTS environment variable. Differential Revision: https://reviews.llvm.org/D91814	2020-11-23 15:57:08 -08:00
Paul C. Anagnostopoulos	b23e84ffcf	[TableGen] Eliminate source location from CodeInit Step 1 in eliminating the 'code' type. Differential Revision: https://reviews.llvm.org/D91932	2020-11-23 11:30:13 -05:00
Tony	8605d3134c	[NFC][AMDGPU] Document kernel descriptor - Document that the kernel descriptor defined is for code object V3. Document that it also applies to earlier code object formats for CP. - Document the deprecated bits in kernel descriptor. Differential Revision: https://reviews.llvm.org/D91458	2020-11-21 04:54:17 +00:00
wlei	21c91454a8	[llvm-profgen][NFC] Fix build failure on different platform see title Test Plan: ninja & ninja check-llvm Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D91897	2020-11-20 16:36:04 -08:00
wlei	32221694cb	[CSSPGO][llvm-profgen] Disassemble text sections This stack of changes introduces `llvm-profgen` utility which generates a profile data file from given perf script data files for sample-based PGO. It’s part of (though not limited to) the CSSPGO work. Specifically to support context-sensitive with/without pseudo probe profile, it implements a series of functionalities including perf trace parsing, instruction symbolization, LBR stack/call frame stack unwinding, pseudo probe decoding, etc. Also high throughput is achieved by multiple levels of sample aggregation and compatible format with one stop is generated at the end. Please refer to: https://groups.google.com/g/llvm-dev/c/1p1rdYbL93s for the CSSPGO RFC. This change enables disassembling the text sections to build various address maps that are potentially used by the virtual unwinder. A switch `--show-disassembly` is being added to print the disassembly code. Like the llvm-objdump tool, this change leverages existing LLVM components to parse and disassemble ELF binary files. So far X86 is supported. Test Plan: ninja check-llvm Reviewed By: wmi, wenlei Differential Revision: https://reviews.llvm.org/D89712	2020-11-20 14:26:26 -08:00
wlei	a94fa86229	[CSSPGO][llvm-profgen] Parse mmap events from perf script This stack of changes introduces `llvm-profgen` utility which generates a profile data file from given perf script data files for sample-based PGO. It’s part of (though not limited to) the CSSPGO work. Specifically to support context-sensitive with/without pseudo probe profile, it implements a series of functionalities including perf trace parsing, instruction symbolization, LBR stack/call frame stack unwinding, pseudo probe decoding, etc. Also high throughput is achieved by multiple levels of sample aggregation and compatible format with one stop is generated at the end. Please refer to: https://groups.google.com/g/llvm-dev/c/1p1rdYbL93s for the CSSPGO RFC. As a starter, this change sets up an entry point by introducing PerfReader to load profiled binaries and perf traces (including perf events and perf samples). For the event, here it parses the mmap2 events from perf script to build the loader snaps, which is used to retrieve the image load address in the subsequent perf tracing parsing. As described in llvm-profgen.rst, the tool being built aims to support multiple input perf data (preprocessed by perf script) as well as multiple input binary images. It should also support dynamic reload/unload shared objects by leveraging the loader snaps being built by this change. Reviewed By: wenlei, wmi Differential Revision: https://reviews.llvm.org/D89707	2020-11-20 14:26:26 -08:00
Alex Richardson	3bc4157556	Add a default address space for globals to DataLayout This is similar to the existing alloca and program address spaces (D37052) and should be used when creating/accessing global variables. We need this in our CHERI fork of LLVM to place all globals in address space 200. This ensures that values are accessed using CHERI load/store instructions instead of the normal MIPS/RISC-V ones. The problem this is trying to fix is that most of the time the type of globals is created using a simple PointerType::getUnqual() (or ::get() with the default address-space value of 0). This does not work for us and we get assertion/compilation/instruction selection failures whenever a new call is added that uses the default value of zero. In our fork we have removed the default parameter value of zero for most address space arguments and use DL.getProgramAddressSpace() or DL.getGlobalsAddressSpace() whenever possible. If this change is accepted, I will upstream follow-up patches to use DL.getGlobalsAddressSpace() instead of relying on the default value of 0 for PointerType::get(), etc. This patch and the follow-up changes will not have any functional changes for existing backends with the default globals address space of zero. A follow-up commit will change the default globals address space for AMDGPU to 1. Reviewed By: dylanmckay Differential Revision: https://reviews.llvm.org/D70947	2020-11-20 15:46:52 +00:00
Pavel Iliin	4d7df43ffd	[AArch64] Out-of-line atomics (-moutline-atomics) implementation. This patch implements out of line atomics for LSE deployment mechanism. Details of how it works can be found in llvm/docs/Atomics.rst. Options -moutline-atomics and -mno-outline-atomics to enable and disable it were added to clang driver. This is clang and llvm part of out-of-line atomics interface, library part is already supported by libgcc. Compiler-rt support is provided in separate patch. Differential Revision: https://reviews.llvm.org/D91157	2020-11-20 13:30:12 +00:00
Leonard Chan	a97f62837f	[llvm][IR] Add dso_local_equivalent Constant The `dso_local_equivalent` constant is a wrapper for functions that represents a value which is functionally equivalent to the global passed to this. That is, if this accepts a function, calling this constant should have the same effects as calling the function directly. This could be a direct reference to the function, the `@plt` modifier on X86/AArch64, a thunk, or anything that's equivalent to the resolved function as a call target. When lowered, the returned address must have a constant offset at link time from some other symbol defined within the same binary. The address of this value is also insignificant. The name is leveraged from `dso_local` where use of a function or variable is resolved to a symbol in the same linkage unit. In this patch: - Addition of `dso_local_equivalent` and handling it - Update Constant::needsRelocation() to strip constant inbound GEPs and take advantage of `dso_local_equivalent` for relative references This is useful for the [Relative VTables C++ ABI](https://reviews.llvm.org/D72959) which makes vtables readonly. This works by replacing the dynamic relocations for function pointers in them with static relocations that represent the offset between the vtable and virtual functions. If a function is externally defined, `dso_local_equivalent` can be used as a generic wrapper for the function to still allow for this static offset calculation to be done. See [RFC](http://lists.llvm.org/pipermail/llvm-dev/2020-August/144469.html) for more details. Differential Revision: https://reviews.llvm.org/D77248	2020-11-19 10:26:17 -08:00
Nick Desaulniers	f4c6080ab8	Revert "[IR] add fn attr for no_stack_protector; prevent inlining on mismatch" This reverts commit `b7926ce6d7`. Going with a simpler approach.	2020-11-17 17:27:14 -08:00
Florian Hahn	52f3714dae	[VPlan] Add VPDef class. This patch introduces a new VPDef class, which can be used to manage VPValues defined by recipes/VPInstructions. The idea here is to mirror VPUser for values defined by a recipe. A VPDef can produce either zero (e.g. a store recipe), one (most recipes) or multiple (VPInterleaveRecipe) result VPValues. To traverse the def-use chain from a VPDef to its users, one has to traverse the users of all values defined by a VPDef. VPValues now contain a pointer to their corresponding VPDef, if one exists. To traverse the def-use chain upwards from a VPValue, we first need to check if the VPValue is defined by a VPDef. If it does not have a VPDef, this means we have a VPValue that is not directly defined inside the plan and we are done. If we have a VPDef, it is defined inside the region by a recipe, which is a VPUser, and the upwards def-use chain traversal continues by traversing all its operands. Note that we need to add an additional field to VPValue to link them to their defs. The space increase is going to be offset by being able to remove the SubclassID field in future patches. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D90558	2020-11-17 16:18:11 +00:00
Michael Liao	f375885ab8	[InferAddrSpace] Teach to handle assumed address space. - In certain cases, a generic pointer could be assumed as a pointer to the global memory space or other spaces. With a dedicated target hook to query that address space from a given value, infer-address-space pass could infer and propagate that to all its users. Differential Revision: https://reviews.llvm.org/D91121	2020-11-16 17:06:33 -05:00
Paul C. Anagnostopoulos	d4b3277d8e	[TableGen] Improve a couple of descriptions in the command guide Differential Revision: https://reviews.llvm.org/D91484	2020-11-15 09:59:59 -05:00
Paul C. Anagnostopoulos	54f9ee3341	[TableGen] Add frontend/backend phase timing capability. Describe in the BackEnd Developer's Guide. Instrument a few backends. Remove an old unused timing facility. Add a null backend for timing the parser. Differential Revision: https://reviews.llvm.org/D91388	2020-11-14 10:10:29 -05:00
Nikita Popov	c87c375096	[LangRef] Clarify GEP inbounds wrapping semantics Clarify the semantics of GEP inbounds, in particular with regard to what it means for wrapping. This cleans up some confusion on when it is legal to apply nuw/nsw flags to various parts of the GEP calculation. Differential Revision: https://reviews.llvm.org/D90708	2020-11-13 17:49:41 +01:00
Paul C. Anagnostopoulos	641428f928	[TableGen] Enhance the six comparison bang operators. Update the Programmer's Reference. Differential Revision: https://reviews.llvm.org/D91036	2020-11-13 09:57:27 -05:00
serge-sans-paille	95537f4508	llvmbuildectomy - compatibility with ocaml bindings Use exact component name in add_ocaml_library. Make expand_topologically compatible with new architecture. Fix quoting in is_llvm_target_library. Fix LLVMipo component name. Write release note.	2020-11-13 14:35:52 +01:00
Florian Hahn	8bb6347939	Add !annotation metadata and remarks pass. This patch adds a new !annotation metadata kind which can be used to attach annotation strings to instructions. It also adds a new pass that emits summary remarks per function with the counts for each annotation kind. The intended use case for this new metadata is annotating 'interesting' instructions and the remarks should provide additional insight into transformations applied to a program. To motivate this, consider these specific questions we would like to get answered: * How many stores added for automatic variable initialization remain after optimizations? Where are they? * How many runtime checks inserted by a frontend could be eliminated? Where are the ones that did not get eliminated? Discussed on llvm-dev as part of 'RFC: Combining Annotation Metadata and Remarks' (http://lists.llvm.org/pipermail/llvm-dev/2020-November/146393.html) Reviewed By: thegameg, jdoerfert Differential Revision: https://reviews.llvm.org/D91188	2020-11-13 13:24:10 +00:00
Florian Hahn	35e461ae2b	[docs] Fix undefined reference in ORCv2 design doc. This fixes a typo introduced in `984e87923f` which caused the docs build to fail.	2020-11-13 09:44:48 +00:00
serge-sans-paille	9218ff50f9	llvmbuildectomy - replace llvm-build by plain cmake No longer rely on an external tool to build the llvm component layout. Instead, leverage the existing `add_llvm_componentlibrary` cmake function and introduce `add_llvm_component_group` to accurately describe component behavior. These functions store extra properties in the created targets. These properties are processed once all components are defined to resolve library dependencies and produce the header expected by llvm-config. Differential Revision: https://reviews.llvm.org/D90848	2020-11-13 10:35:24 +01:00
Lang Hames	c7e64df445	[docs] Fix formatting, clarify comment in ORCv2 doc	2020-11-12 13:11:01 +11:00
Lang Hames	48ee1ea05c	[docs] Fix formatting in ORCv2.rst. Bold and fixed-width do not appear to mix well.	2020-11-12 11:08:58 +11:00
Lang Hames	984e87923f	[docs] Update ORCv2 design doc. Fixes some formatting and wording, and adds a roadmap section.	2020-11-12 10:33:29 +11:00
Renato Golin	3073cbd2d4	[docs] link new support policy from developer policy Adding new paragraphs under "Introducing New Components" section to check the different levels of support we have, to help introduction of smaller set of changes without overwhelming new collaborators and potentially losing the contribution. Differential Revision: D91013	2020-11-10 19:40:57 +00:00
David Green	7f34b9ddf8	[Sphinx] Fix langref formatting. NFC	2020-11-10 16:47:43 +00:00
David Green	b2ac9681a7	[ARM] Alter t2DoLoopStart to define lr This changes the definition of t2DoLoopStart from t2DoLoopStart rGPR to GPRlr = t2DoLoopStart rGPR This will hopefully mean that low overhead loops are more tied together, and we can more reliably generate loops without reverting or being at the whims of the register allocator. This is a fairly simple change in itself, but leads to a number of other required alterations. - The hardware loop pass, if UsePhi is set, now generates loops of the form: %start = llvm.start.loop.iterations(%N) loop: %p = phi [%start], [%dec] %dec = llvm.loop.decrement.reg(%p, 1) %c = icmp ne %dec, 0 br %c, loop, exit - For this a new llvm.start.loop.iterations intrinsic was added, identical to llvm.set.loop.iterations but produces a value as seen above, gluing the loop together more through def-use chains. - This new intrinsic conceptually produces the same output as input, which is taught to SCEV so that the checks in MVETailPredication are not affected. - Some minor changes are needed to the ARMLowOverheadLoop pass, but it has been left mostly as before. We should now more reliably be able to tell that the t2DoLoopStart is correct without having to prove it, but t2WhileLoopStart and tail-predicated loops will remain the same. - And all the tests have been updated. There are a lot of them! This patch on its own might cause more trouble than it helps, with more tail-predicated loops being reverted, but some additional patches can hopefully improve upon that to get to something that is better overall. Differential Revision: https://reviews.llvm.org/D89881	2020-11-10 15:57:58 +00:00
Paul C. Anagnostopoulos	91d2e5c81a	[TableGen] Add the !filter bang operator. Add a test. Update the Programmer's Reference. Use it in some TableGen files. Differential Revision: https://reviews.llvm.org/D91008	2020-11-09 10:56:55 -05:00
Sebastian Neubauer	a022b1ccd8	[AMDGPU] Add amdgpu_gfx calling convention Add a calling convention called amdgpu_gfx for real function calls within graphics shaders. For the moment, this uses the same calling convention as other calls in amdgpu, with registers excluded for return address, stack pointer and stack buffer descriptor. Differential Revision: https://reviews.llvm.org/D88540	2020-11-09 16:51:44 +01:00
Renato Golin	25ba6b2bcd	[docs] Adding a Support Policy As discussed in the mailing list [1-4], we need a separation of support tiers when requiring support from the whole community versus a sub-community. Essentially, if a sub-community is active enough and takes maintenance into their own internal costs without affecting other parts of the community's maintenance costs, then code that is not immediately relevant to all parts (ie. not released, actively tested, etc) can still find its way into the LLVM main repository without major pain points. The main benefit is to reduce the maintenance cost that those sub-communities have outside of LLVM (for example, in duplicating common code, applying the same patches on top of multiple user repositories or downstream projects). This document outlines the components and responsibilities of the sub-communities with regards to maintenance costs and how they affect the rest of the community. It also adds an addendum on removal policies, which expand the existing "new target removal" policy into something more generic, to encompass any piece of code, scripts or documents in the repository. [1] http://lists.llvm.org/pipermail/llvm-dev/2020-October/146249.html [2] http://lists.llvm.org/pipermail/llvm-dev/2020-November/146335.html [3] http://lists.llvm.org/pipermail/llvm-dev/2020-October/146138.html [4] http://lists.llvm.org/pipermail/llvm-dev/2020-November/146298.html	2020-11-07 21:06:05 +00:00
Arnold Schwaighofer	c6543cc6b8	llvm.coro.id.async lowering: Parameterize how-to restore the current's continuation context and restart the pipeline after splitting The `llvm.coro.suspend.async` intrinsic takes a function pointer as its argument that describes how-to restore the current continuation's context from the context argument of the continuation function. Before we assumed that the current context can be restored by loading from the context argument's first pointer field (`first_arg->caller_context`). This allows for defining suspension points that reuse the current context for example. Also: llvm.coro.id.async lowering: Add llvm.coro.prepare.async intrinsic Blocks inlining until after the async coroutine was split. Also, change the async function pointer's context size position struct async_function_pointer { uint32_t relative_function_pointer_to_async_impl; uint32_t context_size; } And make the position of the `async context` argument configurable. The position is specified by the `llvm.coro.id.async` intrinsic. rdar://70097093 Differential Revision: https://reviews.llvm.org/D90783	2020-11-06 06:22:46 -08:00
Paul C. Anagnostopoulos	6ea6444f11	[TableGen] Clarify text and fix errors in the Programmer's Reference Differential Revision: https://reviews.llvm.org/D90881	2020-11-06 08:56:29 -05:00
Paul C. Anagnostopoulos	6f288b11db	[TableGen] Clean up documentation toctrees; clarify two paragraphs. Differential Revision: https://reviews.llvm.org/D90804	2020-11-05 16:19:18 -05:00
Paul C. Anagnostopoulos	ae2cb4f427	[TableGen] Add true and false literals to represent booleans Update the Programmer's Reference document. Add a test. Update a couple of tests with an improved error message. Differential Revision: https://reviews.llvm.org/D90635	2020-11-05 09:07:21 -05:00
Atmn Patel	cea0599aa7	[LangRef] Adds llvm.loop.mustprogress loop metadata This patch adds the llvm.loop.mustprogress loop metadata. This is to be added to loops where the frontend language requires that the loop makes observable interactions with the environment. This is the loop-level equivalent to the function attribute `mustprogress` defined in D86233. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D88464	2020-11-04 22:32:50 -05:00
Arnold Schwaighofer	ea5989b43a	Start of an llvm.coro.async implementation This patch adds the `async` lowering of coroutines. This will be used by the Swift frontend to lower async functions. In contrast to the `retcon` lowering the frontend needs to be in control over control-flow at suspend points as execution might be suspended at these points. This is very much work in progress and the implementation will change as it evolves with the frontend. As such the documentation is lacking detail as some of it might change. rdar://70097093 Reapply with fix for memory sanitizer failure and sphinx failure. Differential Revision: https://reviews.llvm.org/D90612	2020-11-04 10:29:21 -08:00
Arnold Schwaighofer	42f1916640	Revert "Start of an llvm.coro.async implementation" This reverts commit `ea606cced0`. This patch causes memory sanitizer failures sanitizer-x86_64-linux-fast.	2020-11-04 08:26:20 -08:00
Arnold Schwaighofer	ea606cced0	Start of an llvm.coro.async implementation This patch adds the `async` lowering of coroutines. This will be used by the Swift frontend to lower async functions. In contrast to the `retcon` lowering the frontend needs to be in control over control-flow at suspend points as execution might be suspended at these points. This is very much work in progress and the implementation will change as it evolves with the frontend. As such the documentation is lacking detail as some of it might change. rdar://70097093 Differential Revision: https://reviews.llvm.org/D90612	2020-11-04 07:32:29 -08:00
Paul C. Anagnostopoulos	d56cd4291e	[TableGen] Add !interleave operator to concatenate a list of values with delimiters Add a test. Use it in some TableGen files. Differential Revision: https://reviews.llvm.org/D90469	2020-11-04 09:23:54 -05:00
Fangrui Song	d2c45f6620	[docs] Fix docs-llvm-html after recent TableGen changes D90617	2020-11-03 13:43:24 -08:00
Tony	45bcbe46d7	[NFC][AMDGPU] Minor editorial improvements to AMDGPUUsage.rst Differential Revision: https://reviews.llvm.org/D90661	2020-11-03 16:56:01 +00:00
Tim Renouf	89d41f3a2b	[AMDGPU] Add gfx1033 target Differential Revision: https://reviews.llvm.org/D90447 Change-Id: If2650fc7f31bbdd49c76e74a9ca8e3734d769761	2020-11-03 16:27:48 +00:00
Tim Renouf	ee3e642627	[AMDGPU] Add gfx90c target This differentiates the Ryzen 4000/4300/4500/4700 series APUs that were previously included in gfx909. Differential Revision: https://reviews.llvm.org/D90419 Change-Id: Ia901a7157eb2f73ccd9f25dbacec38427312377d	2020-11-03 16:27:43 +00:00
Mircea Trofin	34b0a99cce	[Docs][FileCheck] Small fix.	2020-11-03 07:08:51 -08:00
Tony	68160789c1	[NFC][AMDGPU] Restructure the AMDGPU memory model description Separate the AMDGPU memory model description into separate sections for each architecture. Differential Revision: https://reviews.llvm.org/D90548	2020-11-02 21:32:20 +00:00
Atmn Patel	eed8df6a13	[Coroutines][Docs] Remove frame packing as a TODO This has already been done by @rjmccall in D76526 (`49e5a97ec3`), and `9514c048d8`. We should remove this from the docs. Differential Revision: https://reviews.llvm.org/D90550	2020-11-02 15:57:04 -05:00
Mircea Trofin	22113341d7	[FileCheck] Added documentation for --allow-unused-prefixes Differential Revision: https://reviews.llvm.org/D90621	2020-11-02 12:15:45 -08:00
Paul C. Anagnostopoulos	473f8ae699	[TableGen] Fix a couple of minor issues regarding the paste operator. Update the documentation to fully describe it. Differential Revision: https://reviews.llvm.org/D90617	2020-11-02 12:21:54 -05:00
Caroline Concatto	71038788ce	Revert "[AArch64][AsmParser] Remove 'x31' alias for 'sp/xzr' register." This reverts commit `8b281bfaf3`.	2020-11-02 08:15:50 +00:00
Caroline Concatto	8b281bfaf3	[AArch64][AsmParser] Remove 'x31' alias for 'sp/xzr' register. Only the aliases 'xzr' and 'sp' exist for the physical register x31. The reason for wanting to remove the alias 'x31' is because it allows users to write invalid asm that is not accepted by the GNU assembler. Is there any objection to removing this alias? Or do we want to keep this for compatibility with existing code that uses w31/x31? Differential Revision: https://reviews.llvm.org/D90153	2020-11-02 07:57:05 +00:00
Liu, Chen3	756f597841	[X86] Support Intel avxvnni This patch mainly made the following changes: 1. Support AVX-VNNI instructions; 2. Introduce ExplicitVEXPrefix flag so that vpdpbusd/vpdpbusds/vpdpwssd/vpdpwssds instructions only use vex-encoding when the user explicitly adds the {vex} prefix. Differential Revision: https://reviews.llvm.org/D89105	2020-10-31 12:39:51 +08:00
Tony	fccf4f6add	[NFC][AMDGPU] Minor cleanup to AMDGPU memory model table Differential Revision: https://reviews.llvm.org/D90509	2020-10-30 22:50:22 +00:00
Scott Linder	580f99bcff	[NFC][AMDGPU] Resize Memory Model columns in AMDGPUUsage.rst Make all of the "AMDGPU Machine Code GFX*" columns in the Memory Model table a consistent width of 32 characters. Best viewed with something like --word-diff Differential Revision: https://reviews.llvm.org/D89977	2020-10-29 23:07:03 +00:00
Scott Linder	fb37943cc8	[AMDGPU] Update Memory Model in AMDGPUUsage.rst Mostly NFC, but some changes are "bug fixes" rather than just e.g. formatting changes or typo corrections. - Fix typo "competing" -> "completing". - Document why waitcnt is added to stores and not loads for sequentially consistent ordering. - Lowercase some mentions of `buffer_gl{0,1}_inv`. - Make mentions of `*cnt(0)` consistently include the `(0)` count. - Remove some mentions of instructions for incorrect address spaces. For example, remove mention of `flat_load` from `load atomic acquire workgroup global`. - Re-flow some text to get all the target columns to fit in a 32-character wide column. Makes a future NFC patch to make these columns both 32-character wide more straightforward. Modified cherry-pick of patch by Tony Tye Reviewed By: t-tye Differential Revision: https://reviews.llvm.org/D89596	2020-10-29 23:07:03 +00:00
Stefanos Baziotis	a3345300b6	[LCSSA] Doc for special treatment of PHIs Differential Revision: https://reviews.llvm.org/D89739	2020-10-29 22:50:07 +02:00
Nikita Popov	fa48ff3fc9	[CodeGen] Fix neutral value of vecreduce fadd in tests (NFC) The neutral value is -0.0, not 0.0. This doesn't matter for "fast" reductions due to nsz, but does matter for reassoc-only and seq reductions. Change tests to mostly use -0.0 where the neutral value was intended, and add some additional test coverage in some places. Also update LangRef to use the right value.	2020-10-29 21:26:14 +01:00
Tony	661797bd76	[AMDGPU] Update AMD GPU documentation - AMDGPUUsage.rst: Correct AMD GPU DWARF address space table address sizes which are in bits and not bytes. - clang/.../Options.td: Improve description of AMD GPU options. - Re-generate ClangCommandLineReference.rst from clang/.../Options.td. Differential Revision: https://reviews.llvm.org/D90364	2020-10-29 20:12:47 +00:00
Mehdi Amini	7d3e9578ca	Make the post-commit review expectations more explicit with respect to revert See http://lists.llvm.org/pipermail/llvm-dev/2016-March/096529.html for context. Reviewed By: silvas, rengolin, echristo, dexonsmith, gribozavr2 Differential Revision: https://reviews.llvm.org/D89995	2020-10-28 23:29:29 +00:00
Paul C. Anagnostopoulos	9d72065cf6	[TableGen] [AMDGPU] Add !sub operator for subtraction Use it in the AMDGPU target to eliminate !add(value1, !mul(value2, -1)) Differential Revision: https://reviews.llvm.org/D90107	2020-10-28 12:27:53 -04:00
Paul C. Anagnostopoulos	0ed1e1df40	[TableGen] Command description file requires a hyphen in document title.	2020-10-28 09:31:31 -04:00
Paul C. Anagnostopoulos	22a8f5a2c3	[TableGen] Update xxx-tblgen command document. Add a few cross-references among TableGen documents. Differential Revision: https://reviews.llvm.org/D90186 Add cross-references between TableGen documents.	2020-10-28 09:08:13 -04:00
Clement Courbet	a098f32a1f	[llvm-exegesis][doc] Remove old FIXME. This was fixed in a previous commit, the previous line in the documentation explains how to proceed.	2020-10-28 10:53:23 +01:00
Clement Courbet	992da89450	[llvm-exegesis] Update doc. We don't need an external script to scan all opcodes anymore, just use `-opcode-index=-1`.	2020-10-28 08:42:38 +01:00
Johannes Doerfert	14077836ec	[LangRef] Clarify `dereferenceable` -> `nonnull` implication If `null_pointer_is_valid` is present, `dereferenceable` does not imply `nonnull`, make it clear. Came up in D17993. Reviewed By: aqjune Differential Revision: https://reviews.llvm.org/D89417	2020-10-27 19:12:53 -05:00
Georgii Rymar	f855a55333	[llvm-readelf] - Implement --section-details option. --section-details/-t is a GNU readelf option that produces an output that is an alternative to --sections. Differential revision: https://reviews.llvm.org/D89304	2020-10-27 13:29:39 +03:00
Vedant Kumar	905f874c44	[cmake] Add LLVM_UBSAN_FLAGS, to allow overriding UBSan flags Allow overriding the default set of flags used to enable UBSan when building llvm. This can be used to test new checks or opt out of certain checks. Differential Revision: https://reviews.llvm.org/D89439	2020-10-26 15:48:19 -07:00
Benjamin Kramer	39a0d6889d	[X86] Add a stub for Intel's alderlake. No scheduling, no autodetection.	2020-10-24 19:01:22 +02:00
Tony	bf6518a806	[AMDGPU] Cleanup AMDGPUUsage.rst - Layout and typo improvements. - Add memory spaces section. - reStructure syntax fixes. Differential Revision: https://reviews.llvm.org/D90002	2020-10-24 06:21:27 +00:00
Artur Pilipenko	6ec2c5e402	GC-parseable element atomic memcpy/memmove This change introduces a GC parseable lowering for element atomic memcpy/memmove intrinsics. This way runtime can provide an implementation which can take a safepoint during copy operation. See "GC-parseable element atomic memcpy/memmove" thread on llvm-dev for the background and details: https://groups.google.com/g/llvm-dev/c/NnENHzmX-b8/m/3PyN8Y2pCAAJ Differential Revision: https://reviews.llvm.org/D88861	2020-10-23 14:06:09 -07:00
Nick Desaulniers	b7926ce6d7	[IR] add fn attr for no_stack_protector; prevent inlining on mismatch It's currently ambiguous in IR whether the source language explicitly did not want a stack protector (in C, via function attribute no_stack_protector) or doesn't care for any given function. It's common for code that manipulates the stack via inline assembly or that has to set up its own stack canary (such as the Linux kernel) to want to avoid stack protectors in certain functions. In this case, we've been bitten by numerous bugs where a callee with a stack protector is inlined into an __attribute__((__no_stack_protector__)) caller, which generally breaks the caller's assumptions about not having a stack protector. LTO exacerbates the issue. While developers can avoid this by putting all no_stack_protector functions in one translation unit together and compiling those with -fno-stack-protector, it's generally not as ergonomic as a function attribute, and still doesn't work for LTO. See also: https://lore.kernel.org/linux-pm/20200915172658.1432732-1-rkir@google.com/ https://lore.kernel.org/lkml/20200918201436.2932360-30-samitolvanen@google.com/T/#u Typically, when inlining a callee into a caller, the caller will be upgraded in its level of stack protection (see adjustCallerSSPLevel()). By adding an explicit attribute in the IR when the function attribute is used in the source language, we can now identify such cases and prevent inlining. Block inlining when the callee and caller differ in the case that one contains `nossp` when the other has `ssp`, `sspstrong`, or `sspreq`. Fixes pr/47479. Reviewed By: void Differential Revision: https://reviews.llvm.org/D87956	2020-10-23 11:55:39 -07:00
Paul C. Anagnostopoulos	876af264c1	[TableGen] Change !getop and !setop to !getdagop and !setdagop. Differential Revision: https://reviews.llvm.org/D89814	2020-10-23 10:36:05 -04:00
Nick Desaulniers	4abaf0ec0a	BitCodeFormat: update doc on new byref and mustprogress attrs; NFC Forked from review of: https://reviews.llvm.org/D87956	2020-10-22 16:29:56 -07:00
Tom Stellard	6f798e460c	HowToReleaseLLVM: Clean up document and remove references to SVN Reviewed By: hans Differential Revision: https://reviews.llvm.org/D80395	2020-10-22 11:34:03 -07:00
Paul C. Anagnostopoulos	b9eecbfada	[TableGen] Update documents to make them more complete Differential Revision: https://reviews.llvm.org/D89962	2020-10-22 13:19:19 -04:00
Arthur Eubanks	87520657b8	Revert "[Docs] Clarify that FunctionPasses can't add/remove declarations" This reverts commit `710676cf3a`.	2020-10-22 09:49:42 -07:00
Arthur Eubanks	710676cf3a	[Docs] Clarify that FunctionPasses can't add/remove declarations In preparation for potential future concurrency, a FunctionPass shouldn't modify anything at the module level that other FunctionPasses can also modify. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D89890	2020-10-22 09:03:42 -07:00
Paul C. Anagnostopoulos	b2faf75568	[TableGen] Continue improving the comments for the data structures. Differential Revision: https://reviews.llvm.org/D89901	2020-10-22 10:00:49 -04:00
Tianqing Wang	be39a6fe6f	[X86] Add User Interrupts(UINTR) instructions For more details about these instructions, please refer to the latest ISE document: https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D89301	2020-10-22 17:33:07 +08:00
Wang, Pengfei	e32036b973	[X86] Add clang release notes for HRESET and minor change for llvm release notes. (NFC)	2020-10-21 15:59:42 +08:00
Konrad Kleine	f0f76aea37	[doc] Apply buildbot worker terminology change: slave->worker Recently [1], there was an upgrade to the version of buildbot being deployed. The new setup will still work with old buildslaves but I thought it might be a good idea to update the documentation to reflect that you can now use a newer buildbot version when setting up your worker (formerly known as slave). The upgrade from buildbot 0.8.5 to 2.8.5 went along with a transition to a new "worker" terminology [2] which is also reflected by this change. [1]: http://lists.llvm.org/pipermail/llvm-dev/2020-October/145629.html [2]: http://docs.buildbot.net/0.9.12/manual/worker-transition.html Reviewed By: gkistanova Differential Revision: https://reviews.llvm.org/D89230	2020-10-20 06:43:09 -04:00
Artur Pilipenko	037ef7d70c	Adding new Azul representative to security group Adding myself as a new Azul representative to security group. Differential Revision: https://reviews.llvm.org/D89287	2020-10-19 22:41:19 -07:00
Atmn Patel	1e55cf77f3	[LangRef] Define mustprogress attribute LLVM IR currently assumes some form of forward progress. This form is not explicitly defined anywhere, and is the cause of miscompilations in most languages that are not C++11 or later. This implicit forward progress guarantee can not be opted out of on a function level nor on a loop level. Languages such as C (C11 and later), C++ (pre-C++11), and Rust have different forward progress requirements and this needs to be evident in the IR. Specifically, C11 and onwards (6.8.5, Paragraph 6) states that "An iteration statement whose controlling expression is not a constant expression, that performs no input/output operations, does not access volatile objects, and performs no synchronization or atomic operations in its body, controlling expression, or (in the case of for statement) its expression-3, may be assumed by the implementation to terminate." C++11 and onwards does not have this assumption, and instead assumes that every thread must make progress as defined in [intro.progress] when it comes to scheduling. This was initially brought up in [0] as a bug, a solution was presented in [1] which is the current workaround, and the predecessor to this change was [2]. After defining a notion of forward progress for IR, there are two options to address this: 1) Set the default to assuming Forward Progress and provide an opt-out for functions and an opt-in for loops. 2) Set the default to not assuming Forward Progress and provide an opt-in for functions, and an opt-in for loops. Option 2) has been selected because only C++11 and onwards have a forward progress requirement and it makes sense for them to opt-into it via the defined `mustprogress` function attribute. The `mustprogress` function attribute indicates that the function is required to make forward progress as defined. This is sharply in contrast to the status quo where this is implicitly assumed. In addition, `willreturn` implies `mustprogress`. 
The background for why this definition was chosen is in [3] and for why the option was chosen is in [4] and the corresponding thread(s). The implementation is in D85393, the clang patch is in D86841, the LoopDeletion patch is in D86844, the Inliner patches are in D87180 and D87262, and there will be more incoming. [0] https://bugs.llvm.org/show_bug.cgi?id=965#c25 [1] https://lists.llvm.org/pipermail/llvm-dev/2017-October/118558.html [2] https://reviews.llvm.org/D65718 [3] https://lists.llvm.org/pipermail/llvm-dev/2020-September/144919.html [4] https://lists.llvm.org/pipermail/llvm-dev/2020-September/145023.html Reviewed By: jdoerfert, efriedma, nikic Differential Revision: https://reviews.llvm.org/D86233	2020-10-19 13:34:27 -04:00
Paul C. Anagnostopoulos	dc5d6632b0	[TableGen] Enhance !empty and !size to handle strings and DAGs. Fix bug in the type checking for !empty, !head, !size, !tail.	2020-10-19 09:22:20 -04:00
Sam Parker	03f3ef221b	[LangRef] Correct return type llvm.test.set.loop.iterations.* The langref description for llvm.test.set.loop.iterations.* was missing the i1 return type. Differential Revision: https://reviews.llvm.org/D89564 Patch by: Janek van Oirschot	2020-10-19 12:56:38 +01:00
Lang Hames	ad92f16ccc	[ORC][examples] Update Kaleidoscope and BuildingAJIT tutorial series to OrcV2. This patch updates the Kaleidoscope and BuildingAJIT tutorial series (chapter 1-4) to OrcV2. Chapter 5 of the BuildingAJIT series is removed -- it will be re-instated once we have in-tree support for out-of-process JITing. This patch only updates the tutorial code, not the text. Patches welcome for that, otherwise I will try to update it in a few weeks.	2020-10-18 21:03:04 -07:00
Paul C. Anagnostopoulos	a90f742dd8	[TableGen] Change Programmer's Reference to use "DAG argument" rather than "operand". Differential Revision: https://reviews.llvm.org/D89624	2020-10-18 10:50:14 -04:00
Juneyoung Lee	62a0ec1612	Add support for !noundef metadata on loads This patch adds metadata !noundef and allows load instructions to optionally have it. A load with !noundef always returns a well-defined value (has no undef bit or isn't poison). If the loaded value isn't well defined, the behavior is undefined. This metadata can be used to encode the assumption from C/C++ that certain reads of variables should have well-defined values. It is helpful for optimizing freeze instructions away, because freeze can be removed when its operand has a well-defined value, and showing that a load from an arbitrary location is well-defined is usually hard otherwise. The same information can be encoded with llvm.assume with operand bundle; using metadata is chosen because I wasn't sure whether code motion can be freely done when llvm.assume is inserted from clang instead. The existing codebase already is stripping unknown metadata when doing code motion, so using metadata is UB-safe as well. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D89050	2020-10-17 13:50:10 +09:00
Juneyoung Lee	701cf4b5a5	[LangRef] Rename the names of metadata in load/store's syntax (NFC) Discussed in D89050	2020-10-17 13:30:02 +09:00
Alok Kumar Sharma	0538353b3b	[DebugInfo] Support for DWARF operator DW_OP_over LLVM rejects DWARF operator DW_OP_over. This DWARF operator is needed for Flang to support assumed rank array. Summary: Currently LLVM rejects DWARF operator DW_OP_over. Below error is produced when llvm finds this operator. [..] invalid expression !DIExpression(151, 20, 16, 48, 30, 35, 80, 34, 6) warning: ignoring invalid debug info in over.ll [..] There were some parts missing in support of this operator, which are now completed. Testing -added a unit testcase -check-debuginfo -check-llvm Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D89208	2020-10-17 08:42:28 +05:30
Stanislav Mekhanoshin	173389e16d	[AMDGPU] Fix gfx1032 description in AMDGPUUsage.rst. NFC. Differential Revision: https://reviews.llvm.org/D89565	2020-10-16 13:29:20 -07:00
Vinicius Tinti	e95f9a23fa	[llvm-objdump] Implement --prefix option The prefix given to --prefix will be added to GNU absolute paths when used with --source option (source interleaved with the disassembly). This matches GNU's objdump behavior. GNU and C++17 rules for absolute paths are different. Differential Revision: https://reviews.llvm.org/D85024 Fixes PR46368.	2020-10-16 17:50:42 +01:00
Matt Arsenault	0a7cd99a70	Reapply "OpaquePtr: Add type to sret attribute" This reverts commit `eb9f7c28e5`. Previously this was incorrectly handling linking of the contained type, so this merges the fixes from D88973.	2020-10-16 11:05:02 -04:00
Stanislav Mekhanoshin	d1beb95d12	[AMDGPU] gfx1032 target Differential Revision: https://reviews.llvm.org/D89487	2020-10-15 12:41:18 -07:00
Paul C. Anagnostopoulos	4767bb2c0c	[TableGen] Add the !not and !xor operators. Update the TableGen Programmer's Reference.	2020-10-15 10:12:59 -04:00
Konstantin Zhuravlyov	3fdf3b1539	AMDGPU: Update AMDHSA code object version handling Differential Revision: https://reviews.llvm.org/D89076	2020-10-14 13:04:27 -04:00
Scott Linder	3f2386de63	[DebugInfo][docs] Document DILabel in LangRef Add some minimal documentation for DILabel, originally introduced in D45024. Update the name and semantics of the `variables:` field in the documentation for `DISubprogram`; the field is now called `retainedNodes:` and is a heterogeneous list of `DILocalVariable` and `DILabel`. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D89082	2020-10-13 18:26:41 +00:00
Paul C. Anagnostopoulos	04b2191d69	[TableGen] Add new section to the TableGen Programmer's Reference. Fix typos in it and the TableGen Backend Developer's Guide.	2020-10-13 09:59:13 -04:00
Pietro Albini	05ef552e56	Add expected response time and escalation path to the security docs Following up on the discussion within the group during the roundtable at the 2020 LLVM Developers Meeting, this commit adds to the security docs: * How long we expect acknowledging security reports will take * The escalation path the reporter can follow if they get no response A temporary line inviting reporters to directly follow the escalation path while the mailing list is being set up is also added. Differential Revision: https://reviews.llvm.org/D89068	2020-10-13 10:57:06 +02:00
Tobias Hieta	61133e0b11	[llvm-install-name-tool] Add -delete_all_rpaths option This diff adds an option to remove all rpaths from a Mach-O binary. Test plan: make check-all Differential revision: https://reviews.llvm.org/D88674	2020-10-13 00:45:57 -07:00
Wang, Pengfei	412cdcf2ed	[X86] Add HRESET instruction. For more details about these instructions, please refer to the latest ISE document: https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D89102	2020-10-13 08:47:26 +08:00
Paul C. Anagnostopoulos	350fafabe9	[TableGen] Add overload of RecordKeeper::getAllDerivedDefinitions() and use in PseudoLowering backend. Now the two getAllDerivedDefinitions() use StringRef and ArrayRef. Use all_of() in getAllDerivedDefinitions().	2020-10-12 16:40:09 -04:00
Tony	fe145b66ec	[AMDGPU] Correct processor names for gfx1010 and gfx1011 Change-Id: Ie409f86876b0437d0b0405aff42872963708d926 Differential Revision: https://reviews.llvm.org/D89259	2020-10-12 20:16:12 +00:00
Fangrui Song	012dd42e02	[X86] Support -march=x86-64-v[234] PR47686. These micro-architecture levels are defined in the x86-64 psABI: https://gitlab.com/x86-psABIs/x86-64-ABI/-/commit/77566eb03bc6a326811cb7e9 GCC 11 will support these levels. Note, -mtune=x86-64-v[234] are invalid and __builtin_cpu_is cannot be used on them. Reviewed By: craig.topper, RKSimon Differential Revision: https://reviews.llvm.org/D89197	2020-10-12 10:29:46 -07:00
Philip Reames	d89de5a14e	Step down from security group Resigning from security group as Azul representative as I have left Azul. Previously communicated via email with security group. Differential Revision: https://reviews.llvm.org/D88933	2020-10-10 09:48:02 -07:00
Tim Renouf	666ef0db20	[AMDGPU] Add gfx602, gfx705, gfx805 targets At AMD, in an internal audit of our code, we found some corner cases where we were not quite differentiating targets enough for some old hardware. This commit is part of fixing that by adding three new targets: * The "Oland" and "Hainan" variants of gfx601 are now split out into gfx602. LLPC (in the GPUOpen driver) and other front-ends could use that to avoid using the shaderZExport workaround on gfx602. * One variant of gfx703 is now split out into gfx705. LLPC and other front-ends could use that to avoid using the shaderSpiCsRegAllocFragmentation workaround on gfx705. * The "TongaPro" variant of gfx802 is now split out into gfx805. TongaPro has a faster 64-bit shift than its former friends in gfx802, and a subtarget feature could be set up for that to take advantage of it. This commit does not make that change; it just adds the target. V2: Add clang changes. Put TargetParser list in order. V3: AMDGCNGPUs table in TargetParser.cpp needs to be in GPUKind order, so fix the GPUKind order. Differential Revision: https://reviews.llvm.org/D88916 Change-Id: Ia901a7157eb2f73ccd9f25dbacec38427312377d	2020-10-10 17:22:22 +01:00
Alok Kumar Sharma	96bd4d34a2	[DebugInfo] Support for DWARF attribute DW_AT_rank This patch adds support for DWARF attribute DW_AT_rank. Summary: Fortran assumed rank arrays have dynamic rank. DWARF attribute DW_AT_rank is needed to support that. Testing: unit test cases added (hand-written) check llvm check debug-info Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D89141	2020-10-10 17:51:12 +05:30
Zi Xuan Wu	e1c38dd55d	[CSKY 1/n] Add basic stub or infra of csky backend This patch introduces files that are just enough for lib/Target/CSKY to compile. Notably a basic CSKYTargetMachine and CSKYTargetInfo. Differential Revision: https://reviews.llvm.org/D88466	2020-10-10 10:44:08 +08:00
Rahman Lavaee	2b0c5d76a6	Introduce and use a new section type for the bb_addr_map section. This patch lets the bb_addr_map (renamed to __llvm_bb_addr_map) section use a special section type (SHT_LLVM_BB_ADDR_MAP) instead of SHT_PROGBITS. This would help parsers, dumpers and other tools to use the sh_type ELF field to identify this section rather than relying on string comparison on the section name. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D88199	2020-10-08 11:13:19 -07:00
Amara Emerson	283b4d6ba3	[GlobalISel] Add G_VECREDUCE_* opcodes for vector reductions. These mirror the IR and SelectionDAG intrinsics & nodes. Opcodes added: G_VECREDUCE_SEQ_FADD G_VECREDUCE_SEQ_FMUL G_VECREDUCE_FADD G_VECREDUCE_FMUL G_VECREDUCE_FMAX G_VECREDUCE_FMIN G_VECREDUCE_ADD G_VECREDUCE_MUL G_VECREDUCE_AND G_VECREDUCE_OR G_VECREDUCE_XOR G_VECREDUCE_SMAX G_VECREDUCE_SMIN G_VECREDUCE_UMAX G_VECREDUCE_UMIN Differential Revision: https://reviews.llvm.org/D88750	2020-10-08 10:33:19 -07:00
Luqman Aden	568035ac39	[llvm-readobj] Add --coff-tls-directory flag to print TLS Directory & test. Akin to dumpbin's /TLS option, this will print out the TLS directory, if present, in the image. Example output: ``` > llvm-readobj --coff-tls-directory test.exe File: test.exe Format: COFF-x86-64 Arch: x86_64 AddressSize: 64bit TLSDirectory { StartAddressOfRawData: 0x140004000 EndAddressOfRawData: 0x140004040 AddressOfIndex: 0x140002000 AddressOfCallBacks: 0x0 SizeOfZeroFill: 0x0 Characteristics [ (0x0) ] } ``` Reviewed By: jhenderson, grimar Differential Revision: https://reviews.llvm.org/D88635	2020-10-08 01:53:15 -07:00
Serge Guelton	b4ffc40d62	Update documentation and implementation of stage3 build Have the build work out of the box by forcing an LLD build. That way, we don't require an external LTO-aware linker, as we build one. Also remove reference to the seemingly dead builder. Differential Revision: https://reviews.llvm.org/D88990	2020-10-08 07:55:37 +02:00
Amara Emerson	322d0afd87	[llvm][mlir] Promote the experimental reduction intrinsics to be first class intrinsics. This change renames the intrinsics to not have "experimental" in the name. The autoupgrader will handle legacy intrinsics. Relevant ML thread: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140729.html Differential Revision: https://reviews.llvm.org/D88787	2020-10-07 10:36:44 -07:00
Duncan P. N. Exon Smith	7193f72798	docs: Emphasize ArrayRef over SmallVectorImpl The section on SmallVector has a note about preferring SmallVectorImpl for APIs but doesn't mention ArrayRef. Although ArrayRef is discussed elsewhere, let's re-emphasize here. Differential Revision: https://reviews.llvm.org/D49881	2020-10-06 18:13:52 -04:00
Michael Kruse	c3f12dd606	[docs] Revise loop terminology reference. Motivated by D88183, this seeks to clarify the current loop nomenclature with added illustrations, examples for possibly unexpected situations (infinite loops not part of the "parent" loop, logical loops sharing the same header, ...), and clarification on what other sources may consider a loop. The current document also has multiple errors that are fixed here. Some selected errors: * Loops are defined as strongly-connected components. A component is a partition of all nodes, i.e. a subloop can never be a component. That is, the document as it currently is only covers top-level loops, even though it also uses the term SCC for subloops. * "a block can be the header of two separate loops at the same time" (it is considered a single loop by LoopInfo) * "execute before some interesting event happens" (some interesting event is not well-defined) Reviewed By: baziotis, Whitney Differential Revision: https://reviews.llvm.org/D88408	2020-10-05 10:28:04 -05:00
Paul C. Anagnostopoulos	0c1bb4f885	[TableGen] New backend to print detailed records. Pertinent lints are fixed.	2020-10-02 10:22:13 -04:00
Chris Lattner	71dcbe1e88	We don't need two different ways to get commit access, just simplify the policy here so that old SVN users and new contributors do the same thing.	2020-09-30 22:36:44 -07:00
Vedant Kumar	f71849c74e	[docs] Recommend dropLocation() over setDebugLoc(DebugLoc())	2020-09-29 17:07:14 -07:00
Tres Popp	eb9f7c28e5	Revert "OpaquePtr: Add type to sret attribute" This reverts commit `55c4ff91bd`. Issues were introduced as discussed in https://reviews.llvm.org/D88241 where this change made previous bugs in the linker and BitCodeWriter visible.	2020-09-29 10:31:04 +02:00
Arthur Eubanks	da036b4514	[Docs][NewPM] Add note about required passes Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D88342	2020-09-28 21:45:14 -07:00
Paul C. Anagnostopoulos	50a3df585d	[TableGen] Add/edit Doxygen comments to match "TableGen Backend Developer's Guide."	2020-09-26 09:09:22 -04:00
Juneyoung Lee	8bd205bf1d	[LangRef] Clarify the behavior of memory access instructions when pointers/sizes aren't well-defined This is a patch to LangRef that clarifies the behavior of load/store/memset/memcpy/memmove when the pointers or sizes are not well-defined as well. MSan detects a case when e.g., only lower bits of address are garbage when `-msan-check-access-address` is enabled, and it does not directly conflict with this patch because a C program should not use a pointer with undef bits and reasonable optimizations do not convert a well-defined pointer into a pointer with undef bits. This patch contains a definition of a well-defined value as well. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D87994	2020-09-26 08:13:27 +09:00
Matt Arsenault	55c4ff91bd	OpaquePtr: Add type to sret attribute Make the corresponding change that was made for byval in `b7141207a4`. Like byval, this requires a bulk update of the test IR tests to include the type before this can be mandatory.	2020-09-25 14:07:30 -04:00
Ian Levesque	6f7fbdd285	[xray] Function coverage groups Add the ability to selectively instrument a subset of functions by dividing the functions into N logical groups and then selecting a group to cover. By selecting different groups over time you could cover the entire application incrementally with lower overhead than instrumenting the entire application at once. Differential Revision: https://reviews.llvm.org/D87953	2020-09-24 22:09:53 -04:00
Stefanos Baziotis	7aa982a57c	[LoopTerminology][NFC] Fix formatting typo	2020-09-23 22:53:05 +03:00
Mehdi Amini	5281ba1994	Document the `--verbatim` flag from arc to update the description for a phabricator revision	2020-09-23 18:01:10 +00:00
Mehdi Amini	55f5a0137f	Update Phabricator doc to remove the warning on "arc land": tags are properly handled server side now	2020-09-23 18:01:09 +00:00
SuJunda (Junda Su)	270d334a66	[docs][llvm] Fix typos I don't have commit access. Please help me commit it. Thanks : ) Reviewed By: Paul-C-Anagnostopoulos Differential Revision: https://reviews.llvm.org/D88139	2020-09-23 10:19:02 -04:00
Florian Hahn	31923f6b36	[VPlan] Disconnect VPValue and VPUser. This refactors VPUser to not inherit from VPValue to facilitate introducing operations that introduce multiple VPValues (e.g. VPInterleaveRecipe). Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D84679	2020-09-23 14:44:31 +01:00
antonio-cortes-perez	af429cd89b	[NFC][docs] Fix link. The rendered html was (no hyperlink was generated): (see Getting Started <GettingStarted.html#git-pre-push-hook>) Now, it is (with proper hyperlink): (see Git pre-push hook) Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D88116	2020-09-22 23:40:03 +00:00
Paul C. Anagnostopoulos	21f5f509c8	Two patches to fix the broken build. One to fix a C++ compiler warning. One to allow Sphinx to find a new document.	2020-09-22 16:00:31 -04:00
Paul C. Anagnostopoulos	848d66fafd	Version 0.5 of the new "TableGen Backend Developer's Guide." Files modified to take comments into account. MLIR documentation updated for new TableGen documentation files.	2020-09-22 14:01:52 -04:00
antonio-cortes-perez	c82c0f99a5	[docs] Update ExtendingLLVM.rst Updated file paths and function signatures in section "Adding a new type". Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D88049	2020-09-21 16:49:48 -07:00
Alexander Shaposhnikov	53ba045f48	[llvm-install-name-tool] Update the command-line guide	2020-09-17 13:44:26 -07:00
Paul C. Anagnostopoulos	82687cf47b	Add section with details about DAGs.	2020-09-16 09:27:28 -04:00
Han Seoul-Oh	e15996b5c6	[doc] Fix broken link	2020-09-15 09:58:08 +02:00
Xun Li	1f837265eb	[Coroutines] Fix a typo in documentation In the example, the variable crossing the suspend point was referenced incorrectly; fix it. Differential Revision: https://reviews.llvm.org/D83563	2020-09-14 18:56:57 -07:00
Arthur Eubanks	10b12d4035	Reland [docs][NewPM] Add docs for writing NPM passes As to not conflict with the legacy PM example passes under llvm/lib/Transforms/Hello, this is under HelloNew. This makes the CMakeLists.txt and general directory structure less confusing for people following the example. Much of the doc structure was taken from WritingAnLLVMPass.rst. This adds a HelloWorld pass which simply prints out each function name. More will follow after this, e.g. passes over different units of IR, analyses. https://llvm.org/docs/WritingAnLLVMPass.html contains a lot more. Relanded with missing "Support" dependency in LLVMBuild.txt. Reviewed By: ychen, asbirlea Differential Revision: https://reviews.llvm.org/D86979	2020-09-14 16:06:19 -07:00
Arthur Eubanks	39ec36415d	Revert "[docs][NewPM] Add docs for writing NPM passes" This reverts commit `c2590de30d`. Breaks shared libs build	2020-09-14 15:55:17 -07:00
Lang Hames	44da6c2369	[docs] Update OrcV1 removal timeline.	2020-09-14 14:23:20 -07:00
Arthur Eubanks	c2590de30d	[docs][NewPM] Add docs for writing NPM passes As to not conflict with the legacy PM example passes under llvm/lib/Transforms/Hello, this is under HelloNew. This makes the CMakeLists.txt and general directory structure less confusing for people following the example. Much of the doc structure was taken from WritingAnLLVMPass.rst. This adds a HelloWorld pass which simply prints out each function name. More will follow after this, e.g. passes over different units of IR, analyses. https://llvm.org/docs/WritingAnLLVMPass.html contains a lot more. Reviewed By: ychen, asbirlea Differential Revision: https://reviews.llvm.org/D86979	2020-09-14 13:26:03 -07:00
Balazs Benics	d7ae9696e3	[analyzer][docs][NFC] Document the ento namespace in the llvm/Lexicon Document the `ento` namespace in the Lexicon according to @nicolas17 on the mailing list (http://lists.llvm.org/pipermail/cfe-dev/2020-August/066577.html). The analyzer lived at different namespaces at different times. Originally lived at the `GR` aka. (Graph Reachability) namespace [7], later it moved under the `ento` namespace [9]. The Static Analyzer's code lived at many other places as well: `Analysis` -[2]-> `Checker` -[5]-> `GR` -[10]-> `entoSA` -[11]-> `StaticAnalyzer` The relevant code motion, refactor commits, cfe-dev mailing in chronological order: 1) 2008-03-15 Make a major restructuring of the clang tree: introduce a ... `7a51313d8a` 2) 2010-01-25 Split libAnalysis into two libraries: libAnalysis and libChecker `d6b8708643` 3) 2010-12-21 Reorganization of Checker files http://lists.llvm.org/pipermail/cfe-dev/2010-December/012694.html 4) 2010-12-22 Refactoring: include/clang/Checker -> include/clang/GR `8d602a8aa8` 5) 2010-12-22 Refactoring: lib/Checker -> lib/GR `2ff5ab1516` 6) 2010-12-22 Refactoring: Move checkers into lib/GR/Checkers and their own `a700e976b6` 7) 2010-12-22 Refactoring: Move stuff into namespace 'GR' `ca08fba414` 8) 2010-12-22 Refactoring: Drop the 'GR' prefix. `1696f508e2` 9) 2010-12-23 Rename static analyzer namespace 'GR' to 'ento' `98857c9860` 10) 2010-12-23 Rename headers: 'clang/GR' 'clang/EntoSA' and update Makefile `ef33f0996c` 11) 2010-12-23 Chris Lattner has strong opinions about directory `d99bd55a5e` 12) 2010-12-24 Remove the EntoSA directories. `9d6af5328e` Reviewed By: Szelethus,martong,ASDenysPetrov,xazax.hun Differential Revision: https://reviews.llvm.org/D86446	2020-09-14 08:43:56 +02:00
Dave Lee	6e42cadf10	[docs] Document LLVM_EXTERNALIZE_DEBUGINFO CMake option Add `LLVM_EXTERNALIZE_DEBUGINFO` to CMake.rst. This should help make dSYM generation more discoverable. Differential Revision: https://reviews.llvm.org/D87591	2020-09-13 21:39:27 -07:00
Sanjay Patel	3a8ea8609b	[Intrinsics] define semantics for experimental fmax/fmin vector reductions As discussed on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140729.html This is hopefully the final remaining showstopper before we can remove the 'experimental' from the reduction intrinsics. No behavior was specified for the FP min/max reductions, so we have a mess of different interpretations. There are a few potential options for the semantics of these max/min ops. I think this is the simplest based on current behavior/implementation: make the reductions inherit from the existing llvm.maxnum/minnum intrinsics. These correspond to libm fmax/fmin, and those are similar to the (now deprecated?) IEEE-754 maxNum/minNum functions (NaNs are treated as missing data). So the default expansion creates calls to libm functions. Another option would be to inherit from llvm.maximum/minimum (NaNs propagate), but most targets just crash in codegen when given those nodes because no default expansion was ever implemented AFAICT. We could also just assume 'nnan' semantics by default (we are already assuming 'nsz' semantics in the maxnum/minnum intrinsics), but some targets (AArch64, PowerPC) support the more defined behavior, so it doesn't make much sense to not allow a tighter spec. Fast-math-flags (nnan) can be used to loosen the semantics. (Note that D67507 was proposed to update the LangRef to acknowledge the more recent IEEE-754 2019 standard, but that patch seems to have stalled. If we do update based on the new standard, the reduction instructions can seamlessly inherit from whatever updates are made to the max/min intrinsics.) x86 sees a regression here on 'nnan' tests because we have underlying, longstanding bugs in FMF creation/propagation. Those need to be fixed apart from this change (for example: https://llvm.org/PR35538). The expansion sequence before this patch may not have been correct. 
Differential Revision: https://reviews.llvm.org/D87391	2020-09-12 09:10:28 -04:00
YangZhihui	f2bb4b8855	[docs] Fix typos Differential Revision: https://reviews.llvm.org/D87356	2020-09-11 17:58:07 +02:00
YangZhihui	e5d92691bd	Fix typo in dsymutil.rst Differential revision: https://reviews.llvm.org/D87438	2020-09-10 09:46:10 -07:00
Guillaume Chatelet	ed95f7c7ce	Fix broken link for Sphinx installation	2020-09-10 12:27:49 +00:00
Tony	72e2fbde54	[AMDGPU] Correct gfx1031 XNACK setting documentation - gfx1031 does not support XNACK. Differential Revision: https://reviews.llvm.org/D87198	2020-09-09 19:43:02 +00:00
Nate Voorhies	76a2c434f2	Insert missing bracket in docs. Body of unrolled loop was missing opening bracket. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D87329	2020-09-08 15:20:39 -07:00
Paul C. Anagnostopoulos	66310aafa0	fix typos; improve a couple of descriptions; add release note	2020-09-08 15:48:18 -04:00
Paul C. Anagnostopoulos	1f870bd928	Add detailed reference for the SearchableTables backend.	2020-09-08 13:48:12 -04:00
Florian Hahn	1ddb3a369f	[LangRef] Adjust guarantee for llvm.memcpy to also allow equal arguments. This adjusts the description of `llvm.memcpy` to also allow operands to be equal. This is in line with what Clang currently expects. This change is intended to be temporary and to be followed by the re-introduction of a variant with the non-overlapping guarantee for cases where we can actually ensure that property in the front-end. See the links below for more details: http://lists.llvm.org/pipermail/cfe-dev/2020-August/066614.html and PR11763. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D86815	2020-09-05 19:18:23 +01:00
Yang Zhihui	691d436685	Fix typos in doc LangRef.rst Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D87077	2020-09-04 05:17:31 -07:00
JF Bastien	baa74e013f	Step down from security group Propose Ahmed as a replacement. He's fixed many security issues in LLVM for Apple in the last few years, as such he'll fit the "Individual contributors" description. Differential Revision: https://reviews.llvm.org/D86742	2020-09-03 08:44:27 -07:00
Michael Kruse	137dfd616a	[LangRef] Fix condition for when a loop is considered parallel. The wording before this patch applies to llvm.mem.parallel_loop_access, not access groups. Reviewed By: mppf, hfinkel Differential Revision: https://reviews.llvm.org/D83781	2020-09-01 15:41:59 -05:00
Arthur Eubanks	96f0b57568	[Bindings] Add LLVMAddInstructionSimplifyPass Reviewed By: sroland Differential Revision: https://reviews.llvm.org/D86764	2020-09-01 12:38:49 -07:00
Hans Wennborg	40fed00486	First commit on the release/11.x branch.	2020-09-01 11:44:02 -07:00
Arthur Eubanks	61e15ecab5	[docs] Fix indentation in FileCheck.rst Fixes C:\src\llvm-project\llvm\docs\CommandGuide\FileCheck.rst:745:Bullet list ends without a blank line; unexpected unindent.	2020-08-31 13:20:04 -07:00
Alexandre Ganea	9026d3b2f9	Fix sphinx documentation after `a6a37a2fcd`	2020-08-31 08:06:13 -04:00
Thomas Preud'homme	998709b7d5	[FileCheck] Add precision to format specifier Add printf-style precision specifier to pad numbers to a given number of digits when matching them if the value is smaller than the given precision. This works on both empty numeric expression (e.g. variable definition from input) and when matching a numeric expression. The syntax is as follows: [[#%.<precision><format specifier>, ...]] where <format specifier> is optional and ... can be a variable definition or not with an empty expression or not. In the absence of a precision specifier, a variable definition will accept leading zeros. Reviewed By: jhenderson, grimar Differential Revision: https://reviews.llvm.org/D81667	2020-08-30 19:40:57 +01:00
Juneyoung Lee	09dcb52ca8	[LangRef] Apply a missing comment from D86189	2020-08-30 14:56:17 +09:00
Juneyoung Lee	98e5776897	[LangRef] State that storing an aggregate fills padding with undef This patch makes LangRef be explicit about the value of padding when storing an aggregate. It states that when an aggregate is stored into memory, padding is filled with undef. Here is a clue that supports this change (edited to reflect the discussion from llvm-dev): - IPSCCP ignores padding and directly stores a constant aggregate if possible. It loses the data stored in the padding. https://godbolt.org/z/xzenYs Memcpyopt ignores (the preexisting value of) padding when copying an aggregate or storing a constant: https://godbolt.org/z/hY6ndd / https://godbolt.org/z/3WMP5a The two items below are not relevant with this patch because Clang lowers load/store of individual field of struct into load/stores of the corresponding pointer with a primitive type. Also, when copy is needed, it uses memcpy instead of load/store of an aggregate, as discussed in the llvm-dev. However, this patch is still valid (as discussed) because it is needed to explain the two optimizations above. - According to C17, the value of padding bytes when storing values in structures or unions is unspecified. - I updated Alive2 and it did not find any problematic transformation from LLVM unit tests and while running translation validation of a few C programs. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D86189	2020-08-30 14:53:20 +09:00
JF Bastien	82d29b397b	Add an unsigned shift base sanitizer It's not undefined behavior for an unsigned left shift to overflow (i.e. to shift bits out), but it has been the source of bugs and exploits in certain codebases in the past. As we do in other parts of UBSan, this patch adds a dynamic checker which acts beyond UBSan and checks other sources of errors. The option is enabled as part of -fsanitize=integer. The flag is named: -fsanitize=unsigned-shift-base This matches shift-base and shift-exponent flags. <rdar://problem/46129047> Differential Revision: https://reviews.llvm.org/D86000	2020-08-27 19:50:10 -07:00
Alexandre Ganea	a6a37a2fcd	[Support] On Windows, add optional support for {rpmalloc|snmalloc|mimalloc} This patch optionally replaces the CRT allocator (i.e., malloc and free) with rpmalloc (mixed public domain licence/MIT licence) or snmalloc (MIT licence) or mimalloc (MIT licence). Please note that the source code for these allocators must be available outside of LLVM's tree. To enable, use `cmake ... -DLLVM_INTEGRATED_CRT_ALLOC=D:/git/rpmalloc -DLLVM_USE_CRT_RELEASE=MT` where `D:/git/rpmalloc` has already been git clone'd from `https://github.com/mjansson/rpmalloc`. The same applies to snmalloc and mimalloc. When enabled, the allocator will be embedded (statically linked) into the LLVM tools & libraries. This currently only works with the static CRT (/MT), although using the dynamic CRT (/MD) could potentially work as well in the future. When enabled, this changes the memory stack from: new/delete -> MS VC++ CRT malloc/free -> HeapAlloc -> VirtualAlloc to: new/delete -> {rpmalloc|snmalloc|mimalloc} -> VirtualAlloc The goal of this patch is to bypass the application's global heap - which is thread-safe thus inducing locking - and instead take advantage of a modern lock-free, thread cache, allocator. On a 6-core Xeon Skylake we observe a 2.5x decrease in execution time when linking a large scale application with LLD and ThinLTO (12 min 20 sec -> 5 min 34 sec), when all hardware threads are being used (using LLD's flag /opt:lldltojobs=all). On a dual 36-core Xeon Skylake with all hardware threads used, we observe a 24x decrease in execution time (1 h 2 min -> 2 min 38 sec) when linking a large application with LLD and ThinLTO. Clang build times also see a decrease in the range 5-10% depending on the configuration. Differential Revision: https://reviews.llvm.org/D71786	2020-08-27 11:09:46 -04:00
Sjoerd Meijer	ff6dbb2319	Follow up of rGca243b07276a: fixed a typo. NFC.	2020-08-27 10:53:41 +01:00
Sjoerd Meijer	ca243b0727	[LangRef] get.active.lane.mask can produce poison value We had already specified that second argument `n` of this intrinsic is `n > 0`, but now add to this that the result is a poison value if this is not the case. Differential Revision: https://reviews.llvm.org/D86637	2020-08-27 08:57:35 +01:00
Craig Topper	2d13693bfc	[X86] Update release notes for -mtune support.	2020-08-26 16:16:56 -07:00
Arthur Eubanks	486ed88533	[ConstProp] Remove ConstantPropagation As discussed in http://lists.llvm.org/pipermail/llvm-dev/2020-July/143801.html. Currently no users outside of unit tests. Replace all instances in tests of -constprop with -instsimplify. Notable changes in tests: * vscale.ll - @llvm.sadd.sat.nxv16i8 is evaluated by instsimplify, use a fake intrinsic instead * InsertElement.ll - insertelement undef is removed by instsimplify in @insertelement_undef llvm/test/Transforms/ConstProp moved to llvm/test/Transforms/InstSimplify/ConstProp Reviewed By: lattner, nikic Differential Revision: https://reviews.llvm.org/D85159	2020-08-26 15:51:30 -07:00
Juneyoung Lee	24dd04116d	[LangRef] Memset/memcpy/memmove can take undef/poison pointer if the size is 0 According to the current LangRef, Memset/memcpy/memmove can take a null/dangling pointer if the size is zero. (Relevant thread: http://lists.llvm.org/pipermail/llvm-dev/2017-July/115665.html ) This patch expands it and allows the functions to take undef/poison pointers too. This required the updates in the align attribute since it isn't specified what is the alignment of undef/poison pointers. This patch states that their alignment is 1. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D86643	2020-08-27 06:19:28 +09:00
Craig Topper	09288bcbf5	[X86] Add assembler support for .d32 and .d8 mnemonic suffixes to control displacement size. This is an older syntax than the {disp32} and {disp8} pseudo prefixes that were added a few weeks ago. We can reuse most of the support for that to support .d32 and .d8 as well.	2020-08-26 10:45:50 -07:00
Shoaib Meenai	22cd6bee4a	[llvm-libtool-darwin] Address post-commit feedback Address James Henderson's comments on https://reviews.llvm.org/D86359.	2020-08-25 15:04:23 -07:00
Craig Topper	01eb1233db	[X86] Mention -march=sapphirerapids in the release notes. This was just added in `e02d081f2b`.	2020-08-25 11:57:34 -07:00
Sjoerd Meijer	2002bb4878	[LangRef] Revise semantics of intrinsic get.active.lane.mask A first version of get.active.lane.mask was committed in rG7fb8a40e5220. One of the main purposes and uses of this intrinsic is to communicate information from the middle-end to the back-end, but its current definition and semantics make this actually very difficult. The intrinsic was defined as: @llvm.get.active.lane.mask(%IV, %BTC) where %BTC is the Backedge-Taken Count (variable names are different in the LangRef spec). This allows to implicitly communicate the loop tripcount, which can be reconstructed by calculating BTC + 1. But it has been very difficult to prove that calculating BTC + 1 is safe and doesn't overflow. We need complicated range and SCEV analysis, and thus the problem is that this intrinsic isn't really doing what it was supposed to solve. Examples of the overflow checks that are required in the (ARM) back-end are D79175 and D86074, which aren't even complete/correct yet. To solve this problem, we are revising the definitions/semantics for get.active.lane.mask to avoid all the complicated overflow analysis. This means that instead of communicating the BTC, we are now using the loop tripcount. Now using LangRef's variable names, its semantics is changed from: icmp ule (%base + i), %n to: icmp ult (%base + i), %n with %n > 0 and corresponding to the loop tripcount. The intrinsic signature remains the same. Differential Revision: https://reviews.llvm.org/D86147	2020-08-25 16:23:51 +01:00
Yang Zhihui	70b39506a1	[FileCheck][docs] Fix word errors ouput -> output Reviewed By: thopre Differential Revision: https://reviews.llvm.org/D86504	2020-08-25 09:53:52 +01:00
vnalamot	b9496efbb9	[AMDGPU, docs] Fix typos Reviewed By: t-tye, Flakebi Differential Revision: https://reviews.llvm.org/D86340	2020-08-25 00:00:23 +05:30
Sourabh Singh Tomar	f91d18eaa9	[DebugInfo][flang] Added support for representing Fortran assumed length strings This patch adds support for representing Fortran `character(n)`. Primarily patch is based out of D54114 with appropriate modifications. Test case IR is generated using our downstream classic-flang. We're in process of upstreaming flang PR's but classic-flang has dependencies on llvm, so this has to get in first. Patch includes functional test case for both IR and corresponding dwarf, furthermore it has been manually tested as well using GDB. Source snippet: ``` program assumedLength call sub('Hello') call sub('Goodbye') contains subroutine sub(string) implicit none character(len=*), intent(in) :: string print *, string end subroutine sub end program assumedLength ``` GDB: ``` (gdb) ptype string type = character (5) (gdb) p string $1 = 'Hello' ``` Reviewed By: aprantl, schweitz Differential Revision: https://reviews.llvm.org/D86305	2020-08-22 10:13:40 +05:30
Paul C. Anagnostopoulos	196e6f9f18	Replace TableGen range piece punctuator with '...' The TableGen range piece punctuator is currently '-' (e.g., {0-9}), which interacts oddly with the fact that an integer literal's sign is part of the literal. This patch replaces the '-' with the new punctuator '...'. The '-' punctuator is deprecated. Differential Revision: https://reviews.llvm.org/D85585 Change-Id: I3d53d14e23f878b142d8f84590dd465a0fb6c09c	2020-08-21 23:33:57 +02:00
Paul C. Anagnostopoulos	e0c01e6cb0	New TableGen Programmer's Reference document This new TableGen Programmer's Reference document replaces the current Language Introduction and Language Reference documents. It brings all the TableGen reference information into one document. As an experiment, I numbered the sections in the document. See what you think about that. Reviewed By: lattner Differential Revision: https://reviews.llvm.org/D85838 (changes by Nicolai Hähnle <nicolai.haehnle@amd.com>: - fixed build error due to toctree in docs/LangRef/index.rst - fixed reference to ProgRef) Change-Id: Ifbdfa39768b8a460aae2873103d31c7b347aff00	2020-08-21 23:18:32 +02:00
Dmitry Preobrazhensky	3f7985e6ec	[AMDGPU][MC][NFC][DOC] Updated AMD GPU assembler syntax description. Summary of changes: - added description of MTBUF instructions and format modifier; - described limitations of f16 inline constants when used with integer operands; - updated description of gfx9+ flat global addressing modes; - v_accvgpr_write_b32 src0 corrections (gfx908); - minor bugfixing and improvements.	2020-08-21 14:25:14 +03:00
Tony	b690c1157e	[AMDGPU] Correct DWARF register definitions - Rename AMDGPU SCC DWARF register to STATUS since the scalar condition code is a bit within the STATUS register. - Correct bit size of the VCC_64 register to 64 which is the size in wave64 mode. Differential Revision: https://reviews.llvm.org/D86259	2020-08-20 01:15:04 +00:00
Florian Hahn	0814fcb727	[docs] Clarify ENABLE_MODULES uses Clang Header Modules. Suggested post-commit by @dblaikie, thanks!	2020-08-19 17:38:34 +01:00
madhur13490	0313c540c2	[NFC] Fix typo in AMDGPU doc Reviewed By: t-tye, arsenm Differential Revision: https://reviews.llvm.org/D86206	2020-08-19 14:33:26 +00:00
Hongtao Yu	de0c7a044b	[llvm-objdump] Attempt to fix html doc generation issue. https://reviews.llvm.org/D84191 caused a html doc build issue with the changes in `llvm-objdump.rst`. It looks like a blank line is missing from the `code-block` directives. Test Plan: Differential Revision: https://reviews.llvm.org/D86123	2020-08-17 18:06:22 -07:00
Hongtao Yu	819b2d9c79	[llvm-objdump] Symbolize binary addresses for low-noisy asm diff. When diffing disassembly dump of two binaries, I see lots of noises from mismatched jump target addresses and global data references, which unnecessarily causes diffs on every function, making it impractical. I'm trying to symbolize the raw binary addresses to minimize the diff noise. In this change, a local branch target is modeled as a label and the branch target operand will simply be printed as a label. Local labels are collected by a separate pre-decoding pass beforehand. A global data memory operand will be printed as a global symbol instead of the raw data address. Unfortunately, due to the way the disassembler is set up and to be less intrusive, a global symbol is always printed as the last operand of a memory access instruction. This is less than ideal but is probably acceptable from checking code quality point of view since on most targets an instruction can have at most one memory operand. So far only the X86 disassemblers are supported. Test Plan: llvm-objdump -d --x86-asm-syntax=intel --no-show-raw-insn --no-leading-addr : ``` Disassembly of section .text: <_start>: push rax mov dword ptr [rsp + 4], 0 mov dword ptr [rsp], 0 mov eax, dword ptr [rsp] cmp eax, dword ptr [rip + 4112] # 202182 <g> jge 0x20117e <_start+0x25> call 0x201158 <foo> inc dword ptr [rsp] jmp 0x201169 <_start+0x10> xor eax, eax pop rcx ret ``` llvm-objdump -d --symbolize-operands --x86-asm-syntax=intel --no-show-raw-insn --no-leading-addr : ``` Disassembly of section .text: <_start>: push rax mov dword ptr [rsp + 4], 0 mov dword ptr [rsp], 0 <L1>: mov eax, dword ptr [rsp] cmp eax, dword ptr <g> jge <L0> call <foo> inc dword ptr [rsp] jmp <L1> <L0>: xor eax, eax pop rcx ret ``` Note that the jump instructions like `jge 0x20117e <_start+0x25>` without this work is printed as a real target address and an offset from the leading symbol. 
With a change in the optimizer that adds/deletes an instruction, the address and offset may shift for targets placed after the instruction. This will be a problem when diffing the disassembly from two optimizers where there are unnecessary false positives due to such branch target address changes. With `--symbolize-operands`, a label is printed for a branch target instead to reduce the false positives. Similarly, the disassembly of PC-relative global variable references is also prone to instruction insertion/deletion. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D84191	2020-08-17 16:55:12 -07:00
Matt Arsenault	a128292b90	GlobalISel: Make type for lower action more consistently optional Some of the lower implementations were relying on this, however the type was not set depending on which form of the .lower* helper you were using. For instance, if you used an unconditional lower(), the type was never set. Most of the lower actions do not benefit from a type parameter, and just expand in terms of the original operation's types. However, some lowerings could benefit from an additional type hint to combine a promotion and an expansion. An example of this is for add/sub sat. The DAG integer legalization tries to use smarter expansions directly when promoting the integer type, and doesn't always produce the same instruction with a wider type. Treat this as an optional hint argument, that only means something for specific lower actions. It may be useful to generalize this mechanism to pass a full list of type indexes and desired types, but I haven't run into a case like that yet.	2020-08-17 16:24:55 -04:00
Philip Reames	48f4312d4e	Remove inline gc arguments from statepoints The "gc-live" operand bundles were recently added, and all tests have been updated to use that format. A migration period was provided, though it's worth noting these intrinsics are experimental, so formally there is no compatibility requirement. This is an extension to `a96fc46`. "gc-live" hadn't been implemented at the point that patch was initially posted.	2020-08-14 19:44:24 -07:00
Philip Reames	a96fc4638b	Remove deopt and gc transition arguments from gc.statepoint intrinsic (Forgot to land this a couple of weeks back.) In a recent series of changes, I've introduced support for using the respective operand bundle kinds on the statepoint. At the moment, code supports either/or, but there's no need to keep the old support around. For the moment, I am simply changing the specification and verifier to require zero length argument sets in the intrinsic. The intrinsic itself is experimental. Given that, there's no forward serialization needed. The in tree uses and generation have already been updated to use the new operand bundle based forms, the only folks broken by the change will be those with frontends generating statepoints directly and the updates should be easy. Why not go ahead and just remove the arguments entirely? Well, I plan to. But while working on this I've found that almost all of the arguments to the statepoint can be expressed via operand bundles or attributes. Given that, I'm planning a radical simplification of the arguments and figured I'd do one update not several small ones. Differential Revision: https://reviews.llvm.org/D80892	2020-08-14 16:07:40 -07:00
Sameer Arora	1aed1e72e8	[llvm-libtool-darwin] Add support for -l and -L Add support for passing in libraries via `-l` and `-L` options to `llvm-libtool-darwin`. Reviewed by jhenderson, smeenai Differential Revision: https://reviews.llvm.org/D85540	2020-08-14 11:44:17 -07:00
Sameer Arora	bd2853f799	[llvm-libtool-darwin] Add support for -arch_only Add support for -arch_only option for llvm-libtool-darwin. This diff also adds support for accepting universal files as input and flattening them to create the required static library. Supports input universal files containing either Mach-O object files or archives. Differences from cctools' libtool: - `-arch_only` can be specified multiple times - archives containing universal files are considered invalid (libtool allows such archives) Reviewed by jhenderson, smeenai Differential Revision: https://reviews.llvm.org/D84770	2020-08-13 11:08:46 -07:00
Sameer Arora	612b4dda76	[llvm-install-name-tool] Add more documentation Add documentation for the remaining options of `llvm-install-name-tool`. Reviewed by jhenderson, smeenai Differential Revision: https://reviews.llvm.org/D85655	2020-08-13 10:47:47 -07:00
Sebastian Neubauer	ca227d73e1	[AMDGPU] Fix typo. NFC	2020-08-13 10:41:48 +02:00
Jay Foad	fa2b836ea3	[GlobalISel] Add G_ABS This is equivalent to the new llvm.abs intrinsic added by D84125 with is_int_min_poison=0. Differential Revision: https://reviews.llvm.org/D85718	2020-08-11 16:34:37 +01:00
Dávid Bolvanský	36e1fc5f68	[Docs] Fixed missing closing quote character	2020-08-11 11:21:15 +02:00
Fangrui Song	0b7f125219	[llvm-symbolizer] Add back --version and add a -v alias The switch from llvm::cl to OptTable (D83530) dropped --version, which is needed by some users. This patch also adds a -v alias, which is available in GNU addr2line. The version dumping is similar to llvm-objcopy --version (exotic): ``` llvm-symbolizer LLVM (http://llvm.org/): LLVM version 12.0.0git Optimized build with assertions. Default target: x86_64-unknown-linux-gnu Host CPU: skylake-avx512 ``` Reviewed By: dyung, jhenderson Differential Revision: https://reviews.llvm.org/D85624	2020-08-10 08:21:43 -07:00
Kazu Hirata	a31b3893c7	[docs] Fix typos	2020-08-09 19:31:49 -07:00
Sameer Arora	71a1f135e4	[llvm-libtool-darwin] Add support for -D and -U options Add support for `-D` and `-U` options for llvm-libtool-darwin. `-D` allows for using zero for timestamps and UIDs/GIDs. `-U` allows for using actual timestamps and UIDs/GIDs. Reviewed by jhenderson, smeenai Differential Revision: https://reviews.llvm.org/D84209	2020-08-07 14:44:32 -07:00
Sameer Arora	d9a9192984	[llvm-libtool-darwin] Add support for -filelist option Add support for `-filelist` option for llvm-libtool-darwin. `-filelist` option allows for passing in a file containing a list of filenames. Reviewed by jhenderson, smeenai Differential Revision: https://reviews.llvm.org/D84206	2020-08-07 14:29:24 -07:00
Sameer Arora	d6c00edf2e	[FileCheck] Add docs for --allow-empty This diff adds documentation for `allow-empty` flag under FileCheck docs. Reviewed by jhenderson, smeenai, thopre Differential Revision: https://reviews.llvm.org/D83682	2020-08-07 13:27:57 -07:00
Sameer Arora	bb4b70f792	[llvm-install-name-tool] Adds docs for llvm-install-name-tool Adding documentation for llvm-install-name-tool. Reviewed by smeenai, Ktwu Differential Revision: https://reviews.llvm.org/D81944	2020-08-07 12:51:58 -07:00
Bevin Hansson	5de6c56f7e	[Intrinsic] Add sshl.sat/ushl.sat, saturated shift intrinsics. Summary: This patch adds two intrinsics, llvm.sshl.sat and llvm.ushl.sat, which perform signed and unsigned saturating left shift, respectively. These are useful for implementing the Embedded-C fixed point support in Clang, originally discussed in http://lists.llvm.org/pipermail/llvm-dev/2018-August/125433.html and http://lists.llvm.org/pipermail/cfe-dev/2018-May/058019.html Reviewers: leonardchan, craig.topper, bjope, jdoerfert Subscribers: hiraditya, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83216	2020-08-07 15:09:24 +02:00
Bevin Hansson	177735aac7	[LangRef] Minor fixes to intrinsic headers and descriptions. NFC.	2020-08-07 15:09:24 +02:00
Nico Weber	ecbf2b3496	fix doc typo to cycle bots	2020-08-06 21:02:41 -04:00
Tony	ce74e97d9b	[AMDGPU] Correct missing sram-ecc target feature for gfx906 Differential Revision: https://reviews.llvm.org/D85476	2020-08-06 22:12:25 +00:00
Stanislav Mekhanoshin	ea7d0e2996	[AMDGPU] gfx1031 target Differential Revision: https://reviews.llvm.org/D85337	2020-08-05 12:36:26 -07:00
Matt Morehouse	b0c50ef759	Revert "Add libFuzzer shared object build output" This reverts commit `98d91aecb2` since it breaks on platforms without libstdc++.	2020-08-05 12:11:24 -07:00
Matt Morehouse	98d91aecb2	Add libFuzzer shared object build output This change adds a CMake rule to produce shared object versions of libFuzzer (no-main). Like the static library versions, these shared libraries have a copy of libc++ statically linked in. For i386 we don't link with libc++ since i386 does not support mixing position-independent and non-position-independent code in the same library. Patch By: IanPudney Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D84947	2020-08-05 09:03:22 -07:00
Hans Wennborg	271d9c507c	Bump forgotten version nbr in llvm/docs/conf.py	2020-08-05 17:11:59 +02:00
Jordan Rupprecht	4963ca4658	[docs] Document pattern of using CHECK-SAME to skip irrelevant lines This came up during the review for D67656. It's nice but also subtle, so documenting it as an idiom will make tests easier to understand. Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D68061	2020-08-05 11:03:56 +01:00
Florian Hahn	05aa29efd7	[docs] Mention LLVM_ENABLE_MODULES.	2020-08-04 16:59:39 +01:00
Fangrui Song	593e196297	[llvm-symbolizer] Switch command line parsing from llvm::cl to OptTable for the advantage outlined by D83639 ([OptTable] Support grouped short options) Some behavior changes: * -i={0,false} is removed. Use --no-inlines instead. * --demangle={0,false} is removed. Use --no-demangle instead * -untag-addresses={0,false} is removed. Use --no-untag-addresses instead Added a higher level API OptTable::parseArgs which handles optional initial options populated from an environment variable, expands response files recursively, and parses options. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D83530	2020-08-04 08:53:15 -07:00
Simon Pilgrim	cc0b670abf	Fix sphinx "Title underline too short" warning	2020-08-04 16:36:00 +01:00
Simon Pilgrim	6e727551b9	Fix sphinx indentation warning to stop newline in byref section html output.	2020-08-04 16:12:50 +01:00
Simon Pilgrim	feb9d8bd8e	Fix sphinx indentation warning. Don't double indent and make it clear we're referring to the latency mode.	2020-08-04 15:57:46 +01:00
Fangrui Song	bcea3a7a28	Add test utility 'split-file' See https://lists.llvm.org/pipermail/llvm-dev/2020-July/143373.html "[llvm-dev] Multiple documents in one test file" for some discussions. This patch has explored several alternatives. The current semantics are similar to what @dblaikie proposed. `split-file filename output` splits the input file into multiple parts separated by regex `^(.|//)--- filename` and writes each part to the file `output/filename` (`filename` can include path separators). Use case A (organizing input of different formats (e.g. linker script+assembly) in one file). ``` # RUN: split-file %s %t # RUN: llvm-mc %t/asm -o %t.o # RUN: ld.lld -T %t/lds %t.o -o %t This is sometimes better than the %S/Inputs/ approach because the user can see the auxiliary files immediately and doesn't have to open another file. # asm ... # lds ... ``` Use case B (for utilities which don't have built-in input splitting feature): ``` // RUN: split-file %s %t // RUN: llc < %t/1.ll | FileCheck %s --check-prefix=CASE1 // RUN: llc < %t/2.ll | FileCheck %s --check-prefix=CASE2 Combining tests prudently can improve readability. For example, when testing parsing errors if the recovery mechanism isn't possible, grouping the tests in one file can more readily see test coverage/strategy. //--- 1.ll ... //--- 2.ll ... ``` Since this is a new utility, there are no git history concerns for UpperCase variable names. I use lowerCase variable names like mlir/lld. Reviewed By: jhenderson, lattner Differential Revision: https://reviews.llvm.org/D83834	2020-08-03 20:42:09 -07:00
Florian Hahn	599955eb56	Recommit "[IPConstProp] Remove and move tests to SCCP." This reverts commit `59d6e814ce`. The cause for the revert (3 clang tests running opt -ipconstprop) was fixed by removing those lines.	2020-08-02 22:23:54 +01:00
Fangrui Song	c068e9c8c1	[Support][CommandLine] Delete unused llvm::cl::ParseEnvironmentOptions The function was added in 2003. It is not used and can be emulated with ParseCommandLineOptions.	2020-07-31 10:48:09 -07:00
Mircea Trofin	abb8128237	[doc] Describe the header guard style clang-tidy's llvm-header-guard rule references the LLVM style - where it's missing. Differential Revision: https://reviews.llvm.org/D84989	2020-07-30 16:08:07 -07:00
Florian Hahn	59d6e814ce	Revert "[IPConstProp] Remove and move tests to SCCP." This reverts commit `e77624a3be`. Looks like some clang tests manually invoke -ipconstprop via opt.....	2020-07-30 13:06:54 +01:00
Florian Hahn	e77624a3be	[IPConstProp] Remove and move tests to SCCP. As far as I know, ipconstprop has not been used in years and ipsccp has been used instead. This has the potential for confusion and sometimes leads people to spend time finding & reporting bugs as well as updating it to work with the latest API changes. This patch moves the tests over to SCCP. There's one functional difference I am aware of: ipconstprop propagates for each call-site individually, so for functions that are called with different constant arguments it can sometimes produce better results than ipsccp (at much higher compile-time cost). But IPSCCP can be thought to do so as well for internal functions and as mentioned earlier, the pass seems unused in practice (and there are no plans on working towards enabling it anytime). Also discussed on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2020-July/143773.html Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D84447	2020-07-30 12:36:27 +01:00
Tony	629467eb98	[AMDGPU] Fix DWARF extensions User Guide table of contents	2020-07-30 05:10:21 +00:00
Tony	e24f5f3149	[AMDGPU] DWARF proposal changes - Clarify that these are extensions to DWARF 5 and not as yet a proposal. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D70523	2020-07-30 05:07:09 +00:00
Tony	5aa2fd88cf	[AMDGPU] DWARF proposal changes for expression context - Clarify what context is used in DWARF expression evaluation. - Define location descriptions to fully resolve the context and so include the context in their result. - As a consequence of location descriptions being fully resolved, change address spaces so only a swizzled and unswizzled private address space is defined. The lane is now part of the location description context. - Clarify how call frame information is used to fully resolve expressions that specify registers. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D70523	2020-07-30 01:59:22 +00:00
Varun Gandhi	417d3d495f	[docs] [lit] Add a more helpful description for lit.py's -s flag. Reviewed By: yln Differential Revision: https://reviews.llvm.org/D82808	2020-07-28 14:36:03 -07:00
Fangrui Song	dd405f1a53	Revert D83834 "Add test utility 'extract'" This reverts commit `d054c7ee2e`. There are discussions about the utility name, its functionality and user interface. Revert before we reach consensus.	2020-07-28 13:26:33 -07:00
Arthur Eubanks	2ca6c422d2	[FunctionAttrs] Rename functionattrs -> function-attrs To match NewPM pass name, and also for readability. Also rename rpo-functionattrs -> rpo-function-attrs while we're here. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D84694	2020-07-28 09:09:13 -07:00
Jinsong Ji	d28f86723f	Re-land "[PowerPC] Remove QPX/A2Q BGQ/BGP CNK support" This reverts commit `bf544fa1c3`. Fixed the typo in PPCInstrInfo.cpp.	2020-07-28 14:00:11 +00:00
Wei Mi	a23f62343c	Supplement instr profile with sample profile. PGO profile is usually more precise than sample profile. However, PGO profile needs to be collected from loadtest and loadtest may not be representative enough to the production workload. Sample profile collected from production can be used as a supplement -- for functions cold in loadtest but warm/hot in production, we can scale up the related function in PGO profile if the function is warm or hot in sample profile. The implementation contains changes in compiler side and llvm-profdata side. Given an instr profile and a sample profile, for a function cold in PGO profile but warm/hot in sample profile, llvm-profdata will either mark all the counters in the profile to be -1 or scale up the max count in the function to be above hot threshold, depending on the zero counter ratio in the profile. The assumption is if there are too many counters being zero in the function profile, the profile is more likely to cause harm than good, then llvm-profdata will mark all the counters to be -1 indicating the function is hot but the profile is unaccountable. In compiler side, if a function profile with all -1 counters is seen, the function entry count will be set to be above hot threshold but its internal profile will be dropped. In the long run, it may be useful to let compiler support using PGO profile and sample profile at the same time, but that requires more careful design and more substantial changes to make two profiles work seamlessly. The patch here serves as a simple intermediate solution. Differential Revision: https://reviews.llvm.org/D81981	2020-07-27 20:17:40 -07:00
Jinsong Ji	bf544fa1c3	Revert "[PowerPC] Remove QPX/A2Q BGQ/BGP CNK support" This reverts commit `adffce7153`. This is breaking test-suite, revert while investigation.	2020-07-27 21:07:00 +00:00
Jinsong Ji	adffce7153	[PowerPC] Remove QPX/A2Q BGQ/BGP CNK support Per RFC http://lists.llvm.org/pipermail/llvm-dev/2020-April/141295.html no one is making use of QPX/A2Q/BGQ/BGP CNK anymore. This patch removes the support of QPX/A2Q in llvm, BGQ/BGP in clang, CNK support in openmp/polly. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D83915	2020-07-27 19:24:39 +00:00
Matt Morehouse	34ddf0b2b0	Replace fuzzer::FuzzerDriver's INTERFACE marking with new LLVMRunFuzzerDriver. This adds a new extern "C" function that serves the same purpose. This removes the need for external users to depend on internal headers in order to use this feature. It also standardizes the interface in a way that other fuzzing engines will be able to match. Patch By: IanPudney Reviewed By: kcc Differential Revision: https://reviews.llvm.org/D84561	2020-07-27 18:38:04 +00:00
Vy Nguyen	ee7caa7593	Reland [llvm-exegesis] Add benchmark latency option on X86 that uses LBR for more precise measurements. Starting with Skylake, the LBR contains the precise number of cycles between the two consecutive branches. Making use of this will hopefully make the measurements more precise than the existing methods of using RDTSC. Differential Revision: https://reviews.llvm.org/D77422 New change: check for existence of field `cycles` in perf_branch_entry before enabling this mode. This should prevent compilation errors when building for older kernel whose headers don't support it.	2020-07-27 12:38:05 -04:00
Afanasyev Ivan	8b74596b7e	[Docs] remove unused arguments in documentation examples on vectorization passes Reviewers: nadav, tyler.nowicki Reviewed By: nadav Differential Revision: https://reviews.llvm.org/D83851	2020-07-27 10:20:26 +01:00
Fangrui Song	d054c7ee2e	Add test utility 'extract' See https://lists.llvm.org/pipermail/llvm-dev/2020-July/143373.html "[llvm-dev] Multiple documents in one test file" for some discussions. `extract part filename` splits the input file into multiple parts separated by regex `^(.|//)--- ` and extracts the specified part to stdout or the output file (if specified). Use case A (organizing input of different formats (e.g. linker script+assembly) in one file). ``` // RUN: extract lds %s -o %t.lds // RUN: extract asm %s -o %t.s // RUN: llvm-mc %t.s -o %t.o // RUN: ld.lld -T %t.lds %t.o -o %t ``` This is sometimes better than the %S/Inputs/ approach because the user can see the auxiliary files immediately and doesn't have to open another file. Use case B (for utilities which don't have built-in input splitting feature): ``` // RUN: extract case1 %s | llc | FileCheck %s --check-prefix=CASE1 // RUN: extract case2 %s | llc | FileCheck %s --check-prefix=CASE2 ``` Combining tests prudently can improve readability. This is sometimes better than having multiple test files. Since this is a new utility, there is no git history concerns for UpperCase variable names. I use lowerCase variable names like mlir/lld. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D83834	2020-07-23 19:15:35 -07:00
Roman Lebedev	fef0cf0810	[LangRef] Add integer min/max/abs intrinsics Add LangRef specification for the llvm.abs, llvm.umin, llvm.umax, llvm.smin, and llvm.smax integer intrinsics. Link to RFC: https://lists.llvm.org/pipermail/llvm-dev/2020-June/142257.html Proposed alive2 implementation: https://github.com/AliveToolkit/alive2/pull/353 Differential Revision: https://reviews.llvm.org/D81829	2020-07-23 20:56:18 +02:00
Craig Topper	68382d5852	[X86][docs] Add mention of removal of 'mpx' backend feature to the release notes. I removed the feature from X86.td in `ebe5f17f9c`	2020-07-23 08:25:34 -07:00
Russell Gallop	c798628fbd	[docs] Fix TestSuiteGuide.md to mention scipy This has been required since https://reviews.llvm.org/D57828. Differential Revision: https://reviews.llvm.org/D82379	2020-07-23 14:21:59 +01:00
Louis Dionne	afa1afd410	[CMake] Bump CMake minimum version to 3.13.4 This upgrade should be friction-less because we've already been ensuring that CMake >= 3.13.4 is used. This is part of the effort discussed on llvm-dev here: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140578.html Differential Revision: https://reviews.llvm.org/D78648	2020-07-22 14:25:07 -04:00
Sameer Arora	303a7f7a26	[llvm-libtool-darwin] Add support for -static option Add support for creating static libraries when the input includes only Mach-O binaries (and not libraries/archives themselves). Reviewed by alexshap, Ktwu, smeenai, jhenderson, MaskRay, mtrent Differential Revision: https://reviews.llvm.org/D83002	2020-07-21 13:08:49 -07:00
Chris Morin	28da5759bd	Fix typo in tutorial	2020-07-21 17:28:24 +02:00
Yuanfang Chen	589c646a7e	[llc] (almost) remove `--print-machineinstrs` Its effect could be achieved by `-stop-after`,`-print-after`,`-print-after-all`. But a few tests need to print MIR after ISel which could not be done with `-print-after`/`-stop-after` since isel pass does not have commandline name. That's the reason `--print-machineinstrs` is downgraded to `--print-after-isel` in this patch. `--print-after-isel` could be removed after we switch to new pass manager since isel pass would have a commandline text name to use `print-after` or equivalent switches. The motivation of this patch is to reduce tests dependency on would-be-deprecated feature. Reviewed By: arsenm, dsanders Differential Revision: https://reviews.llvm.org/D83275	2020-07-20 10:43:28 -07:00
Alok Kumar Sharma	2d10258a31	[DebugInfo] Support for DW_AT_associated and DW_AT_allocated. Summary: This support is needed for the Fortran array variables with pointer/allocatable attribute. This support enables debugger to identify the status of variable whether that is currently allocated/associated. for pointer array (before allocation/association) without DW_AT_associated (gdb) pt ptr type = integer (140737345375288:140737354129776) (gdb) p ptr value requires 35017956 bytes, which is more than max-value-size with DW_AT_associated (gdb) pt ptr type = integer (:) (gdb) p ptr $1 = <not associated> for allocatable array (before allocation) without DW_AT_allocated (gdb) pt arr type = integer (140737345375288:140737354129776) (gdb) p arr value requires 35017956 bytes, which is more than max-value-size with DW_AT_allocated (gdb) pt arr type = integer, allocatable (:) (gdb) p arr $1 = <not allocated> Testing - unit test cases added - check-llvm - check-debuginfo Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D83544	2020-07-20 19:54:35 +05:30
Matt Arsenault	5e999cbe8d	IR: Define byref parameter attribute This allows tracking the in-memory type of a pointer argument to a function for ABI purposes. This is essentially a stripped down version of byval to remove some of the stack-copy implications in its definition. This includes the base IR changes, and some tests for places where it should be treated similarly to byval. Codegen support will be in a future patch. My original attempt at solving some of these problems was to repurpose byval with a different address space from the stack. However, it is technically permitted for the callee to introduce a write to the argument, although nothing does this in reality. There is also talk of removing and replacing the byval attribute, so a new attribute would need to take its place anyway. This is intended to avoid some optimization issues with the current handling of aggregate arguments, as well as to fix inflexibility in how frontends can specify the kernel ABI. The most honest representation of the amdgpu_kernel convention is to expose all kernel arguments as loads from constant memory. Today, these are raw, SSA Argument values and codegen is responsible for turning these into loads. Background: There currently isn't a satisfactory way to represent how arguments for the amdgpu_kernel calling convention are passed. In reality, arguments are passed in a single, flat, constant memory buffer implicitly passed to the function. It is also illegal to call this function in the IR, and this is only ever invoked by a driver of some kind. It does not make sense to have a stack passed parameter in this context as is implied by byval. It is never valid to write to the kernel arguments, as this would corrupt the inputs seen by other dispatches of the kernel. These arguments are also not in the same address space as the stack, so a copy is needed to an alloca. From a source C-like language, the kernel parameters are invisible. 
Semantically, a copy is always required from the constant argument memory to a mutable variable. The current clang calling convention lowering emits raw values, including aggregates into the function argument list, since using byval would not make sense. This has some unfortunate consequences for the optimizer. In the aggregate case, we end up with an aggregate store to alloca, which both SROA and instcombine turn into a store of each aggregate field. The optimizer never pieces this back together to see that this is really just a copy from constant memory, so we end up stuck with expensive stack usage. This also means the backend dictates the alignment of arguments, and arbitrarily picks the LLVM IR ABI type alignment. By allowing an explicit alignment, frontends can make better decisions. For example, there's really no advantage to an alignment higher than 4, so a frontend could choose to compact the argument layout. Similarly, there is a high penalty to using an alignment lower than 4, so a frontend could opt into more padding for small arguments. Another design consideration is when it is appropriate to expose the fact that these arguments are all really passed in adjacent memory. Currently we have a late IR optimization pass in codegen to rewrite the kernel argument values into explicit loads to enable vectorization. In most programs, unrelated argument loads can be merged together. However, exposing this property directly from the frontend has some disadvantages. We still need a way to track the original argument sizes and alignments to report to the driver. I find using some side-channel, metadata mechanism to track this unappealing. If the kernel arguments were exposed as a single buffer to begin with, alias analysis would be unaware that the padding bits between arguments are meaningless. Another family of problems is there are still some gaps in replacing all of the available parameter attributes with metadata equivalents once lowered to loads. 
The immediate plan is to start using this new attribute to handle all aggregate arguments for kernels. Long term, it makes sense to migrate all kernel arguments, including scalars, to be passed indirectly in the same manner. Additional context is in D79744.	2020-07-20 10:23:09 -04:00
Elvina Yakubova	df952cb914	[llvm-readobj] Print error when executed with no input files This patch changes llvm-readelf (and llvm-readobj for consistency) behavior to print an error when executed with no input files. Reading from stdin can be achieved via a '-' for the input object. Fixes https://bugs.llvm.org/show_bug.cgi?id=46400 Differential Revision: https://reviews.llvm.org/D83704 Reviewed by: jhenderson, MaskRay, sbc, jyknight	2020-07-20 10:39:05 +01:00
Sameer Arora	6c43ed608d	Introducing llvm-libtool-darwin This diff starts the implementation of llvm-libtool-darwin (an llvm based replacement of cctool's libtool). Libtool is used for creating static and dynamic libraries from a bunch of object files given as input. Reviewed by alexshap, smeenai, jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D82923	2020-07-17 08:07:02 -07:00
Clement Courbet	6bddd099ac	Revert "[llvm-exegesis] Add benchmark latency option on X86 that uses LBR for more precise measurements." From @erichkeane: ``` This patch doesn't seem to build for me: /iusers/ekeane1/workspaces/llvm-project/llvm/tools/llvm-exegesis/lib/X86/X86Counter.cpp: In function ‘llvm::Error llvm::exegesis::parseDataBuffer(const char, size_t, const void, const void, llvm::SmallVector<long int, 4>)’: /iusers/ekeane1/workspaces/llvm-project/llvm/tools/llvm-exegesis/lib/X86/X86Counter.cpp:99:37: error: ‘struct perf_branch_entry’ has no member named ‘cycles’ CycleArray->push_back(Entry.cycles); I'm on RHEL7, so I have kernel 3.10, so it doesn't have 'cycles'. According to this: https://elixir.bootlin.com/linux/v4.3/source/include/uapi/linux/perf_event.h#L963 kernel 4.3 is the first time that 'cycles' appeared in this structure. ```	2020-07-17 16:55:17 +02:00
Juneyoung Lee	fd1f8072a8	[LangRef] Mention that freeze does not consider aggregate's paddings Make explicit that freeze does not touch paddings of an aggregate. (Relevant comment: https://reviews.llvm.org/D83752#2152550) This implies that `v = freeze(load p); store v, q` may still leave undef bits or poison in memory if `v` is an aggregate, but it still happens for non-byte integers such as i1. Differential Revision: https://reviews.llvm.org/D83927	2020-07-17 11:53:26 +09:00
Matt Arsenault	a2a3adcc66	Fix incorrect file path in documentation	2020-07-16 15:53:11 -04:00
Jinsong Ji	32d36d9edc	[docs] fix ident in llvm-exegesis.rst	2020-07-16 17:30:09 +00:00
Jinsong Ji	971dd3f150	[docs][lldb] Fix lldb item in releasenotes Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D83962	2020-07-16 17:07:53 +00:00
Vy Nguyen	1360e140cc	[llvm-exegesis] Add benchmark latency option on X86 that uses LBR for more precise measurements. Starting with Skylake, the LBR contains the precise number of cycles between the two consecutive branches. Making use of this will hopefully make the measurements more precise than the existing methods of using RDTSC. Differential Revision: https://reviews.llvm.org/D77422	2020-07-16 12:12:46 -04:00
Sjoerd Meijer	15d058f16e	Follow up of 2b3c505d0f6e: fixed a typo, and added some more formatting. NFC.	2020-07-16 11:16:48 +01:00
Mehdi Amini	221979b691	Document the testing of Analyses in the LLVM testing guide (NFC) This came up in a recent review, someone was wondering where this was all documented and I couldn't find a reference to provide. Differential Revision: https://reviews.llvm.org/D83816	2020-07-15 21:11:49 +00:00
Mehdi Amini	140c296ef5	Clarify a bit the guideline on omitting braces, including more examples (NFC) Like most readability rules, it isn't absolute and there is a matter of taste to it. I think more recent part of the project may be more consistent in the current application of the guideline. I suspect sources like mlir/lib/Dialect/StandardOps/IR/Ops.cpp may be examples of this at the moment. Differential Revision: https://reviews.llvm.org/D82594	2020-07-15 21:11:30 +00:00
Hans Wennborg	7ab7b979d2	Bump the trunk major version to 12 and clear the release notes.	2020-07-15 12:05:05 +02:00
Tim Northover	5165b2b5fd	AArch64+ARM: make LLVM consider system registers volatile. Some of the system registers readable on AArch64 and ARM platforms return different values with each read (for example a timer counter), these shouldn't be hoisted outside loops or otherwise interfered with, but the normal @llvm.read_register intrinsic is only considered to read memory. This introduces a separate @llvm.read_volatile_register intrinsic and maps all system-registers on ARM platforms to use it for the __builtin_arm_rsr calls. Registers declared with asm("r9") or similar are unaffected.	2020-07-15 09:47:36 +01:00
Sjoerd Meijer	2b3c505d0f	[Matrix] Intrinsic descriptions This changes the matrix load/store intrinsic definitions to load/store from/to a pointer, and not from/to a pointer to a vector, as discussed in D83477. This also includes the recommit of "[Matrix] Tighten LangRef definitions and Verifier checks" which adds improved language reference descriptions of the matrix intrinsics and verifier checks. Differential Revision: https://reviews.llvm.org/D83785	2020-07-14 19:58:16 +01:00
Michael Kruse	322e7cfab5	[docs] Update llvm.loop metadata documentation. Loop metadata nodes do not adhere to the documented property: (a) LoopIDs are not unique: Any pass that duplicates IR will do it including its metadata (e.g. LoopVersioning) such that multiple loops are linked with the same LoopID. There is even a test case (Transforms/LoopUnroll/unroll-pragmas-disabled.ll) for multiple loops with the same LoopID. (b) LoopIDs are not persistent: Adding or removing an item from a LoopID can only be done by creating a new MDNode and assigning it to the loop's branch(es). Passes such as LoopUnroll (llvm.loop.unroll.disable) and LoopVectorize (llvm.loop.isvectorized) use this to mark loops to not be transformed multiple times or to avoid that a LoopVersioned original loop is transformed. Update the documentation according to how llvm.loop is used in practice. Differential Revision: https://reviews.llvm.org/D55290	2020-07-14 11:03:57 -05:00
Sjoerd Meijer	4ff7ed3310	Revert "[Matrix] Tighten LangRef definitions and Verifier checks." This reverts commit `f4d29d6e8c`. Hm, some build bot failures, reverting it while I investigate that.	2020-07-12 19:19:25 +01:00
Sjoerd Meijer	f4d29d6e8c	[Matrix] Tighten LangRef definitions and Verifier checks. This tightens the matrix intrinsic definitions in LLVM LangRef and adds corresponding checks to the IR Verifier. Differential Revision: https://reviews.llvm.org/D83477	2020-07-12 19:07:22 +01:00
Sanjay Patel	39009a8245	[DAGCombiner] tighten fast-math constraints for fma fold fadd (fma A, B, (fmul C, D)), E --> fma A, B, (fma C, D, E) This is only allowed when "reassoc" is present on the fadd. As discussed in D80801, this transform goes beyond what is allowed by "contract" FMF (-ffp-contract=fast). That is because we are fusing the trailing add of 'E' with a multiply, but without "reassoc", the code mandates that the products AB and CD are added together before adding in 'E'. I've added this example to the LangRef to try to clarify the meaning of "contract". If that seems reasonable, we should probably do something similar for the clang docs because there does not appear to be any formal spec for the behavior of -ffp-contract=fast. Differential Revision: https://reviews.llvm.org/D82499	2020-07-12 08:51:49 -04:00
JF Bastien	7bf73bcf6d	[docs] LLVM Security Group and Process Summary: See the corresponding RFC on llvm-dev for a discussion of this proposal. http://lists.llvm.org/pipermail/llvm-dev/2019-November/136839.html Subscribers: jkorous, dexonsmith, arphaman, ributzka, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70326	2020-07-10 15:24:02 -07:00
Matt Arsenault	31f4e43f3f	AMDGPU: Remove .value_type from kernel metadata This doesn't appear used for anything, and is emitted incorrectly based on the description. This also depends on the IR type, and pointee element type.	2020-07-10 18:16:31 -04:00
Joel E. Denny	6dda6ff0e0	[FileCheck] Fix up -dump-input* docs In FileCheck.rst, add `-dump-input-context` and `-dump-input-filter`, and fix some `-dump-input` documentation. In `FileCheck -help`, `cl::value_desc("kind")` is being ignored for `-dump-input-filter`, so just drop it. Extend `-dump-input=help` to mention FILECHECK_OPTS.	2020-07-10 17:21:01 -04:00
Roman Lebedev	29a9dd5bfe	[Docs] CodingStandards: for_each is discouraged Summary: As per discussion in D83351, using `for_each` is potentially confusing, at least in regards to inconsistent style (there's less than 100 `for_each` usages in LLVM, but ~100.000 `for` range-based loops). Therefore, it should be avoided. Reviewers: dblaikie, nickdesaulniers Reviewed By: dblaikie, nickdesaulniers Subscribers: hubert.reinterpretcast, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83431	2020-07-09 23:10:42 +03:00
Oliver Stannard	dc4a6f5db4	[llvm-objdump] Display locations of variables alongside disassembly This adds the --debug-vars option to llvm-objdump, which prints locations (registers/memory) of source-level variables alongside the disassembly based on DWARF info. A vertical line is printed for each live-range, with a label at the top giving the variable name and location, and the position and length of the line indicating the program counter range in which it is valid. Differential revision: https://reviews.llvm.org/D70720	2020-07-09 09:58:00 +01:00
Vitaly Buka	e38727a0bb	[StackSafety,NFC] Update documentation It's a follow-up for D80908. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D82941	2020-07-08 23:57:13 -07:00
Mitch Phillips	5a98581d19	[NFC] Fix some docs warnings Summary: Fixes two minor issues in the docs present under `ninja docs-llvm-html`: 1 - A header is too small: ``` Warning, treated as error: llvm/llvm/docs/Passes.rst:70:Title underline too short. ``-basic-aa``: Basic Alias Analysis (stateless AA impl) ------------------------------------------------------ ``` 2 - Multiple definitions on a non-anonymous target (llvm-dev mailing list): ``` Warning, treated as error: llvm/llvm/docs/DeveloperPolicy.rst:3:Duplicate explicit target name: "llvm-dev mailing list". ``` Reviewers: lattner Reviewed By: lattner Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83416	2020-07-08 16:30:12 -07:00
Gui Andrade	89f1ad88b3	[LangRef] Introduce `noundef` attribute for fully defined function params LLVM currently does not require function parameters or return values to be fully initialized, and does not care if they are poison. This can be useful if the frontend ABI makes no such demands, but may prevent helpful backend transformations in case they do. Specifically, the C and C++ languages require all scalar function operands to be fully determined. Introducing this attribute is of particular use to MemorySanitizer today, although other transformations may benefit from it as well. We can modify MemorySanitizer instrumentation to provide modest (17%) space savings where `frozen` is present. This commit only adds the attribute to the Language Reference, and the actual implementation of the attribute will follow in a separate commit. Differential Revision: https://reviews.llvm.org/D82316	2020-07-08 19:02:04 +00:00
Arthur Eubanks	470bf7b5a2	[Preallocated] Add @llvm.call.preallocated.teardown This cleans up the stack allocated by a @llvm.call.preallocated.setup. Should either call the teardown or the preallocated call to clean up the stack. Calling both is UB. Add LangRef. Add verifier check that the token argument is a @llvm.call.preallocated.setup. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D83354	2020-07-08 08:48:44 -07:00
Michał Górny	446e3df254	[llvm] [docs] Do not require recommonmark for manpage build Do not enforce recommonmark dependency if sphinx is called to build manpages. In order to do this, try to import recommonmark first and do not configure it if it's not available. Additionally, declare a custom tag for the selected builder via CMake, and ignore recommonmark import failure when 'man' target is used. This will permit us to avoid the problematic recommonmark dependency for the majority of Gentoo users that do not need to locally build the complete documentation but want to have tool manpages. Differential Revision: https://reviews.llvm.org/D83161	2020-07-07 20:59:02 +02:00
Chris Lattner	79b30af0ec	Expand the LLVM Developer Policy to include new sections on adding a project to the LLVM Monorepo, and a second about the LLVM Incubator projects. Differential Revision: https://reviews.llvm.org/D83182	2020-07-07 10:30:24 -07:00
Nico Weber	003ea14220	fix typos to cycle bots	2020-07-06 20:37:11 -04:00
jasonliu	572dde55ee	[XCOFF][AIX] Use 'L..' instead of '.L' for getPrivateGlobalPrefix in DataLayout Summary: D80831 changed part of the prefix usage for AIX. But there are other places getting prefix from DataLayout. This patch intends to make prefix usage consistent on AIX. Reviewed by: hubert.reinterpretcast, daltenty Differential Revision: https://reviews.llvm.org/D81270	2020-07-03 18:25:14 +00:00
Dmitry Preobrazhensky	1c9d681092	[AMDGPU][CODEGEN] Added support of new inline assembler constraints Added support for constraints 'I', 'J', 'B', 'C', 'DA', 'DB'. See https://gcc.gnu.org/onlinedocs/gcc/Machine-Constraints.html#Machine-Constraints. Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D81651	2020-07-02 17:20:15 +03:00
Tony	31fdcf64d2	[AMDGPU] Update DWARF proposal - Add reference to implicit conversion description.	2020-07-01 20:35:15 +00:00
Tony	76b2d9cbeb	[AMDGPU] Correct AMDGPUUsage.rst DW_AT_LLVM_lane_pc example - Correct typo of DW_OP_xaddr to DW_OP_addrx in AMDGPUUsage.rst for DW_AT_LLVM_lane_pc example. Change-Id: I1b0ee2b24362a0240388e4c2f044c1d4883509b9	2020-07-01 08:23:15 +00:00
Arthur Eubanks	f9348f70c2	[Docs][BasicAA] Rename some more basicaa -> basic-aa Follow up to https://reviews.llvm.org/D82607.	2020-06-30 17:03:45 -07:00
Arthur Eubanks	8b6f675f44	Fix wrong title underline length	2020-06-30 16:02:45 -07:00
Arthur Eubanks	3dfe1440ae	[Docs][BasicAA] Rename -basicaa to -basic-aa in docs Follow up to https://reviews.llvm.org/D82607.	2020-06-30 15:54:28 -07:00
Eric Christopher	8164f69e4c	Update the phabricator docs to reflect the monorepo change. Patch by Nathan Froyd! Differential Revision: https://reviews.llvm.org/D82389	2020-06-30 10:53:38 -07:00
Jonas Devlieghere	29ea1b4baa	[Sphinx] Support older recommonmark versions. The "new way" of enabling recommonmark is only supported in recommonmark 0.5 and later. Use the deprecated approach with versions of Sphinx that still support it. If I understand correctly there's no way to use older versions of recommonmark (<0.5) with newer versions of Sphinx (>3.0) because the old approach got removed. Differential revision: https://reviews.llvm.org/D75284	2020-06-29 09:48:34 -07:00
Mike Edwards	8cd117c24f	[LIT] Correcting max-failures option in lit documentation.	2020-06-27 14:57:04 -07:00
Gui Andrade	9aa9855a9c	[Docs] BitCodeFormat.rst: List missing attribute codes	2020-06-27 06:34:36 +00:00
Matt Arsenault	b091c9a3e1	LLParser: Accept align(N) as new syntax for parameter attribute Every other value parameter attribute uses parentheses, so accept this as the preferred modern syntax. Updating everything to use the new syntax is left for a future change.	2020-06-26 18:10:21 -04:00
Tony	990f8702c9	[AMDGPU] Define DWARF encoding for condition code registers Summary: - Define DWARF register numbers for vector and scalar condition codes. - Document intended purpose of reserved DWARF register numbers. Reviewers: yaxunl, kzhuravl, arsenm, rampitec, b-sumner Subscribers: jvesely, wdng, nhaehnle, aprantl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82519	2020-06-26 17:53:55 -04:00
Mehdi Amini	4abf024336	Remove references to the 4.0 release as a major breaking (NFC) This is cleaning up comments (mostly in the bitcode handling) about removing some backward compatibility aspect in the 4.0 release. Historically, "4.0" was used during the development of the 3.x versions as "this future major breaking change version". At the time the major number was used to indicate the compatibility. When we reached 3.9 we decided to change the numbering, instead of going to 3.10 we went to 4.0 but after changing the meaning of the major number to not mean anything anymore with respect to bitcode backward compatibility. The current policy (https://llvm.org/docs/DeveloperPolicy.html#ir-backwards-compatibility) indicates only now: The current LLVM version supports loading any bitcode since version 3.0. Differential Revision: https://reviews.llvm.org/D82514	2020-06-25 23:49:07 +00:00
Djordje Todorovic	95435117ad	[docs][llvm-dwarfdump] Fix the warnings during docs-llvm-html build Before the fix the build of docs-llvm-html would fail. The D80959 introduced options that are not recognized, so we have warning as: llvm-project/llvm/docs/CommandGuide/llvm-dwarfdump.rst:40\ :unknown option: --debug-info Differential Revision: https://reviews.llvm.org/D82460	2020-06-25 11:04:28 +02:00
Djordje Todorovic	019d7a32fe	[docs][GlobalISel] Fix the warnings during docs-llvm-html build Before the fix the build of docs-llvm-html would fail. The rG8bc03d216824 introduced a reference to an undefined label, so we have warning as: llvm-project/llvm/docs/GlobalISel/GenericOpcode.rst:295:\ undefined label: i_intr_llvm_ptrmask (if the link has no\ caption the label must precede a section header)	2020-06-25 10:53:39 +02:00
Mehdi Amini	a61c73dbe3	Add a git hook script that can be manually setup to run some checks on every push Right now it just catches arcanist noisy tags, and include a script to automatically clean these. Follow up on http://lists.llvm.org/pipermail/llvm-dev/2019-December/137848.html Differential Revision: https://reviews.llvm.org/D80978	2020-06-24 21:13:43 +00:00
Vedant Kumar	d65cdb498f	[docs] Fix typo	2020-06-24 11:51:21 -07:00
Tony	ea6df2fb8f	[AMDGPU] Update AMD GPU processor information Summary: - Add product names for some processors. - Correct XNACK support for a processor. Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82348	2020-06-23 18:47:56 -04:00
Peter Collingbourne	bd7defeb94	llvm-nm: Implement --special-syms. Differential Revision: https://reviews.llvm.org/D82251	2020-06-22 13:05:47 -07:00
Zhi Zhuang	37fb860301	Add support of __builtin_expect_with_probability Add a new builtin-function __builtin_expect_with_probability and intrinsic llvm.expect.with.probability. The interface is __builtin_expect_with_probability(long expr, long expected, double probability). It is mainly the same as __builtin_expect besides one more argument indicating the probability of expression equal to expected value. The probability should be a constant floating-point expression and be in range [0.0, 1.0] inclusive. It is similar to builtin-expect-with-probability function in GCC built-in functions. Differential Revision: https://reviews.llvm.org/D79830	2020-06-22 10:21:28 -07:00
Nikita Popov	93a0f0e4fe	[LangRef] Fix sphinx warnings	2020-06-21 13:51:07 +02:00
Nikita Popov	f26b420194	[Docs] Fix code block in MemorySSA docs (NFC)	2020-06-21 13:47:00 +02:00
Eric Christopher	8116d01905	Typos around a -> an.	2020-06-20 14:04:48 -07:00
Eric Christopher	ae2fa770e1	[docs/examples] As part of using inclusive language within the llvm project, migrate away from the use of blacklist and whitelist.	2020-06-20 00:51:18 -07:00
Matt Arsenault	ae5adb8da5	AMDGPU: Update private null pointer value in documentation Private pointers used to work around IR semantics by artificially reserving an object at offset 0 so no user object would be allocated there. Since alloca now uses a non-0 address space, that workaround is unnecessary and 0 can be treated as a valid pointer.	2020-06-18 17:27:19 -04:00
Vedant Kumar	b4459b597a	[docs] Specify rules for updating debug locations Summary: Restructure HowToUpdateDebugInfo.rst to specify rules for when transformations should preserve, merge, or drop debug locations. The goal is to have clear, well-justified rules that come with a few examples and counter-examples, so that pass authors can pick the best strategy for managing debug locations depending on the specific task at hand. I've tried to set down sensible rules here that mostly align with what we already do in llvm today, and that take a diverse set of use cases into account (interactive debugging, crash triage, SamplePGO). Please do try to pick these rules apart and suggest clarifications or improvements :). Side note: Prior to `24660ea1`, this document was structured as a long list of very specific code transformations -- the idea being that we would fill in what to do in each specific case. I chose to reorganize the document as a list of actions to take because it drastically cuts down on the amount of redundant exposition/explanation needed. I hope that's fine... Reviewers: jmorse, aprantl, dblaikie Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81198	2020-06-18 14:05:45 -07:00
Jonas Devlieghere	9989e81679	[Sphinx] Adjust for source_parsers deprecation in Sphinx 3.0 Update the Sphinx configuration for the removal of source_parsers in Sphinx 3.0. The variable has been deprecated since version 1.8. > Version 1.8 deprecates and version 3.0 removes the source_parsers > configuration variable that was used by older recommonmark versions. https://www.sphinx-doc.org/en/master/usage/markdown.html Differential revision: https://reviews.llvm.org/D75284	2020-06-18 14:05:11 -07:00
Amara Emerson	84167a8d58	[docs] Clarify semantics of ordered fadd/fmul reductions. Differential Revision: https://reviews.llvm.org/D82034	2020-06-18 09:10:43 -07:00
Jean-Michel Gorius	b2f2adee00	[llvm][docs] Document the LLVM_INSTALL_UTILS CMake option (NFC)	2020-06-18 15:31:13 +02:00
Florian Hahn	6d18c2067e	[Matrix] Update load/store intrinsics. This patch adjusts the load/store matrix intrinsics, formerly known as llvm.matrix.columnwise.load/store, to improve the naming and allow passing of extra information (volatile). The patch performs the following changes: * Rename columnwise.load/store to column.major.load/store. This is more expressive and also more in line with the naming in Clang. * Changes the stride arguments from i32 to i64. The stride can be larger than i32 and this makes things more uniform with the way things are handled in Clang. * A new boolean argument is added to indicate whether the load/store is volatile. The lowering respects that when emitting vector load/store instructions * MatrixBuilder is updated to require both Alignment and IsVolatile arguments, which are passed through to the generated intrinsic. The alignment is set using the `align` attribute. The changes are grouped together in a single patch, to have a single commit that breaks the compatibility. We probably should be fine with updating the intrinsics, as we did not yet officially support them in the last stable release. If there are any concerns, we can add auto-upgrade rules for the columnwise intrinsics though. Reviewers: anemet, Gerolf, hfinkel, andrew.w.kaylor, LuoYuanke, nicolasvasilache, rjmccall, ftynse Reviewed By: anemet, nicolasvasilache Differential Revision: https://reviews.llvm.org/D81472	2020-06-18 09:44:52 +01:00
Zequan Wu	bbf89644b5	[llvm-readobj] set --elf-cg-profile as alias of --cg-profile Summary: Rename --elf-cg-profile to --cg-profile and keep --elf-cg-profile as an alias of --cg-profile. Reviewers: jhenderson, MaskRay, espindola, hans Reviewed By: jhenderson, MaskRay Subscribers: emaste, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81855	2020-06-17 11:24:45 -07:00
Paul Walker	95db1e7fb9	[FileCheck] Implement * and / operators for ExpressionValue. Subscribers: arichardson, hiraditya, thopre, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80915	2020-06-17 09:39:17 +00:00
Matt Arsenault	59ce6ffe2d	GlobalISel: Add a note to G_BITCAST documentation This is currently different from the IR rules.	2020-06-16 11:04:46 -04:00
Erich Keane	fad9cba8f5	[Docs] Add missing space, requested on `c08ea07`	2020-06-15 16:20:32 -07:00
Stanislav Mekhanoshin	9ee272f13d	[AMDGPU] Add gfx1030 target Differential Revision: https://reviews.llvm.org/D81886	2020-06-15 16:18:05 -07:00
Dan Gohman	6604295959	[WebAssembly] WebAssembly doesn't support "protected" visibility Implement the `hasProtectedVisibility()` hook to indicate that, like Darwin, WebAssembly doesn't support "protected" visibility. On ELF, "protected" visibility is intended to be an optimization, however in practice it often [isn't], and ELF documentation generally ranges from [not mentioning it at all] to [strongly discouraging its use]. [isn't]: https://www.airs.com/blog/archives/307 [not mentioning it at all]: https://gcc.gnu.org/wiki/Visibility [strongly discouraging its use]: https://www.akkadia.org/drepper/dsohowto.pdf While here, also mention the new Reactor support in the release notes.	2020-06-12 19:52:35 -07:00
Erich Keane	884fb45ed2	Update Kaleidoscope tutorial inline code Reported on IRC, the tutorial code at the bottom of the page correctly namespaces the FunctionPassManager, but the as-you-go code does not. This patch adds the namespace to those.	2020-06-12 12:02:35 -07:00
Cyndy Ishida	28fefcc83c	[llvm][llvm-nm] add TextAPI/MachO support Summary: This completes the needed glueing to support reading tbd files from nm. This includes specifying which slice filtering with `--arch` and a new option specifically for tbd files `--add-inlinedinfo` which will show the reexported libraries that are appended in the tbd file. Reviewers: ributzka, steven_wu, JDevlieghere, jhenderson Reviewed By: JDevlieghere Subscribers: hiraditya, MaskRay, dexonsmith, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81614	2020-06-11 18:54:16 -07:00
Erich Keane	c08ea07716	Add to the Coding Standard our rule that single-line bodies omit braces This is a rule that seems to have been enforced for the better part of the decade, so we should document it for new contributors. Differential Revision: https://reviews.llvm.org/D80947	2020-06-11 12:46:15 -07:00
Thomas Preud'homme	47934c7cf9	FileCheck [11/12]: Add matching constraint specification This patch is part of a patch series to add support for FileCheck numeric expressions. This specific patch adds support for specifying the matching constraint for a numeric expression, i.e. how the value being matched should relate to the numeric expression. This commit only adds the equality constraint where the numeric value matched must be equal to the numeric expression. It is the default matching constraint used when not specified. It is added to provision other matching constraints (e.g. inequality relations). Copyright: - Linaro (changes up to diff 183612 of revision D55940) - GraphCore (changes in later versions of revision D55940 and in new revision created off D55940) Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D60391	2020-06-10 15:56:10 +01:00
Vitaly Buka	4666953ce2	[StackSafety] Add info into function summary Summary: This patch adds an optional field into function summary, implements asm and bitcode serialization. YAML serialization is omitted and can be added later if needed. This patch includes this information into summary only if module contains at least one sanitize_memtag function. In the near future MTE is the user of the analysis. Later if needed we can provide more direct control on when information is included into summary. Reviewers: eugenis Subscribers: hiraditya, steven_wu, dexonsmith, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80908	2020-06-10 02:43:28 -07:00
Paul Walker	8fd2270370	[FileCheck] Add function call support to numerical expressions. This patch extends numerical expressions to allow calls to predefined functions. These calls can be combined with the existing numerical operators, which includes nesting calls. The call syntax is: <func>(<args>) Where <func> is a predefined string literal, currently limited to one of add, max, min and sub. <args> is a comma-separated list of numerical expressions. Subscribers: arichardson, hiraditya, thopre, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79936	2020-06-10 09:42:00 +00:00
Mehdi Amini	d31c9e5a46	Change filecheck default to dump input on failure Having the input dumped on failure seems like a better default: I debugged FileCheck tests for a while without knowing about this option, which really helps to understand failures. Remove `-dump-input-on-failure` and the environment variable FILECHECK_DUMP_INPUT_ON_FAILURE which are now obsolete. Differential Revision: https://reviews.llvm.org/D81422	2020-06-09 18:57:46 +00:00
Sanjay Patel	ad19b9cead	[Docs] fix typos for llvm-mca; NFC	2020-06-07 11:14:24 -04:00
Mike Edwards	972a73a347	[LIT] NFC adding max-failures option to lit documentation. Differential Revision: https://reviews.llvm.org/D81337	2020-06-06 18:26:45 -07:00
madhur13490	bca413b036	Fix a typo in AMDGPU docs Reviewers: t-tye, arsenm Reviewed By: arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81247	2020-06-05 13:30:17 +00:00
Mircea Trofin	fa42620afb	[docs] Referenced llvm workflow in HowToAddABuilder Reviewers: gkistanova, dblaikie Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81046	2020-06-04 16:39:11 -07:00
Vedant Kumar	24660ea11c	[docs] HowToUpdateDebugInfo: Minor cleanups - Change the reference to salvageDebugInfoOrUndef to salvageDebugInfo (in accordance with https://reviews.llvm.org/D78369). - Reorganize a few sections in preparation for an upcoming change that attempts to specify rules for updating debug locations. - Fix some intra-document links. - Some spelling / wording fixes.	2020-06-04 14:56:01 -07:00
Yuanfang Chen	f9ea86eaa1	[Docs] Add the entry for `Advanced builds` in UserGuide.rst Also add a link to it from ThinLTO.rst.	2020-06-04 14:52:51 -07:00
Jan Korous	5f5d972d83	[docs] Fix self-contradictory description of llvm_unreachable Just two paragraphs above it says: "If the compiler does not support this [skipping code generation for a particular branch], it will fall back to the "abort" implementation." And that actually correctly describes llvm_unreachable implementation. Differential Revision: https://reviews.llvm.org/D81130	2020-06-04 11:15:20 -07:00
Yevgeny Rouban	dcfa78a4cc	Extend InvokeInst !prof branch_weights metadata to unwind branches Allow InvokeInst to have the second optional prof branch weight for its unwind branch. InvokeInst is a terminator with two successors. It might have its unwind branch taken many times. If so the BranchProbabilityInfo unwind branch heuristic can be inaccurate. This patch allows a higher accuracy calculated with both branch weights set. Changes: - A new section about InvokeInst is added to the BranchWeightMetadata page. It states the old information that was missing in the doc and adds new information about the second branch weight. - Verifier is changed to allow either 1 or 2 branch weights for InvokeInst. - A new test is written for BranchProbabilityInfo to demonstrate the main improvement of the simple fix in calcMetadataWeights(). - Several new testcases are created for Inliner. Those check that both weights are accounted for invoke instruction weight calculation. - PGOUseFunc::setBranchWeights() is fixed to be applicable to InvokeInst. Reviewers: davidxl, reames, xur, yamauchi Tags: #llvm Differential Revision: https://reviews.llvm.org/D80618	2020-06-04 15:37:15 +07:00
Philip Reames	0e7c77053f	Introduce a "gc-live" bundle for the gc arguments of a statepoint Currently, gc.relocates are defined in terms of indices into the statepoint's operand list. Given the gc args are at the end of a variable length list of operands, this makes interpreting their indices by hand a tad challenging. We can simplify the statepoint sequence and improve readability quite a bit by pulling these new operands into their own named operand bundle. This patch defines a new operand bundle tag "gc-live". The semantics of the bundle are the same as the existing gc arguments of a statepoint. This patch simply introduces the definition and codegen for the bundle, future patches will migrate RS4GC to emitting the new form. Interestingly, with this done and the recent migration to using deopt and gc-transition bundles, we really don't have much left in the statepoint itself. It really looks like the existing ID and flags fields are redundant; we have (existing!) attributes for all of them. I think we'll be able to reduce the gc.statepoint signature to simply a wrapped call (e.g. actual target and actual arguments). Differential Revision: https://reviews.llvm.org/D80937	2020-06-03 15:00:24 -07:00
Braedy Kuzma	90e291912a	[LangRef] Fix description of shape args for matrix.multiply. Currently all code instances within the matrix lowering pass consider matrix A to be MxN and B to be NxK, producing C which is MxK. Anyone interacting with this API after reading the docs but without reading the pass would expect A: MxK, B: KxN, and C: MxN. These changes bring the documentation in line with the implementation. One point of concern with this, the original signature as described in the docs may be better or at least more expected. The interface as it was written reflected other common matrix multiplication interfaces such as BLAS'[1], where the matrices are MxK, KxN, MxN respectively. Choosing to honor this requires changing code and tests instead, but should be mostly just renaming of variables. Patch by Braedy Kuzma <braedy@ualberta.ca> [1] http://www.netlib.org/lapack/explore-html/db/dc9/group__single__blas__level3_gafe51bacb54592ff5de056acabd83c260.html#gafe51bacb54592ff5de056acabd83c260 Reviewers: anemet, LuoYuanke, nicolasvasilache, fhahn Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D80663	2020-06-03 11:25:44 +01:00
Nick Desaulniers	8eda71616f	[Clang][A32/T32][Linux] -O1 implies -fomit-frame-pointer Summary: An upgrade of LLVM for CrOS [0] containing [1] triggered a bunch of errors related to writing to reserved registers for a Linux kernel's arm64 compat vdso (which is an aarch32 image). After a discussion on LKML [2], it was determined that -f{no-}omit-frame-pointer was not being specified. Comparing GCC and Clang [3], it becomes apparent that GCC defaults to omitting the frame pointer implicitly when optimizations are enabled, and Clang does not. ie. setting -O1 (or above) implies -fomit-frame-pointer. Clang was defaulting to -fno-omit-frame-pointer implicitly unless -fomit-frame-pointer was set explicitly. Why this becomes a problem is that the Linux kernel's arm64 compat vdso contains code that uses r7. r7 is used sometimes for the frame pointer (for example, when targeting thumb (-mthumb)). See useR7AsFramePointer() in llvm/llvm-project/llvm/lib/Target/ARM/ARMSubtarget.h. This is mostly for legacy/compatibility reasons, and the 2019 Q4 revision of the ARM AAPCS looks to standardize r11 as the frame pointer for aarch32, though this is not yet implemented in LLVM. Users that are reliant on the implicit value if unspecified when optimizations are enabled should explicitly choose -fomit-frame-pointer (new behavior) or -fno-omit-frame-pointer (old behavior). [0] https://bugs.chromium.org/p/chromium/issues/detail?id=1084372 [1] https://reviews.llvm.org/D76848 [2] https://lore.kernel.org/lkml/20200526173117.155339-1-ndesaulniers@google.com/ [3] https://godbolt.org/z/0oY39t Reviewers: kristof.beyls, psmith, danalbert, srhines, MaskRay, ostannard, efriedma Reviewed By: psmith, danalbert, srhines, MaskRay, efriedma Subscribers: efriedma, olista01, MaskRay, vhscampos, cfe-commits, llvm-commits, manojgupta, llozano, glider, hctim, eugenis, pcc, peter.smith, srhines Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D80828	2020-06-02 15:54:14 -07:00
Diego Caballero	b78b98491a	Update 'git push' command in GettingStarted guide 'git push' command, without any other arguments, can do different things depending on the local configuration of Git. This patch updates the 'git push' command with extra arguments to be more resilient to any local configuration. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D79964	2020-06-02 21:25:29 +03:00
Jonas Devlieghere	5b460fb15e	[llvm-dwarfdump] Print [=<offset>] after --debug-* options in help output. Some of the --debug-* options can take an optional offset. Although the man page does a good job of making that clear, it's much harder to discover from the help output. Currently the only reference to this is the following sentence: > Where applicable these parameters take an optional =<offset> argument > to dump only the entry at the specified offset. This patch changes the help output to print [=<offset>] after the options that take an offset. --debug-info[=<offset>] - Dump the .debug_info section rdar://problem/63150066 Differential revision: https://reviews.llvm.org/D80959	2020-06-02 11:06:11 -07:00
Vedant Kumar	b429a0fef0	[docs] Sketch outline for HowToUpdateDebugInfo.rst Summary: Sketch the outline for a new document that explains how to update debug info in various kinds of code transformations. Some of the guidelines that belong in HowToUpdateDebugInfo.rst were in SourceLevelDebugging.rst already under the debugify section. It seems like the distinction between the two docs ought to be that the former is more prescriptive, while the latter is more descriptive. To that end I've consolidated the "how to update debug info" guidelines which were in SourceLevelDebugging.rst into the new doc, along with the information about using "debugify" to test transformations. Since we've added a mir-debugify pass, I've described that as well. Reviewers: aprantl, jmorse, chrisjackson, dsanders Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80052	2020-06-01 16:45:18 -07:00
Tony	7318e24000	[AMDGPU] Add loaded code object path URI definition to AMDGPUUsage Differential Revision: https://reviews.llvm.org/D80407	2020-05-29 19:52:52 -04:00
Sjoerd Meijer	7fb8a40e52	New intrinsic @llvm.get.active.lane.mask() This is split off from D79100 and: - adds an intrinsic description/definition for @llvm.get.active.lane.mask(), and - describes its semantics in LangRef. As described (in more detail) in its LangRef section, it is semantically equivalent to an icmp with the vector induction variable and the back-edge taken count, and generates a mask of active/inactive vector lanes. It will have several use cases. First, it will be used by the ExpandVectorPredication pass for the VP intrinsics, to expand VP intrinsics for scalable vectors on targets that do not support the `%evl` parameter, see D78203. Also, this is part of, and essential for our ARM MVE tail-predication story: - this intrinsic will be emitted by the LoopVectorizer in D79100, when the scalar epilogue is tail-folded into the vector body. This new intrinsic will generate the predicate for the masked loads/stores, and it takes the back-edge taken count as an argument. The back-edge taken count represents the number of elements processed by the loop, which we need to set up MVE tail-predication. - Emitting the intrinsic is controlled by a new TTI hook, see D80597. - We pick up this new intrinsic in an ARM MVETailPredication backend pass, see D79175, and convert it to an MVE target-specific intrinsic/instruction to create a tail-predicated loop. Differential Revision: https://reviews.llvm.org/D80596	2020-05-29 08:51:40 +01:00
Tony	b4668a268d	[AMDGPU] DWARF Proposal For Heterogeneous Debugging - Add introduction to DWARF Proposal For Heterogeneous Debugging. Differential Revision: https://reviews.llvm.org/D70523	2020-05-28 20:36:21 -04:00
Thomas Preud'homme	23ac16cf9b	FileCheck [10/12]: Add support for signed numeric values Summary: This patch is part of a patch series to add support for FileCheck numeric expressions. This specific patch adds support for signed numeric values, thus allowing negative numeric values. As such, the patch adds a new class to represent a signed or unsigned value and adds the logic for type promotion and type conversion in numeric expressions mixing signed and unsigned values. It also adds the %d format specifier to represent signed values. Finally, it also adds underflow and overflow detection when performing a binary operation. Copyright: - Linaro (changes up to diff 183612 of revision D55940) - GraphCore (changes in later versions of revision D55940 and in new revision created off D55940) Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson Reviewed By: jhenderson, arichardson Subscribers: MaskRay, hiraditya, llvm-commits, probinson, dblaikie, grimar, arichardson, kristina, hfinkel, rogfer01, JonChesterfield Tags: #llvm Differential Revision: https://reviews.llvm.org/D60390	2020-05-28 10:44:21 +01:00
Sjoerd Meijer	880c35a554	[HardwareLoops] LangRef Intrinsic descriptions The HardwareLoop intrinsics were missing and not described in LangRef. This adds these descriptions/definitions. Differential Revision: https://reviews.llvm.org/D80316	2020-05-28 08:36:04 +01:00
Sourabh Singh Tomar	c1d5b831b1	[docs] Release notes for DIModule metadata Updated the release notes for the changes in the DIModule metadata. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D80614	2020-05-28 10:17:40 +05:30
Alex Richardson	3be5e53f20	[FileCheck] Allow parenthesized expressions With this change it is possible to write FileCheck expressions such as [[#(VAR+1)-2]]. Currently, the only supported arithmetic operators are plus and minus, so this is not particularly useful yet. However, in our CHERI fork we have tests that benefit from having multiplication in FileCheck expressions. Allowing parenthesized expressions is the simplest way for us to work around the current lack of operator precedence in FileCheck expressions. Reviewed By: thopre, jhenderson Differential Revision: https://reviews.llvm.org/D77383	2020-05-27 16:31:39 +01:00
Matt Arsenault	8e3307f551	GlobalISel: Add a clarification to G_STORE documentation Mirror the note on G_LOAD. We probably do need to add an explicit G_TRUNCSTORE opcode for the vector case, although I do not have a use for it.	2020-05-26 21:20:30 -04:00
Alexander Shaposhnikov	842a8cc10c	[llvm-objcopy][MachO] Add support for removing Swift symbols cctools strip has the option "-T" which removes Swift symbols. This diff implements this option in llvm-strip for MachO. Test plan: make check-all Differential revision: https://reviews.llvm.org/D80099	2020-05-26 16:49:56 -07:00
Arthur Eubanks	9a0b0855a9	Modify verifier checks to support musttail + preallocated Summary: preallocated and musttail can work together, but we don't want to call @llvm.call.preallocated.setup() to modify the stack in musttail calls. So we shouldn't have the "preallocated" operand bundle when a preallocated call is musttail. Also disallow use of preallocated on calls without preallocated. Codegen not yet implemented. Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80581	2020-05-26 15:20:20 -07:00
Vedant Kumar	6e39379bbb	[DwarfExpression] Support entry values for indirect parameters Summary: A struct argument can be passed-by-value to a callee via a pointer to a temporary stack copy. Add support for emitting an entry value DBG_VALUE when an indirect parameter DBG_VALUE becomes unavailable. This is done by omitting DW_OP_stack_value from the entry value expression, to make the expression describe the location of an object. rdar://63373691 Reviewers: djtodoro, aprantl, dstenb Subscribers: hiraditya, lldb-commits, llvm-commits Tags: #lldb, #llvm Differential Revision: https://reviews.llvm.org/D80345	2020-05-26 14:22:28 -07:00
Stefanos Baziotis	ef94f60ff7	[MSSA][Doc] Fix typo	2020-05-26 22:16:13 +03:00
Stefanos Baziotis	2c7d63257d	[MSSA][Doc] Clobbers, more info on Defs / Def chain - Added more info about what we refer to as a clobber in MSSA. - Added more info about MemoryDefs and how there is a single Def chain. - The doc portrayed MSSA as modeling the heap while it is modeling the whole memory, so I changed the wording to not be heap-specific. Differential Revision: https://reviews.llvm.org/D80000	2020-05-26 20:43:17 +03:00
Matt Arsenault	8bc03d2168	GlobalISel: Merge G_PTR_MASK with llvm.ptrmask intrinsic Confusingly, these were unrelated and had different semantics. The G_PTR_MASK instruction predates the llvm.ptrmask intrinsic, but has a different format. G_PTR_MASK only allows clearing the low bits of a pointer, and only a constant number of bits. The ptrmask intrinsic allows an arbitrary mask. Replace G_PTR_MASK to match the intrinsic. Only selects the cases that look like the old instruction. More work is needed to select the general case. Also new legalization code is still needed to deal with the case where the incoming mask size does not match the pointer size, which has a specified behavior in the langref.	2020-05-26 11:48:13 -04:00
Serge Pavlov	4d20e31f73	[FPEnv] Intrinsic llvm.roundeven This intrinsic implements IEEE-754 operation roundToIntegralTiesToEven, and performs rounding to the nearest integer value, rounding halfway cases to even. The intrinsic represents the missed case of IEEE-754 rounding operations and now llvm provides full support of the rounding operations defined by the standard. Differential Revision: https://reviews.llvm.org/D75670	2020-05-26 19:24:58 +07:00
Nico Weber	5229dd1366	[build] Add LLVM_LOCAL_RPATH which can set an rpath on just unit test binaries After D80096, bots that build clang for distribution and that can't use system gcc / libstdc++ need to pass a working rpath so that unit test binaries can run. The method suggested in GettingStarted.rst works fine for local development, but it results in an absolute local rpath ending up even in distributed binaries like clang, which is both ugly and unnecessary. Add an explicit toggle that can be used to add an rpath only for the non-distributed binaries that need it. Differential Revision: https://reviews.llvm.org/D80534	2020-05-26 06:23:57 -04:00
Serge Pavlov	61f72dd8ac	[FPEnv] Small fixes to implementation of flt.rounds This change makes minor correction to the implementation of intrinsic `llvm.flt.rounds`: - Added documentation entry in LangRef, - Attributes of the intrinsic changed to be in line with other functions dependent of floating-point environment. Differential Revision: https://reviews.llvm.org/D79322	2020-05-26 13:19:01 +07:00
Dmitry Preobrazhensky	b087b91c91	[AMDGPU][CODEGEN] Added 'A' constraint for inline assembler Summary: 'A' constraint requires an immediate int or fp constant that can be inlined in an instruction encoding. Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D78494	2020-05-25 14:23:34 +03:00
Michal Paszkowski	335de55fa3	Revert "Added a new IRCanonicalizer pass." This reverts commit `14d358537f`.	2020-05-23 13:51:43 +02:00
Michal Paszkowski	14d358537f	Added a new IRCanonicalizer pass. Summary: Added a new IRCanonicalizer pass which aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semantics. The canonicalizer makes it easier to spot semantic differences when diffing two modules which have undergone different passes. Presentation: https://www.youtube.com/watch?v=c9WMijSOEUg Reviewed by: plotfi Differential Revision: https://reviews.llvm.org/D66029	2020-05-23 12:45:53 +02:00
Tony	8a9f09df42	[AMDGPU] DWARF Proposal For Heterogeneous Debugging - Change title to "DWARF Proposal For Heterogeneous Debugging".	2020-05-22 22:29:57 -04:00
Tony	1b58cbad01	[AMDGPU] DWARF For Heterogeneous Debugging - Change title to "DWARF For Heterogeneous Debugging". - Add "Examples" section that references the AMDGPUUsage DWARF section. - Make the "References" section a top level section. Differential Revision: https://reviews.llvm.org/D70523	2020-05-22 22:14:20 -04:00
Jinsong Ji	9b7fba1421	[docs][llvm-extract] Add missing alias/bb options llvm-extract got several new options, but we forgot to update the doc. This patch updates the doc. Reviewed By: volkan Differential Revision: https://reviews.llvm.org/D80413	2020-05-22 03:52:07 +00:00
Tony	e36be90c82	[AMDGPU] Correct formatting typos in documentation Summary: - Correct missing space in some "note" and "TODO" directives in AMDGPUUsage.rst - Correct warning for heading underline being too short in BitCodeFormat.rst Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80407	2020-05-21 20:36:46 -04:00
Jinsong Ji	628f008b20	[docs] Fix buildbot failures Buildbot has been failing since http://lab.llvm.org:8011/builders/llvm-sphinx-docs/builds/44711 This patch fixes the minor issues that cause warnings.	2020-05-21 22:07:33 +00:00
Jonas Devlieghere	92fd3971e0	[dsymutil] Add reproducers to dsymutil Add support for generating a dsymutil reproducer. The result is a folder containing all the object files for linking. When --gen-reproducer is passed, dsymutil uses a FileCollectorFileSystem which keeps track of all the files used by dsymutil. These files are copied into a temporary directory when dsymutil exits. When this path is passed to --use-reproducer, dsymutil uses a RedirectingFileSystem that will use the files from the reproducer directory instead of the actual paths. This means you don't need to mess with the OSO path prefix. Differential revision: https://reviews.llvm.org/D79398	2020-05-21 10:59:49 -07:00
Eli Friedman	f26bdb539e	Make Value::getPointerAlignment() return an Align, not a MaybeAlign. If we don't know anything about the alignment of a pointer, Align(1) is still correct: all pointers are at least 1-byte aligned. Included in this patch is a bugfix for an issue discovered during this cleanup: pointers with "dereferenceable" attributes/metadata were assumed to be aligned according to the type of the pointer. This wasn't intentional, as far as I can tell, so Loads.cpp was fixed to stop making this assumption. Frontends may need to be updated. I updated clang's handling of C++ references, and added a release note for this. Differential Revision: https://reviews.llvm.org/D80072	2020-05-20 16:37:20 -07:00
Zola Bridges	b2d733c350	[llvm][docs] Add step by step git to GettingStarted Summary: Due to deleting the git llvm script, folks were asking for better documentation about how to use git in order to commit to the Github repo. I added some step by step git commands to make the usage clearer. Context link: http://lists.llvm.org/pipermail/llvm-dev/2020-May/141640.html Reviewed By: spatel, mehdi_amini Differential Revision: https://reviews.llvm.org/D80088	2020-05-19 12:14:17 -07:00
Jonas Devlieghere	b7924d6525	[dsymutil] Make sure the --help output and man page are consistent As suggested by Adrian in D79398.	2020-05-18 11:38:36 -07:00
Christudasan Devadasan	7c4e711ef8	[AMDGPU] Enable base pointer. When the callee requires a dynamic stack realignment, it is not possible to correctly access the incoming stack arguments using the stack pointer. We reserve a base pointer in such cases to access the function arguments inside the callee. The base pointer will hold the incoming stack pointer value before any kind of delta is added to it. Reviewed By: arsenm, scott.linder Differential Revision: https://reviews.llvm.org/D78811	2020-05-17 16:13:55 +05:30
Nikita Popov	f89f7da999	[IR] Convert null-pointer-is-valid into an enum attribute The "null-pointer-is-valid" attribute needs to be checked by many pointer-related combines. To make the check more efficient, convert it from a string into an enum attribute. In the future, this attribute may be replaced with data layout properties. Differential Revision: https://reviews.llvm.org/D78862	2020-05-15 19:41:07 +02:00
Ties Stuij	8c24f33158	[IR][BFloat] Add BFloat IR type Summary: The BFloat IR type is introduced to provide support for, initially, the BFloat16 datatype introduced with the Armv8.6 architecture (optional from Armv8.2 onwards). It has an 8-bit exponent and a 7-bit mantissa and behaves like an IEEE 754 floating point IR type. This is part of a patch series upstreaming Armv8.6 features. Subsequent patches will upstream intrinsics support and C-lang support for BFloat. Reviewers: SjoerdMeijer, rjmccall, rsmith, liutianle, RKSimon, craig.topper, jfb, LukeGeeson, sdesmalen, deadalnix, ctetreau Subscribers: hiraditya, llvm-commits, danielkiss, arphaman, kristof.beyls, dexonsmith Tags: #llvm Differential Revision: https://reviews.llvm.org/D78190	2020-05-15 14:43:43 +01:00
Alok Kumar Sharma	4042ada1c1	[DebugInfo] support for DW_AT_data_location in llvm This patch adds support for DWARF attribute DW_AT_data_location. Summary: Dynamic arrays in Fortran are described by an array descriptor and a data allocation address. The former is mapped to DW_AT_location and the latter is mapped to DW_AT_data_location. Testing: unit test cases added (hand-written) check llvm check debug-info Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D79592	2020-05-15 11:33:17 +05:30
Alok Kumar Sharma	ab699d78a2	[DebugInfo] llvm rejects DWARF operator DW_OP_push_object_address llvm rejects DWARF operator DW_OP_push_object_address. This DWARF operator is needed for Flang to support allocatable arrays. Summary: Currently llvm rejects DWARF operator DW_OP_push_object_address. The error below is produced when llvm finds this operator. [..] invalid expression !DIExpression(151) warning: ignoring invalid debug info in pushobj.ll [..] Some parts are missing in the support of this operator and need to be completed. Testing -added a unit testcase -check-debuginfo -check-llvm Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D79306	2020-05-15 11:10:35 +05:30
Wei Mi	67bb16049a	[llvm-profdata] Update CommandGuide Add a bunch of SampleFDO related flags added recently into llvm-profdata to its command guide. Differential Revision: https://reviews.llvm.org/D79911	2020-05-14 13:59:42 -07:00
Mircea Trofin	ee33ee68fe	[docs] Add link to zorg github project Reviewers: gkistanova Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79891	2020-05-13 13:41:16 -07:00
Joel E. Denny	a1fd188223	[FileCheck] Support comment directives Sometimes you want to disable a FileCheck directive without removing it entirely, or you want to write comments that mention a directive by name. The `COM:` directive makes it easy to do this. For example, you might have: ``` ; X32: pinsrd_1: ; X32: pinsrd $1, 4(%esp), %xmm0 ; COM: FIXME: X64 isn't working correctly yet for this part of codegen, but ; COM: X64 will have something similar to X32: ; COM: ; COM: X64: pinsrd_1: ; COM: X64: pinsrd $1, %edi, %xmm0 ``` Without this patch, you need to use some combination of rewording and directive syntax mangling to prevent FileCheck from recognizing the commented occurrences of `X32:` and `X64:` above as directives. Moreover, FileCheck diagnostics have been proposed that might complain about the occurrences of `X64` that don't have the trailing `:` because they look like directive typos: <http://lists.llvm.org/pipermail/llvm-dev/2020-April/140610.html> I think dodging all these problems can prove tedious for test authors, and directive syntax mangling already makes the purpose of existing test code unclear. `COM:` can avoid all these problems. This patch also updates the small set of existing tests that define `COM` as a check prefix: - clang/test/CodeGen/default-address-space.c - clang/test/CodeGenOpenCL/addr-space-struct-arg.cl - clang/test/Driver/hip-device-libs.hip - llvm/test/Assembler/drop-debug-info-nonzero-alloca.ll I think lit should support `COM:` as well. Perhaps `clang -verify` should too. Reviewed By: jhenderson, thopre Differential Revision: https://reviews.llvm.org/D79276	2020-05-13 11:29:48 -04:00
Zequan Wu	cb22ab7403	Add nomerge function attribute to suppress tail merge optimization in simplifyCFG We want to add a way to avoid merging identical calls so as to keep the separate debug-information for those calls. There is also an asan usecase where having this attribute would be beneficial to avoid alternative work-arounds. Here is the link to the feature request: https://bugs.llvm.org/show_bug.cgi?id=42783. `nomerge` is different from `noinline`. `noinline` prevents a function from being inlined at callsites, but `nomerge` prevents multiple identical calls from being merged into one. This patch adds `nomerge` to disable the optimization at the IR level. A followup patch will be needed to let the backend understand `nomerge` and avoid tail merging in the backend. Reviewed By: asbirlea, rnk Differential Revision: https://reviews.llvm.org/D78659	2020-05-12 16:49:20 -07:00
Michael Kruse	5c707fd97c	[docs] Corrected inaccuracies in Common Problems section. Changed the language in LLVM_USE_LINKER to more strongly recommend LLD and to specify that the GNU gold linker is only useful if LLD is unavailable in binary form and it is the first build of LLVM. Added that LLD will help when used on ELF-based platforms. Corrected information in CMAKE_BUILD_TYPE regarding the Release build type and enabling assertions. Added option LLVM_ENABLE_ASSERTIONS and mentioned enabling this option with a Release build as an alternative to using a Debug build. Specified that the LLVM_OPTIMIZED_TABLEGEN option is only for Debug builds, that the LLVM_USE_SPLIT_DWARF option is only available on ELF host platforms, and that setting CLANG_ENABLE_STATIC_ANALYZER to OFF only slightly improves build time. These changes address comments made in D75425. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D77346	2020-05-12 10:09:37 -05:00
Joel E. Denny	d0e7fd6b62	Revert "[FileCheck] Support comment directives" This reverts commit `9a9a5f9893` to try to fix a bot: http://lab.llvm.org:8011/builders/llvm-clang-x86_64-expensive-checks-win/builds/23489	2020-05-11 19:41:22 -04:00
Joel E. Denny	9a9a5f9893	[FileCheck] Support comment directives Sometimes you want to disable a FileCheck directive without removing it entirely, or you want to write comments that mention a directive by name. The `COM:` directive makes it easy to do this. For example, you might have: ``` ; X32: pinsrd_1: ; X32: pinsrd $1, 4(%esp), %xmm0 ; COM: FIXME: X64 isn't working correctly yet for this part of codegen, but ; COM: X64 will have something similar to X32: ; COM: ; COM: X64: pinsrd_1: ; COM: X64: pinsrd $1, %edi, %xmm0 ``` Without this patch, you need to use some combination of rewording and directive syntax mangling to prevent FileCheck from recognizing the commented occurrences of `X32:` and `X64:` above as directives. Moreover, FileCheck diagnostics have been proposed that might complain about the occurrences of `X64` that don't have the trailing `:` because they look like directive typos: <http://lists.llvm.org/pipermail/llvm-dev/2020-April/140610.html> I think dodging all these problems can prove tedious for test authors, and directive syntax mangling already makes the purpose of existing test code unclear. `COM:` can avoid all these problems. This patch also updates the small set of existing tests that define `COM` as a check prefix: - clang/test/CodeGen/default-address-space.c - clang/test/CodeGenOpenCL/addr-space-struct-arg.cl - clang/test/Driver/hip-device-libs.hip - llvm/test/Assembler/drop-debug-info-nonzero-alloca.ll I think lit should support `COM:` as well. Perhaps `clang -verify` should too. Reviewed By: jhenderson, thopre Differential Revision: https://reviews.llvm.org/D79276	2020-05-11 14:53:48 -04:00
Matthias Schiffer	a2247d42e4	[LangRef] Describe linkage types, allocation size of declarations for global variables Linkage type was only referenced for functions, not for global variables. Clarify that LLVM doesn't make assumption about the allocation size when no definitive initializer for a global variable is known. Differential Revision: https://reviews.llvm.org/D78952	2020-05-08 16:21:30 -07:00
Jonas Devlieghere	7fb9bcd3da	[dsymutil] Add option to print statistics about the .debug_info size. This patch adds statistics about the contribution of each object file to the linked debug info. When --statistics is passed to dsymutil, it prints a table after linking as illustrated below. It lists the object file name, the size of the debug info in the object file in bytes, and the absolute size contribution to the linked dSYM and the percentage difference. The table is sorted by the output size, so the object files contributing the most to the link are listed first. .debug_info section size (in bytes) ------------------------------------------------------------------------------- Filename Object dSYM Change ------------------------------------------------------------------------------- basic2.macho.x86_64.o 210b 165b -24.00% basic3.macho.x86_64.o 177b 150b -16.51% basic1.macho.x86_64.o 125b 129b 3.15% ------------------------------------------------------------------------------- Total 512b 444b -14.23% ------------------------------------------------------------------------------- Differential revision: https://reviews.llvm.org/D79513	2020-05-06 19:48:45 -07:00
Stanislav Mekhanoshin	b856ff9782	[AMDGPU] Added 'a' constraint documentation. NFC. AGPR inline asm constraint was missing from the LangRef.rst.	2020-05-05 13:52:04 -07:00
Sanjay Patel	a954b8a363	[ValueTracking] fix CannotBeNegativeZero() to disregard 'nsz' FMF The 'nsz' flag is different than 'nnan' or 'ninf' in that it does not create poison. Make that explicit in the LangRef and fix ValueTracking analysis that misinterpreted the definition. This manifests as bugs in InstSimplify shown in the test diffs and as discussed in PR45778: https://bugs.llvm.org/show_bug.cgi?id=45778 Differential Revision: https://reviews.llvm.org/D79422	2020-05-05 16:04:59 -04:00
Christudasan Devadasan	375cec4b6c	[AMDGPU] Introduce more scratch registers in the ABI. The AMDGPU target has a convention that defines all VGPRs (except the initial 32 argument registers) as callee-saved. This convention is not always efficient, especially when a callee that requires more registers ends up emitting a large number of spills even though its caller requires only a few. This patch revises the ABI by introducing more scratch registers that a callee can freely use. The 256 vgpr registers now become: 32 argument registers 112 scratch registers and 112 callee saved registers. The scratch registers and the CSRs are intermixed at regular intervals (a split boundary of 8) to obtain a better occupancy. Reviewers: arsenm, t-tye, rampitec, b-sumner, mjbedy, tpr Reviewed By: arsenm, t-tye Differential Revision: https://reviews.llvm.org/D76356	2020-05-05 23:02:58 +05:30
James Henderson	5beb9fa4ab	[docs][llvm-objcopy] Update --output-target text with right defaults The --output-target documentation has slightly rotted, as the default is no longer purely based on the input file format, but also the value of --input-target. This patch updates the documentation to make this explicit. Reviewed by: MaskRay, alexshap Differential Revision: https://reviews.llvm.org/D79318	2020-05-05 11:22:56 +01:00
Djordje Todorovic	0a4defe8c8	[llvm-dwarfdump][Stats] Clean up This addresses: -Clean up the source code -Refactor the JSON fields -Fix the test cases -Improve the docs for the stats output Differential Revision: https://reviews.llvm.org/D77789	2020-05-04 09:35:40 +02:00
Thomas Preud'homme	0b85ea8533	[docs][FileCheck] Fix invalid example Summary: FileCheck documentation contains an example of a numeric variable defined and used on the same line. This is not currently supported by FileCheck so this commit fixes the example to use CHECK-SAME for the variable use. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D79253	2020-05-02 23:31:18 +01:00
James Henderson	91257fdb21	[docs][llvm-cxxfilt] Document --no-strip-underscore option This option was added several months ago in commit `e84468c1`. Reviewed by: MaskRay, erik.pilkington, steven_wu Differential Revision: https://reviews.llvm.org/D79166	2020-05-01 11:03:06 +01:00
Scott Linder	084f3cf92b	[AMDGPU] Update DWARF proposal encodings Update the tentative encodings to avoid a conflict with a GNU extension. Differential Revision: https://reviews.llvm.org/D70523	2020-04-30 14:02:54 -04:00
James Henderson	027eb25121	[docs][llvm-cxxfilt] Fix indentation in rst file This makes it consistent throughout the options, although the end result is unchanged.	2020-04-30 10:41:45 +01:00
Tony	756ba3548c	[AMDGPU] DWARF proposal review feedback - Rename DW_OP_LLVM_offset_constu to DW_OP_LLVM_offset_uconst to match DW_OP_plus_uconst. - Correct DW_OP_LLVM_call_ref to be DW_OP_call_ref. - Move proposed changes to a separate section to clarify that the introduction section is not part of the changes. - Fix formatting typos and add missing reference. - Clarify why DW_OP_LLVM_offset et al do not wrap on overflow. - Correct syntax of augmentation string. Differential Revision: https://reviews.llvm.org/D70523	2020-04-28 00:56:25 -04:00
Arthur Eubanks	3b0450acec	Add IR constructs for preallocated (inalloca replacement) Add llvm.call.preallocated.{setup,arg} intrinsics. Add "preallocated" operand bundle which takes a token produced by llvm.call.preallocated.setup. Add "preallocated" parameter attribute, which is like byval but without the copy. Verifier changes for these IR constructs. See https://github.com/rnk/llvm-project/blob/call-setup-docs/llvm/docs/CallSetup.md Subscribers: hiraditya, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74651	2020-04-27 16:15:50 -07:00
Sergei Trofimovich	41eb0fc00d	[Lexicon] fix typo "may is" -> "is" Reviewers: MaskRay Reviewed By: MaskRay Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78878	2020-04-26 19:35:25 +01:00
Jon Roelofs	42bf0756d4	[docs] Fix :option: links	2020-04-25 16:19:02 -06:00
James Y Knight	fb8152dcfe	[CallSite removal] Remove the text describing CallSite from the manual.	2020-04-23 22:17:19 -04:00
James Y Knight	248a5db3f2	Change callbr to only define its output SSA variable on the normal path, not the indirect targets. Fixes: PR45565. Differential Revision: https://reviews.llvm.org/D78341	2020-04-23 19:36:44 -04:00
Xing GUO	12224162a1	[dsymutil][doc] Improve documentation. This change helps improve `dsymutil` documentation. - Add missing options - Re-arrange options in alphabetical order - Wrap inline options in double-back-quote - `-v` is for `--version` not `--verbose` Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D78479	2020-04-23 20:06:52 +08:00
Kazuaki Ishizaki	0312b9f550	[llvm] NFC: Fix trivial typo in rst and td files Differential Revision: https://reviews.llvm.org/D77469	2020-04-23 14:26:32 +09:00
Jon Roelofs	dc5c1fa882	[docs] Fix :option: links	2020-04-22 14:00:30 -06:00
Jon Roelofs	b3f168274d	[docs] Document lit's --timeout=N flag	2020-04-22 12:57:25 -06:00
Mikhail Maltsev	089fbe6919	[Docs] Fixed formatting in release notes, NFC	2020-04-22 18:25:22 +01:00
Mikhail Maltsev	d7ab9e7c9b	[ARM] Release notes for the Custom Datapath Extension (CDE) Summary: This change mentions CDE assembly in the LLVM release notes and CDE intrinsics in both Clang and LLVM release notes. Reviewers: kristof.beyls, simon_tatham Reviewed By: kristof.beyls Subscribers: danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D78481	2020-04-22 16:34:19 +01:00
Zola Bridges	0f12480bd1	[dfsan] Add "DataFlow" option to LLVM_USE_SANITIZER Summary: This patch adds the dataflow option to LLVM_USE_SANITIZER and documents it. Tested via check-cxx (wip to fix the errors). Reviewers: morehouse, #libc! Subscribers: mgorny, cfe-commits, libcxx-commits Tags: #clang, #libc Differential Revision: https://reviews.llvm.org/D78390	2020-04-20 10:30:52 -07:00
Tyker	ff9379f4b2	[NFC] Remove waymarking because it improves performances Summary: This patch remove waymarking and replaces it with storing a pointer to the User in the Use. here are the results on the measurements for the CTMark tests of the test suite. ``` Metric: instructions_count Program baseline patched diff test-suite :: CTMark/ClamAV/clamscan.test 72557942065 71733653521 -1.1% test-suite :: CTMark/sqlite3/sqlite3.test 76281422939 75484840636 -1.0% test-suite :: CTMark/consumer-typeset/consumer-typeset.test 51364676366 50862185614 -1.0% test-suite :: CTMark/SPASS/SPASS.test 60476106505 59908437767 -0.9% test-suite :: CTMark/tramp3d-v4/tramp3d-v4.test 112578442329 111725050856 -0.8% test-suite :: CTMark/mafft/pairlocalalign.test 50846133013 50473644539 -0.7% test-suite :: CTMark/kimwitu++/kc.test 54692641250 54349070299 -0.6% test-suite :: CTMark/7zip/7zip-benchmark.test 182216614747 181216091230 -0.5% test-suite :: CTMark/Bullet/bullet.test 123459210616 122905866767 -0.4% Geomean difference -0.8% Metric: peak_memory_use Program baseline patched diff test-suite :: CTMark/tramp3d-v4/tramp3d-v4.test 326864 338524 3.6% test-suite :: CTMark/sqlite3/sqlite3.test 216412 221240 2.2% test-suite :: CTMark/7zip/7zip-benchmark.test 11808284 12022604 1.8% test-suite :: CTMark/Bullet/bullet.test 6831752 6945988 1.7% test-suite :: CTMark/SPASS/SPASS.test 2682552 2721820 1.5% test-suite :: CTMark/ClamAV/clamscan.test 5037256 5107936 1.4% test-suite :: CTMark/consumer-typeset/consumer-typeset.test 2752728 2790768 1.4% test-suite :: CTMark/mafft/pairlocalalign.test 1517676 1537244 1.3% test-suite :: CTMark/kimwitu++/kc.test 1090748 1103448 1.2% Geomean difference 1.8% Metric: compile_time Program baseline patched diff test-suite :: CTMark/consumer-typeset/consumer-typeset.test 14.71 14.38 -2.2% test-suite :: CTMark/sqlite3/sqlite3.test 23.18 22.73 -2.0% test-suite :: CTMark/7zip/7zip-benchmark.test 57.96 56.99 -1.7% test-suite :: CTMark/ClamAV/clamscan.test 20.75 20.49 -1.2% 
test-suite :: CTMark/kimwitu++/kc.test 18.35 18.15 -1.1% test-suite :: CTMark/SPASS/SPASS.test 18.72 18.57 -0.8% test-suite :: CTMark/mafft/pairlocalalign.test 14.09 14.00 -0.6% test-suite :: CTMark/Bullet/bullet.test 37.38 37.19 -0.5% test-suite :: CTMark/tramp3d-v4/tramp3d-v4.test 33.81 33.76 -0.2% Geomean difference -1.1% ``` i believe that it is worth trading +1.8% peak memory use for -1.1% compile time. also this patch removes waymarking which simplifies the Use and User classes. Reviewers: nikic, lattner Reviewed By: lattner Subscribers: russell.gallop, foad, ggreif, rriddle, ekatz, fhahn, lebedev.ri, mgorny, hiraditya, george.burgess.iv, asbirlea, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77144	2020-04-17 11:27:10 +02:00
Richard Smith	9a709dd2bb	llvm-addr2line: assume addresses on the command line are hexadecimal rather than attempting to guess the base based on the form of the number. Summary: This matches the behavior of GNU addr2line. We previously treated hexadecimal addresses as binary if they started with 0b, otherwise as octal if they started with 0, otherwise as decimal. This only affects llvm-addr2line; the behavior of llvm-symbolize is unaffected. Reviewers: ikudrin, rupprecht, jhenderson Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73306	2020-04-16 16:16:21 -07:00
Lang Hames	a9ade27a57	[docs] Fix an RST error introduced in `e823068306`. This should fix the 'Explicit markup ends without a blank line' error seen on http://lab.llvm.org:8011/builders/llvm-sphinx-docs. Thanks to Daniel Sanders for spotting this.	2020-04-15 14:37:58 -07:00
Tony	1eac2c55d8	[AMDGPU] Move DWARF proposal to separate file - Move DWARF proposal for heterogeneous debugging to a separate file. - Add references. Differential Revision: https://reviews.llvm.org/D70523	2020-04-15 17:19:39 -04:00
Craig Topper	8dfb9627b7	[X86] Make v32i16/v64i8 legal types without avx512bw. Use custom splitting instead. This moves v32i16/v64i8 to a model consistent with how we treat integer types with avx1. This does change the ABI for types vXi16/vXi8 vectors larger than 512 bits to pass in multiple zmms instead of multiple ymms. We'd already hacked some code to make v64i8/v32i16 pass in zmm. Cost model is still a bit of a mess. In some places I tried to match existing behavior. But really we need to account for splitting and concating costs. Cost model for shuffles is especially pessimistic. Differential Revision: https://reviews.llvm.org/D76212	2020-04-15 12:17:18 -07:00
Tony	b436124010	[AMDGPU] Update DWARF proposal - Unify the sections on DWARF expression and location lists. - Allow a location description to have one or more single location descriptions. - Define context of DWARF expression that includes an initial stack. Allow initial stack to be used when evaluating location list expression with overlapping PC ranges. - Reorganize the DWARF proposal in AMDGPUUsage so suitable for submission to the DWARF site. - Replace CFI instruction DW_CFA_LLVM_def_cfa_aspace with DW_CFA_def_aspace_cfa and DW_CFA_def_aspace_cfa_sf. This is to avoid the problem that DW_CFA_def_cfa and DW_CFA_def_cfa_sf cannot use a register that is not the size of an address in the CFA address space. - Clarify DWARF address class and DWARF address space. Define language values for DWARF address classes and specify how they are used by some common source languages. - Define rules for accessing registers and dereferencing memory when the type size and register size or byte size operand do not match. - Numerous cleanups for consistency. Differential Revision: https://reviews.llvm.org/D70523	2020-04-14 20:05:15 -04:00
Lang Hames	840a23b0b5	[ORC] Update ORCv2 docs to reflect removal of ExecutionSession::getMainJITDylib. Thanks to Dibyendu Majumdar for spotting the issue.	2020-04-13 12:52:44 -07:00
Lang Hames	e823068306	[Support] Add RTTI support for open class hierarchies. This patch extracts the RTTI part of llvm::ErrorInfo into its own class (RTTIExtends) so that it can be used in other non-error hierarchies, and makes it compatible with the existing LLVM RTTI function templates (isa, cast, dyn_cast, dyn_cast_or_null) by adding the classof method. Differential Revision: https://reviews.llvm.org/D39111	2020-04-13 12:52:44 -07:00
Benjamin Kramer	ebd5290ff2	Address sphinx warnings LanguageExtensions.rst:2191: WARNING: Title underline too short. llvm-symbolizer.rst:157: Error in "code-block" directive: maximum 1 argument(s) allowed, 30 supplied.	2020-04-13 14:41:55 +02:00
SCOTT-HAMILTON	4d62c34402	Typos correction.	2020-04-13 13:46:18 +02:00
Nico Weber	0fffece463	fix some doc typos to cycle bots	2020-04-13 06:28:59 -04:00
Stefanos Baziotis	72ffeb2d38	[LoopTerminology] LCSSA: Fix typo in code sample	2020-04-12 04:40:55 +03:00
Djordje Todorovic	3505226702	[docs][llvm-dwarfdump] Add the release notes about --show-section-sizes Note that the llvm-dwarfdump has the new option. Differential Revision: https://reviews.llvm.org/D77495	2020-04-10 10:35:18 +02:00
Qiu Chaofan	68460148d5	[Docs] Add more FP option description for llc This patch adds missing description of enable-no-signed-zeros-fp-math and enable-no-trapping-fp-math options of llc. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D77713	2020-04-09 17:13:01 +08:00
Serge Pavlov	c7ff5b38f2	[FPEnv] Use single enum to represent rounding mode The compiler currently defines 5 sets of constants to represent rounding mode. These are: 1. `llvm::APFloatBase::roundingMode`. It specifies all 5 rounding modes defined by IEEE-754 and is used in `APFloat` implementation. 2. `clang::LangOptions::FPRoundingModeKind`. It specifies 4 of 5 IEEE-754 rounding modes and a special value for dynamic rounding mode. It is used in clang frontend. 3. `llvm::fp::RoundingMode`. Defines the same values as `clang::LangOptions::FPRoundingModeKind` but in different order. It is used to specify rounding mode in IR and functions that operate on IR. 4. Rounding mode representation used by `FLT_ROUNDS` (C11, 5.2.4.2.2p7). Besides constants for rounding mode it also uses a special value to indicate error. It is convenient to use in intrinsic functions, as it represents platform-independent representation for rounding mode. In this role it is used in some pending patches. 5. Values like `FE_DOWNWARD` and others, which specify rounding mode in library calls `fesetround` and `fegetround`. Often they represent bits of some control register, so they are target-dependent. The same names (not values) and a special name `FE_DYNAMIC` are used in `#pragma STDC FENV_ROUND`. The first 4 sets of constants are target independent and could have the same numerical representation. It would simplify conversion between the representations. Also now `clang::LangOptions::FPRoundingModeKind` and `llvm::fp::RoundingMode` do not contain the value for IEEE-754 rounding direction `roundTiesToAway`, although it is supported natively on some targets. This change defines all the rounding mode types via one `llvm::RoundingMode`, which also contains rounding mode for IEEE rounding direction `roundTiesToAway`. Differential Revision: https://reviews.llvm.org/D77379	2020-04-09 13:26:47 +07:00
Sanjay Patel	5c472420b6	[LangRef] update text for shufflevector D72467 updated the shufflevector instruction to include a constant mask rather than a mask operand. The LangRef text was vague enough to still make sense, but it is better to update here too, so there's no confusion about valid mask values. The text here is adapted from the documentation code comments for "class ShuffleVectorInst". Differential Revision: https://reviews.llvm.org/D77396	2020-04-08 09:01:01 -04:00
Djordje Todorovic	3a4d9f8335	[docs] Add the release notes about Debug Entry Values Note that x86, arm and aarch64 targets support the Debug Entry Values feature by default. Differential Revision: https://reviews.llvm.org/D77494	2020-04-07 12:08:22 +02:00
Louis Dionne	8a42bf24ae	[lit] Move the recursiveExpansionLimit setting to TestingConfig The LitConfig is shared across the whole test suite. However, since enabling recursive expansion can be a breaking change for some test suites, it's important to confine the setting to test suites that enable it explicitly. Note that other issues were raised with the way recursiveExpansionLimit operates. However, this commit simply moves the setting to the right place -- the mechanism by which it works can be improved independently. Differential Revision: https://reviews.llvm.org/D77415	2020-04-06 13:58:00 -04:00
diggerlin	a26a441b99	[llvm-objdump][XCOFF] Use symbol index+symbol name + storage mapping class as label for -D SUMMARY: For the llvm-objdump -D, the symbol name is used as a label in the disassembly for the specific address (when a symbol address is equal to the virtual address in the dump). In XCOFF, multiple symbols may have the same name, being differentiated by their storage mapping class. It is helpful to print the QualName and not just the name when forming the output label for a csect symbol. The symbol index further removes any ambiguity caused by duplicate names. To maintain compatibility with the binutils objdump, the XCOFF-specific --symbol-description option is added to enable the enhanced format. Reviewers: hubert.reinterpretcast, James Henderson, Jason Liu ,daltenty Subscribers: wuzish, nemanjai, hiraditya Differential Revision: https://reviews.llvm.org/D72973	2020-04-06 10:10:10 -04:00
vgxbj	948ef5b1a6	[llvm-objdump] Teach `llvm-objdump` dump dynamic symbols. Summary: This patch is to teach `llvm-objdump` dump dynamic symbols (`-T` and `--dynamic-syms`). Currently, this patch is not fully compatible with `gnu-objdump`, but I would like to continue working on this in next few patches. It has two issues. 1. Some symbols shouldn't be marked as global(g). (`-t/--syms` has same issue as well) (Fixed by D75659) 2. `gnu-objdump` can dump version information and dynamically insert before symbol name field. `objdump -T a.out` gives: ``` DYNAMIC SYMBOL TABLE: 0000000000000000 w D UND 0000000000000000 _ITM_deregisterTMCloneTable 0000000000000000 DF UND 0000000000000000 GLIBC_2.2.5 printf 0000000000000000 DF UND 0000000000000000 GLIBC_2.2.5 __libc_start_main 0000000000000000 w D UND 0000000000000000 __gmon_start__ 0000000000000000 w D UND 0000000000000000 _ITM_registerTMCloneTable 0000000000000000 w DF UND 0000000000000000 GLIBC_2.2.5 __cxa_finalize ``` `llvm-objdump -T a.out` gives: ``` DYNAMIC SYMBOL TABLE: 0000000000000000 w D UND 0000000000000000 _ITM_deregisterTMCloneTable 0000000000000000 g DF UND 0000000000000000 printf 0000000000000000 g DF UND 0000000000000000 __libc_start_main 0000000000000000 w D UND 0000000000000000 __gmon_start__ 0000000000000000 w D UND 0000000000000000 _ITM_registerTMCloneTable 0000000000000000 w DF UND 0000000000000000 __cxa_finalize ``` Reviewers: jhenderson, grimar, MaskRay, espindola Reviewed By: jhenderson, grimar Subscribers: emaste, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75756	2020-04-05 10:46:59 +08:00
Mehdi Amini	1ce0bc39ee	Add mention of advantages of `arc` in the Phabricator doc. Differential Revision: https://reviews.llvm.org/D76952	2020-04-04 03:22:29 +00:00
Guillaume Chatelet	9f5c786876	[NFC] G_DYN_STACKALLOC realign iff align > 1, update documentation Summary: I think it would be better to require the alignment to be >= 1. It is currently confusing to allow both values. Reviewers: courbet Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77372	2020-04-03 08:12:39 +00:00
Matt Arsenault	75cf30918f	AMDGPU: Assume f32 denormals are enabled by default This will likely introduce catastrophic performance regressions on older subtargets, but should be correct. A follow up change will remove the old fp32-denormals subtarget features, and switch to using the new denormal-fp-math/denormal-fp-math-f32 attributes. Frontends should be making sure to add the denormal-fp-math-f32 attribute when appropriate to avoid performance regressions.	2020-04-02 17:17:12 -04:00
Alexander Lanin	6668453dd2	[docs] use git diff instead of git format-patch Uploading output from `git format-patch` fails when version has more than 2 dots, e.g. git version 2.24.1.windows.2 which is currently recommended by e.g. GitExtensions or 2.24.1.rc on Linux. Differential Revision: https://reviews.llvm.org/D72374	2020-04-02 07:20:27 -07:00
Stefanos Baziotis	8348e9d71b	[LoopTerminology] Make term names bold Differential Revision: https://reviews.llvm.org/D77151	2020-04-02 14:53:18 +03:00
Djordje Todorovic	5e508b9bac	[llvm-dwarfdump] Add the --show-sections-sizes option Add an option to llvm-dwarfdump to calculate the bytes within the debug sections. Dump these numbers when using the --statistics option as well. This is an initial patch (e.g. we should support other units, since we only support 'bytes' now). Differential Revision: https://reviews.llvm.org/D74205	2020-04-02 13:14:30 +02:00
Roman Lebedev	de22d7154b	[llvm-exegesis] 'Min' repetition mode Summary: As noted in documentation, different repetition modes have different trade-offs: > .. option:: -repetition-mode=[duplicate\|loop] > > Specify the repetition mode. `duplicate` will create a large, straight line > basic block with `num-repetitions` copies of the snippet. `loop` will wrap > the snippet in a loop which will be run `num-repetitions` times. The `loop` > mode tends to better hide the effects of the CPU frontend on architectures > that cache decoded instructions, but consumes a register for counting > iterations. Indeed. Example: >>! In D74156#1873657, @lebedev.ri wrote: > At least for `CMOV`, i'm seeing wildly different results > \| \| Latency \| RThroughput \| > \| duplicate \| 1 \| 0.8 \| > \| loop \| 2 \| 0.6 \| > where latency=1 seems correct, and i'd expect the throughput to be close to 1/2 (since there are two execution units). This isn't great for analysis, at least for schedule model development. As discussed in excruciating detail in >>! In D74156#1924514, @gchatelet wrote: >>>! In D74156#1920632, @lebedev.ri wrote: >> ... did that explanation of the question i'm having made any sense? > > Thx for digging in the conversation ! > Ok it makes more sense now. > > I discussed it a bit with @courbet: > - We want the analysis tool to stay simple so we'd rather not make it knowledgeable of the repetition mode. > - We'd like to still be able to select either repetition mode to dig into special cases > > So we could add a third `min` repetition mode that would run both and take the minimum. It could be the default option. > Would you have some time to look what it would take to add this third mode? there appears to be an agreement that it is indeed sub-par, and that we should provide an optional, measurement (not analysis!) -time way to rectify the situation. However, the solution isn't entirely straight-forward. 
We can't just add an actual 'multiplexer' `MinSnippetRepetitor`, because if we just concatenate snippets produced by `DuplicateSnippetRepetitor` and `LoopSnippetRepetitor` and run+measure that, the measurement will naturally be different from what we'd get by running+measuring them separately and taking the min. ([[ https://www.wolframalpha.com/input/?i=%28x%2By%29%2F2+%21%3D+min%28x%2C+y%29 \| `time(D+L)/2 != min(time(D), time(L))` ]]) Also, it seems best to me to have a single snippet instead of generating a snippet per repetition mode, since the only difference here is that the loop repetition mode reserves one register for the loop counter. As far as i can tell, we can either teach `BenchmarkRunner::runConfiguration()` to produce a single report given multiple repetitors (as in the patch), or do that one layer higher - don't modify `BenchmarkRunner::runConfiguration()`, produce multiple reports, don't actually print each one, but aggregate them somehow and only print the final one. Initially i've gone ahead with the latter approach, but it didn't look like a natural fit; the former (as in the diff) does seem like a better fit to me. There's also a question of the test coverage. 
It sure currently does work here: ``` $ ./bin/llvm-exegesis --opcode-name=CMOV64rr --mode=inverse_throughput --repetition-mode=duplicate Check generated assembly with: /usr/bin/objdump -d /tmp/snippet-8fb949.o --- mode: inverse_throughput key: instructions: - 'CMOV64rr RAX RAX R11 i_0x0' - 'CMOV64rr RBP RBP R15 i_0x0' - 'CMOV64rr RBX RBX RBX i_0x0' - 'CMOV64rr RCX RCX RBX i_0x0' - 'CMOV64rr RDI RDI R10 i_0x0' - 'CMOV64rr RDX RDX RAX i_0x0' - 'CMOV64rr RSI RSI RAX i_0x0' - 'CMOV64rr R8 R8 R8 i_0x0' - 'CMOV64rr R9 R9 RDX i_0x0' - 'CMOV64rr R10 R10 RBX i_0x0' - 'CMOV64rr R11 R11 R14 i_0x0' - 'CMOV64rr R12 R12 R9 i_0x0' - 'CMOV64rr R13 R13 R12 i_0x0' - 'CMOV64rr R14 R14 R15 i_0x0' - 'CMOV64rr R15 R15 R13 i_0x0' config: '' register_initial_values: - 'RAX=0x0' - 'R11=0x0' - 'EFLAGS=0x0' - 'RBP=0x0' - 'R15=0x0' - 'RBX=0x0' - 'RCX=0x0' - 'RDI=0x0' - 'R10=0x0' - 'RDX=0x0' - 'RSI=0x0' - 'R8=0x0' - 'R9=0x0' - 'R14=0x0' - 'R12=0x0' - 'R13=0x0' cpu_name: bdver2 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 10000 measurements: - { key: inverse_throughput, value: 0.819, per_snippet_value: 12.285 } error: '' info: instruction has tied variables, using static renaming. assembled_snippet: 5541574156415541545348B8000000000000000049BB00000000000000004883EC08C7042400000000C7442404000000009D48BD000000000000000049BF000000000000000048BB000000000000000048B9000000000000000048BF000000000000000049BA000000000000000048BA000000000000000048BE000000000000000049B8000000000000000049B9000000000000000049BE000000000000000049BC000000000000000049BD0000000000000000490F40C3490F40EF480F40DB480F40CB490F40FA480F40D0480F40F04D0F40C04C0F40CA4C0F40D34D0F40DE4D0F40E14D0F40EC4D0F40F74D0F40FD490F40C35B415C415D415E415F5DC3 ... 
$ ./bin/llvm-exegesis --opcode-name=CMOV64rr --mode=inverse_throughput --repetition-mode=loop Check generated assembly with: /usr/bin/objdump -d /tmp/snippet-051eb3.o --- mode: inverse_throughput key: instructions: - 'CMOV64rr RAX RAX R11 i_0x0' - 'CMOV64rr RBP RBP RSI i_0x0' - 'CMOV64rr RBX RBX R9 i_0x0' - 'CMOV64rr RCX RCX RSI i_0x0' - 'CMOV64rr RDI RDI RBP i_0x0' - 'CMOV64rr RDX RDX R9 i_0x0' - 'CMOV64rr RSI RSI RDI i_0x0' - 'CMOV64rr R9 R9 R12 i_0x0' - 'CMOV64rr R10 R10 R11 i_0x0' - 'CMOV64rr R11 R11 R9 i_0x0' - 'CMOV64rr R12 R12 RBP i_0x0' - 'CMOV64rr R13 R13 RSI i_0x0' - 'CMOV64rr R14 R14 R14 i_0x0' - 'CMOV64rr R15 R15 R10 i_0x0' config: '' register_initial_values: - 'RAX=0x0' - 'R11=0x0' - 'EFLAGS=0x0' - 'RBP=0x0' - 'RSI=0x0' - 'RBX=0x0' - 'R9=0x0' - 'RCX=0x0' - 'RDI=0x0' - 'RDX=0x0' - 'R12=0x0' - 'R10=0x0' - 'R13=0x0' - 'R14=0x0' - 'R15=0x0' cpu_name: bdver2 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 10000 measurements: - { key: inverse_throughput, value: 0.6083, per_snippet_value: 8.5162 } error: '' info: instruction has tied variables, using static renaming. assembled_snippet: 5541574156415541545348B8000000000000000049BB00000000000000004883EC08C7042400000000C7442404000000009D48BD000000000000000048BE000000000000000048BB000000000000000049B9000000000000000048B9000000000000000048BF000000000000000048BA000000000000000049BC000000000000000049BA000000000000000049BD000000000000000049BE000000000000000049BF000000000000000049B80200000000000000490F40C3480F40EE490F40D9480F40CE480F40FD490F40D1480F40F74D0F40CC4D0F40D34D0F40D94C0F40E54C0F40EE4D0F40F64D0F40FA4983C0FF75C25B415C415D415E415F5DC3 ... 
$ ./bin/llvm-exegesis --opcode-name=CMOV64rr --mode=inverse_throughput --repetition-mode=min Check generated assembly with: /usr/bin/objdump -d /tmp/snippet-c7a47d.o Check generated assembly with: /usr/bin/objdump -d /tmp/snippet-2581f1.o --- mode: inverse_throughput key: instructions: - 'CMOV64rr RAX RAX R11 i_0x0' - 'CMOV64rr RBP RBP R10 i_0x0' - 'CMOV64rr RBX RBX R10 i_0x0' - 'CMOV64rr RCX RCX RDX i_0x0' - 'CMOV64rr RDI RDI RAX i_0x0' - 'CMOV64rr RDX RDX R9 i_0x0' - 'CMOV64rr RSI RSI RAX i_0x0' - 'CMOV64rr R9 R9 RBX i_0x0' - 'CMOV64rr R10 R10 R12 i_0x0' - 'CMOV64rr R11 R11 RDI i_0x0' - 'CMOV64rr R12 R12 RDI i_0x0' - 'CMOV64rr R13 R13 RDI i_0x0' - 'CMOV64rr R14 R14 R9 i_0x0' - 'CMOV64rr R15 R15 RBP i_0x0' config: '' register_initial_values: - 'RAX=0x0' - 'R11=0x0' - 'EFLAGS=0x0' - 'RBP=0x0' - 'R10=0x0' - 'RBX=0x0' - 'RCX=0x0' - 'RDX=0x0' - 'RDI=0x0' - 'R9=0x0' - 'RSI=0x0' - 'R12=0x0' - 'R13=0x0' - 'R14=0x0' - 'R15=0x0' cpu_name: bdver2 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 10000 measurements: - { key: inverse_throughput, value: 0.6073, per_snippet_value: 8.5022 } error: '' info: instruction has tied variables, using static renaming. 
assembled_snippet: 5541574156415541545348B8000000000000000049BB00000000000000004883EC08C7042400000000C7442404000000009D48BD000000000000000049BA000000000000000048BB000000000000000048B9000000000000000048BA000000000000000048BF000000000000000049B9000000000000000048BE000000000000000049BC000000000000000049BD000000000000000049BE000000000000000049BF0000000000000000490F40C3490F40EA490F40DA480F40CA480F40F8490F40D1480F40F04C0F40CB4D0F40D44C0F40DF4C0F40E74C0F40EF4D0F40F14C0F40FD490F40C3490F40EA5B415C415D415E415F5DC35541574156415541545348B8000000000000000049BB00000000000000004883EC08C7042400000000C7442404000000009D48BD000000000000000049BA000000000000000048BB000000000000000048B9000000000000000048BA000000000000000048BF000000000000000049B9000000000000000048BE000000000000000049BC000000000000000049BD000000000000000049BE000000000000000049BF000000000000000049B80200000000000000490F40C3490F40EA490F40DA480F40CA480F40F8490F40D1480F40F04C0F40CB4D0F40D44C0F40DF4C0F40E74C0F40EF4D0F40F14C0F40FD4983C0FF75C25B415C415D415E415F5DC3 ... ``` but i open to suggestions as to how test that. I also have gone with the suggestion to default to this new mode. This was irking me for some time, so i'm happy to finally see progress here. Looking forward to feedback. Reviewers: courbet, gchatelet Reviewed By: courbet, gchatelet Subscribers: mstojanovic, RKSimon, llvm-commits, courbet, gchatelet Tags: #llvm Differential Revision: https://reviews.llvm.org/D76921	2020-04-02 09:28:35 +03:00
Serguei Katkov	2ede5dccff	[DOC] Remove too strong restriction for ‘llvm.experimental.gc.statepoint’ Intrinsic The requirement for deopt parameter to be in gc parameter if it can be modified by GC is very strong and difficult to follow. The key example of why this can't work: %p1 = bitcast i8* %p to i8* statepoint [gc = (%p1)], [deopt = (%p1)] The optimizer is allowed to replace either use (or both) of %p1 with %p. If it updates only one of the two (entirely legal), the two sets do not overlap. So this change removes the strong wording. Reviewers: reames, dantrushin Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D77122	2020-04-02 10:56:42 +07:00
Johannes Doerfert	6cd673345c	[LangRef][AliasAnalysis] Clarify `noalias` affects only modified objects We already mention that `noalias` is modeled after the C99 `restrict` qualifier but we did omit one important requirement in the description. For the restrict guarantees the object affected has to be modified during the execution of the function, in any way (see 6.7.3.1.4 in [0]). There are two reasons we want this restriction as well: 1) To match the `restrict` semantics when we lower it to `noalias`. 2) To allow the reasoning that the object pointed to by a `noalias` pointer is not modified through means not derived from this pointer. Hence, following the uses of that pointer is sufficient to determine potential modifications. The discussion on this came up as part of D73428. In that patch the Attributor is taught to derive `noalias` for call site arguments based on alias queries against objects that are accessed in the callee. This is possible even if the pointer passed at the call site was "not-`noalias`". To simplify the logic there and to allow the use of `noalias` as described in 2) above, it is beneficial to follow the C `restrict` semantics in cases where there might be "read-read-aliases". Note that AliasAnalysis* queries for read only objects already result in `NoAlias` even if the pointers might "alias". * From this point of view our Alias Analysis is basically a Dependence Analysis. [0] http://www.open-std.org/jtc1/sc22/wg14/www/docs/n1124.pdf Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D74935	2020-04-01 20:40:55 -05:00
Richard Smith	11ccad6e87	[docs] Make llvm-addr2line documentation more explicit about which behavior is llvm-addr2line's and which is llvm-symbolizer's.	2020-03-31 12:44:45 -07:00
Sterling Augustine	21d9d0855b	New symbolizer option to print files relative to the compilation directory. Summary: New "--relative" option to allow printing files relative to the compilation directory. Reviewers: jhenderson Subscribers: MaskRay, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76733	2020-03-31 09:29:24 -07:00
Stefanos Baziotis	229cda968c	[LoopTerminology] LCSSA form Reviewed by: Michael Kruse (Meinersbur) Differential Revision: https://reviews.llvm.org/D75233	2020-03-31 15:30:59 +03:00
James Henderson	6aacdd6083	[docs] Document coding standard for error and warning messages In particular, these messages should start with a lower-case letter and should have no trailing period at the end of the last sentence. See http://lists.llvm.org/pipermail/llvm-dev/2020-March/140178.html for context. Reviewed by: aaron.ballman, hubert.reinterpretcast, rnk, dblaikie Differential Revision: https://reviews.llvm.org/D76833	2020-03-31 12:41:17 +01:00
Juneyoung Lee	05f0e598ab	[LangRef] Clarify the semantics of branch on undef Summary: This patch clarifies the semantics of branching on undef value. Defining `br undef` as undefined behavior explains optimizations that use branch conditions, such as CVP (D76931) and GVN (propagateEquality). For `switch cond`, it is defined to raise UB if cond is an expression containing undef && cond is not frozen && it may yield different values. This allows that at the destination block the branch condition can be assumed to be frozen already (otherwise UB was already triggered). This condition is slightly stricter than MemorySanitizer, which allows undef-y condition if it always leads to the same destination, but it does not break MemorySanitizer because we are giving stricter constraint. Reviewers: efriedma, fhahn, nikic, spatel, jdoerfert, nlopes Reviewed By: nlopes Subscribers: regehr, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76973	2020-03-30 11:41:47 +09:00
Evan LeClercq	37943e518c	[docs] Added solutions to slow build under common problems. I added a list of options to configure should someone have issues with long build time or running out of memory. This was added under common problems in the getting started section of the documentation. Reviewed By: Meinersbur, dim, e-leclercq Differential Revision: https://reviews.llvm.org/D75425	2020-03-28 04:19:45 -05:00
Louis Dionne	faf415a1de	[lit] Recursively expand substitutions This allows defining substitutions in terms of other substitutions. For example, a %build substitution could be defined in terms of a %cxx substitution as '%cxx %s -o %t.exe' and the script would be properly expanded. Differential Revision: https://reviews.llvm.org/D76178	2020-03-27 09:25:26 -04:00
Jinsong Ji	fe025a3490	[docs][Phabricator] git migration related update 1. Add instructions to update author when committing other's patch We have updated DeveloperPolicy to show how to change author in https://reviews.llvm.org/D72468 We should also update Phabricator page to include such information, in case people follow the steps here and forget to update author info. 2. Replace `git llvm push` with `git push` Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D76718	2020-03-26 18:08:06 +00:00
Aaron Ballman	4778e409de	Clarify use of llvm_unreachable in the coding standard. There has been some ongoing confusion regarding when to use `llvm_unreachable` which this patch attempts to address. Specifically, the confusion has been around whether `llvm_unreachable` is intended to mark only unreachable code paths that the compiler cannot determine itself or to mark a code path which is unconditionally a bug to reach. Based on email and IRC discussions, it sounds like "unconditional bug to reach" is the consensus.	2020-03-26 08:08:23 -04:00
Adrian Prantl	ed8ad6ec15	Add an -object-path-prefix option to dsymutil to remap object file paths (but no source paths) before processing. This is meant to be used for Clang objects where the module cache location was remapped using ``-fdebug-prefix-map``; to help dsymutil find the Clang module cache. <rdar://problem/55685132> Differential Revision: https://reviews.llvm.org/D76391	2020-03-24 17:13:42 -07:00
Louis Dionne	c5f4b72835	NFC: Fix typos in TestingGuide documentation	2020-03-24 14:54:55 -04:00
Louis Dionne	83346a4077	[lit] NFC: Document missing result codes These result codes already exist, but they were not documented. I assume this is an oversight when adding these result codes.	2020-03-24 14:46:54 -04:00
Simon Tatham	f282b6ab23	[ReleaseNotes,ARM] MVE intrinsics are all implemented! Summary: The next release of LLVM will support the full ACLE spec for MVE intrinsics, so it's worth saying so in the release notes. Reviewers: kristof.beyls Reviewed By: kristof.beyls Subscribers: cfe-commits, hans, dmgreen, llvm-commits Tags: #llvm, #clang Differential Revision: https://reviews.llvm.org/D76513	2020-03-24 11:42:25 +00:00
Jay Foad	0444d16a16	[GlobalISel] Add generic opcodes for saturating add/subtract Summary: Add new generic MIR opcodes G_SADDSAT etc. Add support in IRTranslator for translating the saturating add/subtract intrinsics to the new opcodes. Reviewers: aemerson, dsanders, paquette, arsenm Subscribers: jvesely, wdng, nhaehnle, rovka, hiraditya, volkan, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76600	2020-03-23 15:16:45 +00:00
Simon Pilgrim	6a6a83c6e9	MergeFunctions.rst - multiply vs shift typo (PR44717) The doc is suggesting that a mul-by-2 is the same as an ashr-by-1 instead of shl-by-1 Differential Revision: https://reviews.llvm.org/D76566	2020-03-23 10:13:25 +00:00
Sylvestre Ledru	986051749c	doc: use the right url to bugzilla	2020-03-22 22:49:40 +01:00
Sylvestre Ledru	72fd1033ea	Doc: Links should use https	2020-03-22 22:49:33 +01:00
Sylvestre Ledru	ea4ec17208	update of the llvm doc: we moved to git	2020-03-22 22:36:21 +01:00
Petr Hosek	8a8778f25f	[CMake] Enable the use of -ffile-prefix-map This handles not only paths embedded in debug info, but also in sources. Since the use of this flag is controlled by an option, rather than replacing the existing option, we add a new option. Differential Revision: https://reviews.llvm.org/D76018	2020-03-19 15:14:15 -07:00
Scott Linder	0e9368cc8c	[AMDGPU] Move frame pointer from s34 to s33 Remove the gap left between the stack pointer (s32) and frame pointer (s34) now that the scratch wave offset is no longer a part of the calling convention ABI. Update llvm/docs/AMDGPUUsage.rst to reflect the change. Tags: #llvm Differential Revision: https://reviews.llvm.org/D75657	2020-03-19 15:35:16 -04:00
Scott Linder	60b1967c39	[AMDGPU] Add Scratch Wave Offset to Scratch Buffer Descriptor in entry functions Add the scratch wave offset to the scratch buffer descriptor (SRSrc) in the entry function prologue. This allows us to remove the scratch wave offset register from the calling convention ABI. As part of this change, allow the use of an inline constant zero for the SOffset of MUBUF instructions accessing the stack in entry functions when a frame pointer is not requested/required. Entry functions with calls still need to set up the calling convention ABI stack pointer register, and reference it in order to address arguments of called functions. The ABI stack pointer register remains unswizzled, but is now wave-relative instead of queue-relative. Non-entry functions also use an inline constant zero SOffset for wave-relative scratch access, but continue to use the stack and frame pointers as before. When the stack or frame pointer is converted to a swizzled offset it is now scaled directly, as the scratch wave offset no longer needs to be subtracted first. Update llvm/docs/AMDGPUUsage.rst to reflect these changes to the calling convention. Tags: #llvm Differential Revision: https://reviews.llvm.org/D75138	2020-03-19 15:35:16 -04:00
Simon Moll	733b319948	[VP,Integer,#1] Vector-predicated integer intrinsics Summary: This patch adds IR intrinsics for vector-predicated integer arithmetic. It is subpatch #1 of the [integer slice](https://reviews.llvm.org/D57504#1732277) of [LLVM-VP](https://reviews.llvm.org/D57504). LLVM-VP is a larger effort to bring native vector predication to LLVM. Reviewed By: andrew.w.kaylor Differential Revision: https://reviews.llvm.org/D69891	2020-03-19 10:51:47 +01:00
Sanjay Patel	d8061456bc	[LangRef] fix typo in select poison explanation; NFC	2020-03-18 18:59:14 -04:00
Sanjay Patel	acaf144222	[LangRef] fix formatting tick; NFC	2020-03-18 17:26:41 -04:00
Sanjay Patel	faba1d034a	[LangRef] add explanatory text for select poison semantics (PR20895) This is copied from the suggested text by @regehr in: https://bugs.llvm.org/show_bug.cgi?id=20895 The way forward was not clear for several years, but now that we have 'freeze' and Alive2, the behavior should be documented. Also see comments in D76332.	2020-03-18 17:17:20 -04:00
Sergej Jaskiewicz	f8dbe50e99	[docs] Remove outdated note about migration to Git Reviewers: probinson, jyknight Reviewed By: probinson Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76074	2020-03-17 18:43:38 +03:00
Stefanos Baziotis	3f3bda1c37	[LoopTerminology] Minor fixes in loop rotation	2020-03-17 06:34:02 +02:00
Stefanos Baziotis	30dc342f08	[LoopTerminology] Rotated Loops images	2020-03-17 01:02:19 +02:00
Stefanos Baziotis	7fa204580d	[LoopTerminology] Rotated Loops	2020-03-17 00:54:26 +02:00
Artem Belevich	74bf95d71d	[CUDA] Updated CompileCudaWithLLVM doc.	2020-03-16 15:49:41 -07:00
Nico Weber	9e48422035	Revert "[llvm-objdump] Display locations of variables alongside disassembly" Makes tests fail on Windows, see https://reviews.llvm.org/D70720#1924542 This reverts commit `3a5ddedadb`, and follow-ups: `f4cb9c919e` `042eb0482a` `c0cf5f5da9` `18649f4813` `f62b898c1f`	2020-03-16 14:04:25 -04:00
Oliver Stannard	3a5ddedadb	[llvm-objdump] Display locations of variables alongside disassembly This adds the --debug-vars option to llvm-objdump, which prints locations (registers/memory) of source-level variables alongside the disassembly based on DWARF info. A vertical line is printed for each live-range, with a label at the top giving the variable name and location, and the position and length of the line indicating the program counter range in which it is valid. Currently, this only works for object files, not executables or shared libraries. Differential revision: https://reviews.llvm.org/D70720	2020-03-16 10:54:40 +00:00
Dylan McKay	56aed6144a	[AVR] Add a release note about the AVR backend becoming an official backend AVR has been enabled by default since `c480c584a0`, the tests have been stable for a couple days now, revert extremely unlikely.	2020-03-16 20:07:59 +13:00
Arlo Siemsen	1478ed69d3	Add support for SHA256 source file checksums in debug info LLVM currently supports CSK_MD5 and CSK_SHA1 source file checksums in debug info. This change adds support for CSK_SHA256 checksums. The SHA256 checksums are supported by the CodeView debug format. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D75785	2020-03-12 16:32:05 -07:00
Tyker	f16f139db4	Basis of dropping uses in llvm.assume. Summary: This patch adds the basic utilities to deal with droppable uses. Droppable uses are uses that we would rather drop than let them prevent transformations; for now they are limited to uses in llvm.assume. Reviewers: jdoerfert, sstefan1 Reviewed By: jdoerfert Subscribers: uenoku, lebedev.ri, mgorny, hiraditya, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73404	2020-03-12 10:10:22 +01:00
Jonathan Roelofs	6bfd10ff80	Fix internal links in Kaleidoscope tutorial	2020-03-09 15:07:44 -06:00
JF Bastien	8fc9eea43a	Test that volatile load type isn't changed Summary: As discussed in D75505, it's not particularly useful to change the type of a load to/from floating-point/integer because it's followed by a bitcast, and it might lead to surprising code generation. Check that this doesn't generally happen. Reviewers: lebedev.ri Subscribers: jkorous, dexonsmith, ributzka, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75644	2020-03-09 11:19:23 -07:00
Fangrui Song	0d673be13a	[llvm-objdump] Rename --disassemble-functions to --disassemble-symbols https://bugs.llvm.org/show_bug.cgi?id=41910 The feature can disassemble data and the new option name reflects its more generic usage. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D75816	2020-03-09 08:25:45 -07:00
kpdev	0dfcb23b05	[NFC][Test commit] Remove redundant point in docs	2020-03-07 10:30:42 +03:00
Hal Finkel	fa913f8980	Add the CodeReview Documentation to GettingInvolved TOC	2020-03-07 04:55:46 +00:00
Hal Finkel	4d0339aecb	High-Level Code-Review Documentation Update This is an update to the documentation of our community code-review process. Based on the RFC: High-Level Code-Review Documentation Update (http://lists.llvm.org/pipermail/llvm-dev/2019-November/136808.html). In this patch, I've pulled out the documentation into a separate file, and broken it into a number of subsections. This is, of course, just one further step in better documenting our community processes. I expect we'll continue to improve this over time. Thank you to everyone who provided feedback! Differential Revision: https://reviews.llvm.org/D71916	2020-03-07 04:20:18 +00:00
Shivam Gupta	dafc7a5492	Correct the Bjarne Stroustrup's C++ Page link Summary: The link to Bjarne Stroustrup's C++ Page was pointing to the wrong AT&T page. Reviewers: jyknight, sanjoy, silvas, hubert.reinterpretcast Reviewed By: hubert.reinterpretcast Subscribers: hubert.reinterpretcast, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75709	2020-03-06 16:59:50 -05:00
Pablo Barrio	e440e0a715	Fix MemTagSanitizer docs to point at Armv8.5-A MTE The Memory Tagging Extension was introduced in Armv8.5-A.	2020-03-05 17:23:58 +00:00
Stefanos Baziotis	6f5d5d6602	[LoopTerminology][NFC] Fix typo	2020-03-04 02:12:33 +02:00
Vedant Kumar	dd1ea9de2e	Reland: [Coverage] Revise format to reduce binary size Try again with an up-to-date version of D69471 (`99317124` was a stale revision). --- Revise the coverage mapping format to reduce binary size by: 1. Naming function records and marking them `linkonce_odr`, and 2. Compressing filenames. This shrinks the size of llc's coverage segment by 82% (334MB -> 62MB) and speeds up end-to-end single-threaded report generation by 10%. For reference the compressed name data in llc is 81MB (__llvm_prf_names). Rationale for changes to the format: - With the current format, most coverage function records are discarded. E.g., more than 97% of the records in llc are duplicate placeholders for functions visible-but-not-used in TUs. Placeholders are used to show under-covered functions, but duplicate placeholders waste space. - We reached general consensus about giving (1) a try at the 2017 code coverage BoF [1]. The thinking was that using `linkonce_odr` to merge duplicates is simpler than alternatives like teaching build systems about a coverage-aware database/module/etc on the side. - Revising the format is expensive due to the backwards compatibility requirement, so we might as well compress filenames while we're at it. This shrinks the encoded filenames in llc by 86% (12MB -> 1.6MB). See CoverageMappingFormat.rst for the details on what exactly has changed. Fixes PR34533 [2], hopefully. [1] http://lists.llvm.org/pipermail/llvm-dev/2017-October/118428.html [2] https://bugs.llvm.org/show_bug.cgi?id=34533 Differential Revision: https://reviews.llvm.org/D69471	2020-02-28 18:12:04 -08:00
Vedant Kumar	3388871714	Revert "[Coverage] Revise format to reduce binary size" This reverts commit `99317124e1`. This is still busted on Windows: http://lab.llvm.org:8011/builders/lld-x86_64-win7/builds/40873 The llvm-cov tests report 'error: Could not load coverage information'.	2020-02-28 18:03:15 -08:00
Vedant Kumar	99317124e1	[Coverage] Revise format to reduce binary size Revise the coverage mapping format to reduce binary size by: 1. Naming function records and marking them `linkonce_odr`, and 2. Compressing filenames. This shrinks the size of llc's coverage segment by 82% (334MB -> 62MB) and speeds up end-to-end single-threaded report generation by 10%. For reference the compressed name data in llc is 81MB (__llvm_prf_names). Rationale for changes to the format: - With the current format, most coverage function records are discarded. E.g., more than 97% of the records in llc are duplicate placeholders for functions visible-but-not-used in TUs. Placeholders are used to show under-covered functions, but duplicate placeholders waste space. - We reached general consensus about giving (1) a try at the 2017 code coverage BoF [1]. The thinking was that using `linkonce_odr` to merge duplicates is simpler than alternatives like teaching build systems about a coverage-aware database/module/etc on the side. - Revising the format is expensive due to the backwards compatibility requirement, so we might as well compress filenames while we're at it. This shrinks the encoded filenames in llc by 86% (12MB -> 1.6MB). See CoverageMappingFormat.rst for the details on what exactly has changed. Fixes PR34533 [2], hopefully. [1] http://lists.llvm.org/pipermail/llvm-dev/2017-October/118428.html [2] https://bugs.llvm.org/show_bug.cgi?id=34533 Differential Revision: https://reviews.llvm.org/D69471	2020-02-28 17:33:25 -08:00
Francis Visoiu Mistrih	e551b737c3	[LTO][Legacy] Add new API to query Mach-O CPU (sub)type Tools working with object files on Darwin (e.g. lipo) may need to know properties like the CPU type and subtype of a bitcode file. The logic of converting a triple to a Mach-O CPU_(SUB_)TYPE should be provided by LLVM instead of relying on tools to re-implement it. Differential Revision: https://reviews.llvm.org/D75067	2020-02-28 12:56:05 -08:00
Vedant Kumar	b0142cd986	[ADT] Add CoalescingBitVector, implemented using IntervalMap [1/3] Add CoalescingBitVector to ADT. This is part 1 of a 3-part series to address a compile-time explosion issue in LiveDebugValues. --- CoalescingBitVector is a bitvector that, under the hood, relies on an IntervalMap to coalesce elements into intervals. CoalescingBitVector efficiently represents sets which predominantly contain contiguous ranges (e.g. the VarLocSets in LiveDebugValues, which are very long sequences that look like {1, 2, 3, ...}). OTOH, CoalescingBitVector isn't good at representing sets with lots of gaps between elements. The first N coalesced intervals of set bits are stored in-place (in the initial heap allocation). Compared to SparseBitVector, CoalescingBitVector offers more predictable performance for non-sequential find() operations. This provides a crucial speedup in LiveDebugValues. Differential Revision: https://reviews.llvm.org/D74984	2020-02-27 12:39:46 -08:00
Stefanos Baziotis	2a49d650a5	[docs][LoopTerminology] Add Loop Simplify Form description. Information taken from https://youtu.be/3pRhvQi7Z10?t=481 and comments in LoopSimplify.h. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D74989	2020-02-26 20:41:06 -06:00
James Henderson	974bce3edd	[docs][llvm-objcopy][llvm-strip] Move --wildcard description earlier This moves it above the response file description, which should be at the end.	2020-02-26 10:51:17 +00:00
James Henderson	6b74745c06	[docs][llvm-symbolizer] Fix indentation of inline option examples The examples for different options were inconsistently indented in the HTML display. As they are tied to the options, this change normalises to indent them the same as the option description body.	2020-02-26 10:51:16 +00:00
James Henderson	190707f60e	[docs][llvm-symbolizer] Fix --functions description "--functions none" and "--functions=none" are not the same. One is the option "--functions" with its default value of "linkage", followed by an input address of "none", and the other is "--functions" with the value "none". This patch fixes the doc to match the actual behaviour by adding an extra '=' sign in the allowed values description.	2020-02-26 10:50:24 +00:00
Bill Wendling	23c2a5ce33	Allow "callbr" to return non-void values Summary: Terminators in LLVM aren't prohibited from returning values. This means that the "callbr" instruction, which is used for "asm goto", can support "asm goto with outputs." This patch removes all restrictions against "callbr" returning values. The heavy lifting is done by the code generator. The "INLINEASM_BR" instruction's a terminator, and the code generator doesn't allow non-terminator instructions after a terminator. In order to correctly model the feature, we need to copy outputs from "INLINEASM_BR" into virtual registers. Of course, those copies aren't terminators. To get around this issue, we split the block containing the "INLINEASM_BR" right before the "COPY" instructions. This results in two cheats: - Any physical registers defined by "INLINEASM_BR" need to be marked as live-in into the block with the "COPY" instructions. This violates an assumption that physical registers aren't marked as "live-in" until after register allocation. But it seems as if the live-in information only needs to be correct after register allocation. So we're able to get away with this. - The indirect branches from the "INLINEASM_BR" are moved to the "COPY" block. This is to satisfy PHI nodes. I've been told that MLIR can support this handily, but until we're able to use it, we'll have to stick with the above. Reviewers: jyknight, nickdesaulniers, hfinkel, MaskRay, lattner Reviewed By: nickdesaulniers, MaskRay, lattner Subscribers: rriddle, qcolombet, jdoerfert, MatzeB, echristo, MaskRay, xbolva00, aaron.ballman, cfe-commits, JonChesterfield, hiraditya, llvm-commits, rnk, craig.topper Tags: #llvm, #clang Differential Revision: https://reviews.llvm.org/D69868	2020-02-24 18:29:06 -08:00
Francesco Petrogalli	3d65dd1e66	[ReleaseNotes] Mention the `vector-function-abi-variant` attribute. Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74969	2020-02-24 17:39:31 +00:00
Bevin Hansson	6e561d1c94	[Intrinsic] Add fixed point saturating division intrinsics. Summary: This patch adds intrinsics and ISelDAG nodes for signed and unsigned fixed-point division: `llvm.sdiv.fix.sat.*` and `llvm.udiv.fix.sat.*`. These intrinsics perform scaled, saturating division on two integers or vectors of integers. They are required for the implementation of the Embedded-C fixed-point arithmetic in Clang. Reviewers: bjope, leonardchan, craig.topper Subscribers: hiraditya, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71550	2020-02-24 10:50:52 +01:00
Fangrui Song	fc6057e34f	[Frontend] Replace CC1 option -mcode-model with -mcmodel= Before: % clang -mcmodel=x -xc /dev/null error: invalid argument 'x' in '-mcode-model x' Now: % clang -mcmodel=x -xc /dev/null clang-11: error: invalid argument 'x' to -mcmodel=	2020-02-21 23:10:50 -08:00
Stefanos Baziotis	393f4e8ac2	[Analysis][Docs] Parents of loops documentation. Recently I had to use it and although one assumes it returns null if there's no parent loop, I think it helps to doc it. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D74890	2020-02-21 17:11:53 -06:00
Tony	788e74ce29	[AMDGPU] AMDGPUUsage define call convention ABI Reviewers: scott.linder, arsenm, b-sumner Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74861	2020-02-19 15:56:19 -05:00
Tony	f5678d4a6a	[AMDGPU] Update AMDGPUUsage with DWARF proposal Summary: - Add AMDGPU DWARF proposal. - Add references for gfx10 ISA and SemVer. Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, aprantl, dstuttard, tpr, jfb, dmgreen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70523	2020-02-19 15:30:53 -05:00
Tyker	170ae68fef	[AssumeBundle] Add documentation for the operand bundles of an llvm.assume Summary: Operand bundles on an llvm.assume allow representing assumptions that an attribute holds for a certain value at a certain position. Operand bundles enable assumptions that are either hard or impossible to represent as a boolean argument of an llvm.assume. Reviewers: jdoerfert, fhahn, nlopes, reames, regehr, efriedma Reviewed By: jdoerfert Subscribers: lebedev.ri, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74209	2020-02-19 18:53:15 +01:00
Reid Kleckner	236fcbc21a	Add coding standard recommending use of qualifiers in cpp files There is prior art for this in the code base itself, and a recent example of this here: `c45f8d4989` This came up in discussion on this review where @maskray was going the opposite direction: https://reviews.llvm.org/D68772 Given that there is disagreement, we should make a choice and document it. Thanks to John McCall for the precise wording. Reviewed By: MaskRay, rjmccall Differential Revision: https://reviews.llvm.org/D74515	2020-02-18 14:08:56 -08:00
David Tenty	58817a0783	[clang][XCOFF] Indicate that XCOFF does not support COMDATs Summary: XCOFF doesn't support COMDATs, so clang shouldn't emit them. Reviewers: stevewan, sfertile, Xiangling_L Reviewed By: sfertile Subscribers: dschuff, aheejin, dexonsmith, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D74631	2020-02-18 16:10:11 -05:00
Evandro Menezes	4af3be7b04	[docs] Add note on using cmake to perform the build Repeat the build instructions from the top level README in the Getting Started guide.	2020-02-14 13:44:56 -06:00
James Henderson	4e1c49cf4d	[doc] Clarify responsibility for fixing experimental target problems Experimental targets are meant to be maintained by the community behind the target. They are not monitored by the primary build bots. This change clarifies that it is this community's responsibility for things like test fixes related to the target caused by changes unrelated to that target. See http://lists.llvm.org/pipermail/llvm-dev/2020-February/139115.html for a full discussion. Reviewed by: rupprecht, lattner, MaskRay Differential Revision: https://reviews.llvm.org/D74538	2020-02-14 09:50:18 +00:00

8904 Commits