llvm-project

Commit Graph

Author	SHA1	Message	Date
Aakanksha Patil	3453f3dd46	[AMDGPU] Add gfx1035 target Differential Revision: https://reviews.llvm.org/D104804	2021-06-24 14:32:41 -04:00
Brendon Cahoon	927b809783	[GlobalISel] Describe undefined values for G_SBFX/G_UBFX operands Differential Revision: https://reviews.llvm.org/D104245	2021-06-24 09:31:41 -04:00
Jay Foad	beebe5a056	[MCA] Allow unlimited cycles in the timeline view Change --max-timeline-cycles=0 to mean no limit on the number of cycles. Use this in AMDGPU tests to show all instructions in the timeline view instead of having it arbitrarily truncated. Differential Revision: https://reviews.llvm.org/D104846	2021-06-24 12:54:57 +01:00
Arthur Eubanks	e15673df27	[docs][NewPM] Add some instructions on how to invoke opt Also add link to blog post. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D104812	2021-06-23 19:49:35 -07:00
Nick Desaulniers	24d48d45cc	[LangRef] add note to warn-frame-size about ODR As suggested by @dblaikie in D104342. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D104736	2021-06-23 16:28:55 -07:00
pooja2299	a15f9ff996	[docs][GISel]Added GISel documentation link Added the GISel docs link here - https://llvm.org/docs/CodeGenerator.html#instruction-selection-section Differential Revision: https://reviews.llvm.org/D104204	2021-06-24 00:55:00 +05:30
Nick Desaulniers	8ace121305	[IR] convert warn-stack-size from module flag to fn attr Otherwise, this causes issues when building with LTO for object files that use different values. Link: https://github.com/ClangBuiltLinux/linux/issues/1395 Reviewed By: dblaikie, MaskRay Differential Revision: https://reviews.llvm.org/D104342	2021-06-21 15:09:25 -07:00
Andrew Ng	d02bf362dc	[llvm-symbolizer][docs] Update example for --verbose in the guide Differential Revision: https://reviews.llvm.org/D104128	2021-06-17 19:12:44 +01:00
Bjorn Pettersson	4c7f820b2b	Update @llvm.powi to handle different int sizes for the exponent This can be seen as a follow up to commit `0ee439b705`, that changed the second argument of __powidf2, __powisf2 and __powitf2 in compiler-rt from si_int to int. That was to align with how those runtimes are defined in libgcc. One thing that seems to have been missing in that patch was to make sure that the rest of LLVM also handles that the argument now depends on the size of int (not using the si_int machine mode for 32-bit). When using __builtin_powi for a target with 16-bit int clang crashed. And when emitting libcalls to those rtlib functions (typically when lowering @llvm.powi), the backend would always prepare the exponent argument as an i32 which caused miscompiles when the rtlib was compiled with 16-bit int. The solution used here is to use an overloaded type for the second argument in @llvm.powi. This way clang can use the "correct" type when lowering __builtin_powi, and then later when emitting the libcall it is assumed that the type used in @llvm.powi matches the rtlib function. One thing that needed some extra attention was that when vectorizing calls several passes did not support that several arguments could be overloaded in the intrinsics. This patch allows overload of a scalar operand by adding hasVectorInstrinsicOverloadedScalarOpd, with an entry for powi. Differential Revision: https://reviews.llvm.org/D99439	2021-06-17 09:38:28 +02:00
Joachim Meyer	053dbb939d	Use `-cfg-func-name` value as filter for `-view-cfg`, etc. Currently the value is only used when calling `F->viewCFG()` which is missing out on its potential and usefulness. So I added the check to the printer passes as well. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D102011	2021-06-16 23:54:51 +02:00
Patrick Holland	ef16c8eaa5	Reapply "[MCA] Adding the CustomBehaviour class to llvm-mca". The original change was pushed in main as commit `f7a23ecece`. It was then reverted by commit `a04f01bab2` because it caused linker failures on buildbots that don't build the AMDGPU target. -- Some instructions are not defined well enough within the target’s scheduling model for llvm-mca to be able to properly simulate its behaviour. The ideal solution to this situation is to modify the scheduling model, but that’s not always a viable strategy. Maybe other parts of the backend depend on that instruction being modelled the way that it is. Or maybe the instruction is quite complex and it’s difficult to fully capture its behaviour with tablegen. The CustomBehaviour class (which I will refer to as CB frequently) is designed to provide intuitive scaffolding for developers to implement the correct modelling for these instructions. More details are available in the original commit log message (`f7a23ecece`). Differential Revision: https://reviews.llvm.org/D104149	2021-06-16 16:54:48 +01:00
Ben Dunbobbin	dbc07ef5ca	[llvm-symbolizer] improve test and fix doc example after recent --print-source-context-lines behaviour change I believe that after https://reviews.llvm.org/D102355 the behaviour of --print-source-context-lines has changed. Before: --print-source-context-lines=3 prints 4 lines. After: --print-source-context-lines=3 prints 3 lines. Adjust the example in the docs for this change and make the testing a little more robust. Differential Revision: https://reviews.llvm.org/D104114	2021-06-16 13:38:22 +01:00
Andrea Di Biagio	a04f01bab2	Revert "[MCA] Adding the CustomBehaviour class to llvm-mca" This reverts commit `f7a23ecece`. It appears to break buildbots that don't build the AMDGPU backend.	2021-06-15 21:41:36 +01:00
Patrick Holland	f7a23ecece	[MCA] Adding the CustomBehaviour class to llvm-mca Some instructions are not defined well enough within the target’s scheduling model for llvm-mca to be able to properly simulate its behaviour. The ideal solution to this situation is to modify the scheduling model, but that’s not always a viable strategy. Maybe other parts of the backend depend on that instruction being modelled the way that it is. Or maybe the instruction is quite complex and it’s difficult to fully capture its behaviour with tablegen. The CustomBehaviour class (which I will refer to as CB frequently) is designed to provide intuitive scaffolding for developers to implement the correct modelling for these instructions. Implementation details: llvm-mca does its best to extract relevant register, resource, and memory information from every MCInst when lowering them to an mca::Instruction. It then uses this information to detect dependencies and simulate stalls within the pipeline. For some instructions, the information that gets captured within the mca::Instruction is not enough for mca to simulate them properly. In these cases, there are two main possibilities: 1. The instruction has a dependency that isn’t detected by mca. 2. mca is incorrectly enforcing a dependency that shouldn’t exist. For the rest of this discussion, I will be focusing on (1), but I have put some thought into (2) and I may revisit it in the future. So we have an instruction that has dependencies that aren’t picked up by mca. The basic idea for both pipelines in mca is that when an instruction wants to be dispatched, we first check for register hazards and then we check for resource hazards. This is where CB is injected. If no register or resource hazards have been detected, we make a call to CustomBehaviour::checkCustomHazard() to give the target specific CB the chance to detect and enforce any custom dependencies. 
The return value for checkCustomHazard() is an unsigned int representing the (minimum) number of cycles that the instruction needs to stall for. It’s fine to underestimate this value because when StallCycles gets down to 0, we’ll end up checking for all the hazards again before the instruction is actually dispatched. However, it’s important not to overestimate the value and the more accurate your estimate is, the more efficient mca’s execution can be. In general, for checkCustomHazard() to be able to detect these custom dependencies, it needs information about the current instruction and also all of the instructions that are still executing within the pipeline. The mca pipeline uses mca::Instruction rather than MCInst and the current information encoded within each mca::Instruction isn’t sufficient for my use cases. I had to add a few extra attributes to the mca::Instruction class and have them get set by the MCInst during instruction building. For example, the current mca::Instruction doesn’t know its opcode, and it also doesn’t know anything about its immediate operands (both of which I had to add to the class). With information about the current instruction, a list of all currently executing instructions, and some target specific objects (MCSubtargetInfo and MCInstrInfo which the base CB class has references to), developers should be able to detect and enforce most custom dependencies within checkCustomHazard. If you need more information than is present in the mca::Instruction, feel free to add attributes to that class and have them set during the lowering sequence from MCInst. Fortunately, in the in-order pipeline, it’s very convenient for us to pass these arguments to checkCustomHazard. The hazard checking is taken care of within InOrderIssueStage::canExecute(). 
This function takes a const InstRef as a parameter (representing the instruction that currently wants to be dispatched) and the InOrderIssueStage class maintains a SmallVector<InstRef, 4> which holds all of the currently executing instructions. For the out-of-order pipeline, it’s a bit trickier to get the list of executing instructions and this is why I have held off on implementing it myself. This is the main topic I will bring up when I eventually make a post to discuss and ask for feedback. CB is a base class where targets implement their own derived classes. If a target specific CB does not exist (or we pass in the -disable-cb flag), the base class is used. This base class trivially returns 0 from its checkCustomHazard() implementation (meaning that the current instruction needs to stall for 0 cycles aka no hazard is detected). For this reason, targets or users who choose not to use CB shouldn’t see any negative impacts to accuracy or performance (in comparison to pre-patch llvm-mca). Differential Revision: https://reviews.llvm.org/D104149	2021-06-15 21:30:48 +01:00
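The stall-and-recheck behaviour described in the CustomBehaviour commit above can be sketched with a toy model. This is not llvm-mca code: `cycles_until_dispatch`, the three estimator functions, and `HAZARD_CLEARS_AT` are hypothetical names invented for illustration, and the loop only assumes what the message states, namely that hazards are checked again once the stall count reaches 0.

```python
from typing import Callable, Tuple

# Hypothetical cycle at which the real hazard clears (illustration only).
HAZARD_CLEARS_AT = 7


def cycles_until_dispatch(check_custom_hazard: Callable[[int], int]) -> Tuple[int, int]:
    """Toy stall-and-recheck loop.

    Returns (dispatch_cycle, number_of_hazard_checks). The callback plays the
    role of checkCustomHazard(): it returns how many cycles to stall, 0 = none.
    """
    cycle = 0
    checks = 0
    while True:
        checks += 1
        stall = check_custom_hazard(cycle)
        if stall == 0:
            return cycle, checks
        # Wait out the reported stall, then check every hazard again.
        cycle += stall


def exact_estimate(cycle: int) -> int:
    """Reports exactly the remaining number of stall cycles."""
    return max(0, HAZARD_CLEARS_AT - cycle)


def under_estimate(cycle: int) -> int:
    """Always reports a 1-cycle stall while the hazard is present."""
    return 1 if cycle < HAZARD_CLEARS_AT else 0


def over_estimate(cycle: int) -> int:
    """Reports a 10-cycle stall, longer than the hazard really lasts."""
    return 10 if cycle < HAZARD_CLEARS_AT else 0


if __name__ == "__main__":
    for estimator in (exact_estimate, under_estimate, over_estimate):
        print(estimator.__name__, cycles_until_dispatch(estimator))
```

In this model the underestimating callback still dispatches at the right cycle but is called more often, while the overestimating one dispatches late, which matches the guidance that underestimating costs only efficiency and overestimating costs accuracy.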
Arthur Eubanks	0e31e22ed9	[docs][OpaquePtr] Shuffle around the transition plan section Emphasize that this is basically an attempt to remove ``PointerType::getElementType`` and ``Type::getPointerElementType()``. Add a couple more subtasks. Differential Revision: https://reviews.llvm.org/D104151	2021-06-14 10:59:41 -07:00
Jeroen Dobbelaere	bb8ce25e88	Intrinsic::getName: require a Module argument Ensure that we provide a `Module` when checking if a rename of an intrinsic is necessary. This fixes the issue that was detected by https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=32288 (as mentioned by @fhahn), after committing D91250. Note that the `LLVMIntrinsicCopyOverloadedName` is being deprecated in favor of `LLVMIntrinsicCopyOverloadedName2`. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D99173	2021-06-14 14:52:29 +02:00
Simon Moll	74d45b884c	[VP] Binary floating-point intrinsics. This patch implements vector-predicated intrinsics on IR level for fadd, fsub, fmul, fdiv and frem. These operate in the default floating-point environment. We will use constrained fp operand bundles for constrained vector-predicated fp math (D93455). Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93470	2021-06-14 08:51:41 +02:00
Philip Reames	ac81cb7e6d	Allow ptrtoint/inttoptr of non-integral pointer types in IR I don't like landing this change, but it's an acknowledgement of a practical reality. Despite not having well specified semantics for inttoptr and ptrtoint involving non-integral pointer types, they are used in practice. Here's a quick summary of the current pragmatic reality: * I happen to know that the main external user of non-integral pointers has effectively disabled the verifier rules. * RS4GC (the lowering pass for abstract GC machine model which is the key motivation for non-integral pointers) even supports them. We just have all the tests using an integral pointer space to let the verifier run. * Certain idioms (such as alignment checks for alignment N, where any relocation is guaranteed to be N byte aligned) are fine in practice. * As implemented, inttoptr/ptrtoint are CSEd and are not control dependent. This means that any code which is intending to check a particular bit pattern at site of use must be wrapped in an intrinsic or external function call. This change allows them in the Verifier, and updates the LangRef to specify them as implementation dependent. This allows us to acknowledge current reality while still leaving ourselves room to punt on figuring out "good" semantics until the future.	2021-06-11 13:38:32 -07:00
Arthur Eubanks	06c3d52aa2	[docs][OpaquePtr] Add some specific examples of what needs to be done	2021-06-11 12:51:46 -07:00
gbreynoo	3b46283c15	[docs][llvm-ar] Add rsp-quoting option to the llvm-ar command guide. I noticed that I did not update the command guide when introducing the --rsp-quoting option. This change fixes this. Differential Revision: https://reviews.llvm.org/D103915	2021-06-10 16:32:31 +01:00
Juneyoung Lee	c0438a2c0f	[LangRef] Fix missing code highlighting format	2021-06-10 16:12:17 +09:00
Jim Lin	dec3154c16	[Docs] Fix incorrect return type for example code	2021-06-10 14:20:11 +08:00
madhur13490	62bd7da889	[LangRef] Add link to opaque pointers Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D103981	2021-06-10 00:11:02 +05:30
Nathan Sidwell	f776108168	[docs] Collate CMake options I found the documentation of the various CMake variables difficult to navigate, because they are unsorted. I can see they've grown organically with new clusters of somewhat-related options, but the result is hard to use. This collates them (treating '_' as space). Differential Revision: https://reviews.llvm.org/D102481	2021-06-09 11:24:38 -07:00
Jim Lin	391f9ef1aa	[docs] Fix load instructions in chapter 7 of the tutorial Loads in the first half of the chapter are missing the type argument. Patched By: klao (Mihaly Barasz) Reviewed By: Jim Differential Revision: https://reviews.llvm.org/D90326	2021-06-09 17:39:11 +08:00
Jim Lin	9751af22c4	[Docs] Fix incorrect return type for example code	2021-06-09 15:22:49 +08:00
Brendon Cahoon	294efbbd3e	Reland "[AMDGPU] Add gfx1013 target" This reverts commit `211e584fa2`. Fixed a use-after-free error that caused the sanitizers to fail.	2021-06-08 21:15:35 -04:00
Brendon Cahoon	211e584fa2	Revert "[AMDGPU] Add gfx1013 target" This reverts commit `ea10a86984`. A sanitizer buildbot reports an error.	2021-06-08 16:29:41 -04:00
Brendon Cahoon	ea10a86984	[AMDGPU] Add gfx1013 target Differential Revision: https://reviews.llvm.org/D103663	2021-06-08 12:49:49 -04:00
Arthur Eubanks	47211fa889	Revert "[TargetLowering] Only inspect attributes in the arguments for ArgListEntry" Needs to be discussed more. This reverts commit 255a5c1baa6020c009934b4fa342f9f6dbbcc46 This reverts commit df2056ff3730316f376f29d9986c9913b95ceb1 This reverts commit faff79b7ca144e505da6bc74aa2b2f7cffbbf23 This reverts commit d2a9020785c6e02afebc876aa2778fa64c5cafd	2021-06-07 16:07:44 -07:00
Krzysztof Parzyszek	9d35c1701f	[docs] Set Phabricator as the tool for pre-commit reviews Differential Revision: https://reviews.llvm.org/D103811	2021-06-07 11:50:52 -05:00
Arthur Eubanks	9255a5c1ba	[TargetLowering] Only inspect attributes in the arguments for ArgListEntry Parameter attributes are considered part of the function [1], and like mismatched calling conventions [2], we can't have the verifier check for mismatched parameter attributes. Issues can be diagnosed with D103412. [1] https://llvm.org/docs/LangRef.html#parameter-attributes [2] https://llvm.org/docs/FAQ.html#why-does-instcombine-simplifycfg-turn-a-call-to-a-function-with-a-mismatched-calling-convention-into-unreachable-why-not-make-the-verifier-reject-it Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D101806	2021-06-03 15:52:01 -07:00
Fangrui Song	a3fd40b955	[docs] Update llvm-cov gcov Mention some new options. Remove outdated information about -g and -O0. -g0 works. -O1/-O2/-O3 work.	2021-06-03 12:36:27 -07:00
cynecx	22f635b1b3	[LangRef] update according to unwinding support in inline asm https://reviews.llvm.org/D95745 introduced a new `unwind` keyword for inline assembler expressions. Inline asms marked with the `unwind` keyword allow stack unwinding from inline assembly because the compiler emits unwinding information ("around" the inline asm) as it would for calls/invokes. Unwinding the stack from within non-unwind inline asm may cause UB. Reviewed By: Amanieu Differential Revision: https://reviews.llvm.org/D102642	2021-05-31 09:01:46 +01:00
Arthur Eubanks	71cca4f728	Revert "[TargetLowering] Only inspect attributes in the arguments for ArgListEntry" This reverts commit `1c7f32334d`. Some code still needs to properly set parameter ABI attributes, see D101806.	2021-05-29 23:08:15 -07:00
Tim Northover	9ff2eb1ea5	SwiftTailCC: teach verifier musttail rules applicable to this CC. SwiftTailCC has a different set of requirements than the C calling convention for a tail call. The exact argument sequence doesn't have to match, but fewer ABI-affecting attributes are allowed. Also make sure the musttail diagnostic triggers if a musttail call isn't actually a tail call.	2021-05-28 11:12:00 +01:00
Fangrui Song	3f85e124f6	[docs] llvm-objdump: Mention -M no-aliases is supported on AArch64	2021-05-26 23:57:32 -07:00
Yevgeny Rouban	4d26f41f76	[RS4GC] Introduce intrinsics to get base ptr and offset There can be a need for some optimizations to get (base, offset) for any GC pointer. The base can be calculated by generating needed instructions as it is done by the RewriteStatepointsForGC::findBasePointer() function. The offset can be calculated in the same way. Though to not expose the base calculation and to make the offset calculation as simple as ptrtoint(derived_ptr) - ptrtoint(base_ptr), which is illegal outside RS4GC, this patch introduces 2 intrinsics: @llvm.experimental.gc.get.pointer.base(%derived_ptr) @llvm.experimental.gc.get.pointer.offset(%derived_ptr) These intrinsics are inlined by RS4GC along with generation of statepoint sequences. With these new intrinsics the GC parseable lowering for atomic memcpy intrinsics (`6ec2c5e402`) could be implemented as a separate pass. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D100445	2021-05-27 09:14:14 +07:00
naromero77	5f8810d7b4	[flang][docs] Initial documentation for the Fortran LLVM Test Suite. Describes how to run the Fortran LLVM Test Suite, specifically the external SPEC CPU 2017 Fortran tests. Reviewed By: rovka Differential Revision: https://reviews.llvm.org/D102877	2021-05-26 15:59:55 -05:00
pooja2299	cebdf5d846	[Docs] Updated the content of getting started documentation under llvm/lib/MC Wrote about llvm/lib/MC subproject on https://llvm.org/docs/GettingStarted.html page. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D101047	2021-05-26 16:25:26 +05:30
Martin Storsjö	a2a65a5bae	[docs] [CMake] Change recommendations for how to use LLVM_DEFINITIONS LLVM_DEFINITIONS is a string variable containing a list of arguments to pass to the compiler. When CMake's add_definitions is passed a string variable, this is interpreted as one argument. To make it behave properly, the string variable needs to be split into a list. Despite the fact that add_definitions isn't supposed to be used like the LLVM docs recommended, it worked fine in practice in many cases. If the first argument in LLVM_DEFINITIONS is of the form -DFOO=42 instead of plain -DFOO, the rest of the string is treated as value to this define. I.e. if LLVM_DEFINITIONS consists of `-DFOO=42 -DBAR`, CMake ended up passing `-DFOO="42 -DBAR"` to the compiler. See https://gitlab.kitware.com/cmake/cmakissues/22162 for discussion on the matter. Changing LLVM_DEFINITIONS to be a list variable would possibly be more disruptive; instead keep the variable defined as before but change the recommendation for how to use it. Then projects using it can gradually be updated to follow the new recommendation. Differential Revision: https://reviews.llvm.org/D103044	2021-05-25 22:56:51 +03:00
Arthur Eubanks	dce91f247d	[docs] Explain address spaces a bit more in opaque pointers doc Reviewed By: theraven Differential Revision: https://reviews.llvm.org/D102523	2021-05-25 12:35:43 -07:00
Marco Elver	280333021e	[SanitizeCoverage] Add support for NoSanitizeCoverage function attribute We really ought to support no_sanitize("coverage") in line with other sanitizers. This came up again in discussions on the Linux-kernel mailing lists, because we currently do workarounds using objtool to remove coverage instrumentation. Since that support is only on x86, to continue supporting coverage instrumentation on other architectures, we must support selectively disabling coverage instrumentation via function attributes. Unfortunately, for SanitizeCoverage, it has not been implemented as a sanitizer via fsanitize= and associated options in Sanitizers.def, but rolls its own option fsanitize-coverage. This meant that we never got "automatic" no_sanitize attribute support. Implement no_sanitize attribute support by special-casing the string "coverage" in the NoSanitizeAttr implementation. To keep the feature as unintrusive to existing IR generation as possible, define a new negative function attribute NoSanitizeCoverage to propagate the information through to the instrumentation pass. Fixes: https://bugs.llvm.org/show_bug.cgi?id=49035 Reviewed By: vitalybuka, morehouse Differential Revision: https://reviews.llvm.org/D102772	2021-05-25 12:57:14 +02:00
Roman Lebedev	78eaff2ef8	[llvm-exegesis] Loop unrolling for loop snippet repetitor mode I really needed this, like, factually, yesterday, when verifying dependency breaking idioms for AMD Zen 3 scheduler model. Consider the following example: ``` $ ./bin/llvm-exegesis --mode=inverse_throughput --snippets-file=/tmp/snippet.s --num-repetitions=1000000 --repetition-mode=duplicate Check generated assembly with: /usr/bin/objdump -d /tmp/snippet-4a7e50.o --- mode: inverse_throughput key: instructions: - 'VPXORYrr YMM0 YMM0 YMM0' config: '' register_initial_values: [] cpu_name: znver3 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 1000000 measurements: - { key: inverse_throughput, value: 0.31025, per_snippet_value: 0.31025 } error: '' info: '' assembled_snippet: C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C3 ... ``` What does it tell us? So wait, it can only execute ~3 x86 AVX YMM PXOR zero-idioms per cycle? That doesn't seem right. That's even less than there are pipes supporting this type of op. Now, second example: ``` $ ./bin/llvm-exegesis --mode=inverse_throughput --snippets-file=/tmp/snippet.s --num-repetitions=1000000 --repetition-mode=loop Check generated assembly with: /usr/bin/objdump -d /tmp/snippet-2418b5.o --- mode: inverse_throughput key: instructions: - 'VPXORYrr YMM0 YMM0 YMM0' config: '' register_initial_values: [] cpu_name: znver3 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 1000000 measurements: - { key: inverse_throughput, value: 1.00011, per_snippet_value: 1.00011 } error: '' info: '' assembled_snippet: 49B80800000000000000C5FDEFC0C5FDEFC04983C0FF75F2C3 ... ``` Now that's just worse. Due to the looping, the throughput completely plummeted, and now we can only do a single instruction/cycle!? That's not great. 
And final example: ``` $ ./bin/llvm-exegesis --mode=inverse_throughput --snippets-file=/tmp/snippet.s --num-repetitions=1000000 --repetition-mode=loop --loop-body-size=1000 Check generated assembly with: /usr/bin/objdump -d /tmp/snippet-c402e2.o --- mode: inverse_throughput key: instructions: - 'VPXORYrr YMM0 YMM0 YMM0' config: '' register_initial_values: [] cpu_name: znver3 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 1000000 measurements: - { key: inverse_throughput, value: 0.167087, per_snippet_value: 0.167087 } error: '' info: '' assembled_snippet: 49B80800000000000000C5FDEFC0C5FDEFC04983C0FF75F2C3 ... ``` So if we merge the previous two approaches, do duplicate this single-instruction snippet 1000x (loop-body-size/instruction count in snippet), and run a loop with 1000 iterations over that duplicated/unrolled snippet, the measured throughput goes through the roof, up to 5.9 instructions/cycle, which finally tells us that this idiom is zero-cycle! Reviewed By: courbet Differential Revision: https://reviews.llvm.org/D102522	2021-05-25 12:08:27 +03:00
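The arithmetic behind the three llvm-exegesis measurements above can be checked with a short sketch. It assumes that, for a one-instruction snippet, the reported `inverse_throughput` value is cycles per instruction, so its reciprocal is instructions per cycle; the helper names and the way the loop shape is derived from `--num-repetitions` and `--loop-body-size` are illustrative assumptions, not llvm-exegesis code.

```python
# inverse_throughput values copied from the three runs quoted above.
MEASURED = {
    "duplicate": 0.31025,
    "loop": 1.00011,
    "loop, --loop-body-size=1000": 0.167087,
}


def instructions_per_cycle(inverse_throughput: float) -> float:
    """Reciprocal of cycles-per-instruction (single-instruction snippet assumed)."""
    return 1.0 / inverse_throughput


def unrolled_loop_shape(num_repetitions: int, loop_body_size: int,
                        snippet_instructions: int) -> tuple:
    """Return (copies of the snippet per loop body, loop iterations).

    Assumes the body size and repetition count divide evenly, as they do in
    the example from the commit message.
    """
    copies = loop_body_size // snippet_instructions
    iterations = num_repetitions // loop_body_size
    return copies, iterations


if __name__ == "__main__":
    for mode, value in MEASURED.items():
        print(f"{mode}: {instructions_per_cycle(value):.2f} instructions/cycle")
    print(unrolled_loop_shape(1_000_000, 1000, 1))
```

Under these assumptions the three modes come out at roughly 3.2, 1.0 and just under 6 instructions per cycle, and the unrolled run corresponds to 1000 copies of the snippet executed for 1000 iterations.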
Tony Tye	355114a753	[NFC][AMDGPU] Add documentation for AMD Instinct MI100 accelerator Add link to documentation for "AMD Instinct MI100 Instruction Set Architecture" to AMDGPUUsage.rst. Reviewed By: kzhuravl, rampitec, dp Differential Revision: https://reviews.llvm.org/D102859	2021-05-21 16:51:13 +00:00
Tony Tye	b408efe4ff	[NFC][AMDGPU] Mark C code in AMDGPUUsage.rst Reviewed By: foad Differential Revision: https://reviews.llvm.org/D102910	2021-05-21 10:08:05 +00:00
Andy Wingo	81bc732816	[IR][Verifier] Relax restriction on alloca address spaces In the WebAssembly target, we would like to allow alloca in two address spaces. The alloca instruction already has an address space argument, but the verifier asserts that the address space of an alloca is the default alloca address space from the datalayout. This patch removes this restriction. Targets that would like to impose additional restrictions should do so via target-specific verification passes. Differential Revision: https://reviews.llvm.org/D101045	2021-05-21 11:52:45 +02:00
Djordje Todorovic	b9076d119a	Recommit: "[Debugify][Original DI] Test dbg var loc preservation" [Debugify][Original DI] Test dbg var loc preservation This is an improvement of [0]. This adds checking of original llvm.dbg.values()/declares() instructions in optimizations. We have picked a real issue that has been found with this (actually, picked one variable location missing from [1] and resolved the issue), and the result is the fix for that -- D100844. Before applying the D100844, using the options from [0] (but with this patch applied) on the compilation of GDB 7.11, the final HTML report for the debug-info issues can be found at [1] (please scroll down, and look for "Summary of Variable Location Bugs"). After applying the D100844, the numbers have improved a bit -- please take a look into [2]. [0] https://llvm.org/docs/HowToUpdateDebugInfo.html#test-original-debug-info-preservation-in-optimizations [1] https://djolertrk.github.io/di-check-before-adce-fix/ [2] https://djolertrk.github.io/di-check-after-adce-fix/ Differential Revision: https://reviews.llvm.org/D100845 The Unit test was failing because the pass from the test that modifies the IR, in its runOnFunction() didn't return 'true', so the expensive-check configuration triggered an assertion.	2021-05-21 02:04:29 -07:00
Djordje Todorovic	0ae3c1d4d7	Revert "[Debugify][Original DI] Test dbg var loc preservation" This reverts commit `76f375f3d9`. This will be pushed again, after investigating a test failure: https://lab.llvm.org/buildbot/#/builders/16/builds/11254	2021-05-20 07:11:35 -07:00
Djordje Todorovic	76f375f3d9	[Debugify][Original DI] Test dbg var loc preservation This is an improvement of [0]. This adds checking of original llvm.dbg.values()/declares() instructions in optimizations. We have picked a real issue that has been found with this (actually, picked one variable location missing from [1] and resolved the issue), and the result is the fix for that -- D100844. Before applying the D100844, using the options from [0] (but with this patch applied) on the compilation of GDB 7.11, the final HTML report for the debug-info issues can be found at [1] (please scroll down, and look for "Summary of Variable Location Bugs"). After applying the D100844, the numbers have improved a bit -- please take a look into [2]. [0] https://llvm.org/docs/HowToUpdateDebugInfo.html\ [1] https://djolertrk.github.io/di-check-before-adce-fix/ [2] https://djolertrk.github.io/di-check-after-adce-fix/ Differential Revision: https://reviews.llvm.org/D100845	2021-05-20 06:42:02 -07:00
Ahmed Bougacha	c9dbaa4c86	[docs] Describe reporting security issues on the chromium tracker. To track security issues, we're starting with the chromium bug tracker (using the llvm project there). We considered using Github Security Advisories. However, they are currently intended as a way for project owners to publicize their security advisories, and aren't well-suited to reporting issues. This also moves the issue-reporting paragraph to the beginning of the document, in part to make it more discoverable, in part to allow the anchor-linking to actually display the paragraph at the top of the page. Note that this doesn't update the concrete list of security-sensitive areas, which is still an open item. When we do, we may want to move the list of security-sensitive areas next to the issue-reporting paragraph as well, as it seems like relevant information needed in the reporting process. Finally, when describing the discussion medium, this splits the topics discussed into two: the concrete security issues, discussed in the issue tracker, and the logistics of the group, in our mailing list, as patches on public lists, and in the monthly sync-up call. While there, add a SECURITY.md page linking to the relevant paragraph. Differential Revision: https://reviews.llvm.org/D100873	2021-05-19 15:21:50 -07:00
Vitaly Buka	c742d8d23c	[libfuzzer] Update doc mentioning removed flags.	2021-05-18 22:40:42 -07:00
Alex Orlov	4fedb3a613	[symbolizer] Added StartAddress for the resolved function. In many cases it is helpful to know at what address the resolved function starts. This patch adds a new StartAddress member to the DILineInfo structure. Reviewed By: jhenderson, dblaikie Differential Revision: https://reviews.llvm.org/D102316	2021-05-19 02:38:13 +04:00
Arthur Eubanks	b9d25cc921	[docs] Fix broken docs after `1c7f32334`	2021-05-18 14:38:12 -07:00
Arthur Eubanks	1c7f32334d	[TargetLowering] Only inspect attributes in the arguments for ArgListEntry Parameter attributes are considered part of the function [1], and like mismatched calling conventions [2], we can't have the verifier check for mismatched parameter attributes. This is a reland after fixing MSan issues in D102667. [1] https://llvm.org/docs/LangRef.html#parameter-attributes [2] https://llvm.org/docs/FAQ.html#why-does-instcombine-simplifycfg-turn-a-call-to-a-function-with-a-mismatched-calling-convention-into-unreachable-why-not-make-the-verifier-reject-it Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D101806	2021-05-18 14:30:22 -07:00
Konstantin Zhuravlyov	4e297dcd18	AMDGPU/Docs: Remove reserved MACH 0x3E (it is no longer reserved), sort MACHs by value	2021-05-18 16:57:56 -04:00
Ten Tzen	797ad70152	[Windows SEH]: HARDWARE EXCEPTION HANDLING (MSVC -EHa) - Part 1 This patch is the Part-1 (FE Clang) implementation of HW Exception handling. This new feature adds the support of Hardware Exception for Microsoft Windows SEH (Structured Exception Handling). This is the first step of this project; only X86_64 target is enabled in this patch. Compiler options: For clang-cl.exe, the option is -EHa, the same as MSVC. For clang.exe, the extra option is -fasync-exceptions, plus -triple x86_64-windows -fexceptions and -fcxx-exceptions as usual. NOTE:: Without the -EHa or -fasync-exceptions, this patch is a NO-DIFF change. The rules for C code: For C-code, one way (MSVC approach) to achieve SEH -EHa semantic is to follow three rules: * First, no exception can move in or out of _try region, i.e., no "potential faulty instruction" can be moved across _try boundary. * Second, the order of exceptions for instructions 'directly' under a _try must be preserved (not applied to those in callees). * Finally, global states (local/global/heap variables) that can be read outside of _try region must be updated in memory (not just in register) before the subsequent exception occurs. The impact to C++ code: Although SEH is a feature for C code, -EHa does have a profound effect on C++ side. When a C++ function (in the same compilation unit with option -EHa ) is called by a SEH C function, a hardware exception that occurs in C++ code can also be handled properly by an upstream SEH _try-handler or a C++ catch(...). As such, when that happens in the middle of an object's life scope, the dtor must be invoked the same way as C++ Synchronous Exception during unwinding process. Design: A natural way to achieve the rules above in LLVM today is to allow an EH edge added on memory/computation instruction (previous iload/istore idea) so that exception path is modeled in Flow graph precisely. 
However, tracking every single memory instruction and potentially faulting instruction can create many Invokes, complicate the flow graph and possibly result in a negative performance impact for downstream optimization and code generation. Making all optimizations aware of the new semantics is also a substantial effort. This design does not intend to model the exception path at instruction level. Instead, the proposed design tracks and reports EH state at BLOCK-level to reduce the complexity of the flow graph and minimize the performance impact on CPP code under the -EHa option. One key element of this design is the ability to compute the State number at block level. Our algorithm is based on the following rationales: A _try scope is always a SEME (Single Entry Multiple Exits) region as jumping into a _try is not allowed. The single entry must start with a seh_try_begin() invoke with a correct State number that is the initial state of the SEME. Through control flow, the state number is propagated into all blocks. Side exits marked by seh_try_end() will unwind to the parent state based on the existing SEHUnwindMap[]. Note side exits can ONLY jump into parent scopes (lower state number). Thus, when a block inherits various states from its predecessors, the lowest State trumps the others. If some exits flow to unreachable, propagation on those paths terminates, not affecting the remaining blocks. For CPP code, an object lifetime region is usually a SEME, like an SEH _try. However there is one rare exception: jumping into a lifetime that has a Dtor but no Ctor is warned about, but allowed: Warning: jump bypasses variable with a non-trivial destructor In that case, the region is actually a MEME (multiple entry multiple exits). Our solution is to inject an eha_scope_begin() invoke in the side entry block to ensure a correct State. Implementation: Part-1: Clang implementation described below. Two intrinsics are created to track CPP object scopes: eha_scope_begin() and eha_scope_end(). 
_scope_begin() is added immediately after ctor() is called and EHStack is pushed, so it must be an invoke, not a call. With that, it's also guaranteed that an EH-cleanup-pad is created regardless of whether there exists a call in this scope. _scope_end is added before dtor(). These two intrinsics make the computation of Block-State possible in the downstream code gen pass, even in the presence of ctor/dtor inlining. Two intrinsics, seh_try_begin() and seh_try_end(), are added for C code to mark the _try boundary and to prevent exceptions from being moved across the _try boundary. All memory instructions inside a _try are considered 'volatile' to ensure the 2nd and 3rd rules for C code above. This is slightly suboptimal, but it's acceptable as the amount of code directly under _try is very small. Part-2 (will be in the Part-2 patch): LLVM implementation described below. For both C++ and C code, the state of each block is computed at the same place in the BE (WinEHPreparing pass) where all other EH tables/maps are calculated. In addition to _scope_begin & _scope_end, the computation of block state also relies on the existing State tracking code (UnwindMap and InvokeStateMap). For both C++ and C code, the state of each block with a potential trap instruction is marked and reported in the DAG Instruction Selection pass, the same place where the state for -EHsc (synchronous exceptions) is done. If the first instruction in a reported block scope can trap, a Nop is injected before this instruction. This nop is needed to accommodate the LLVM Windows EH implementation, in which the address in the IPToState table is offset by +1 (note the purpose of that is to ensure the return address of a call is in the same scope as the call address). The handler for catch(...) for -EHa must handle HW exceptions, so its 'adjective' flag is reset (it cannot be IsStdDotDot (0x40), which only catches C++ exceptions). Suppress the push/popTerminate() scope (from noexcept/nothrow) so that HW exceptions can be passed through. 
Original llvm-dev [RFC] discussions can be found in these two threads below: https://lists.llvm.org/pipermail/llvm-dev/2020-March/140541.html https://lists.llvm.org/pipermail/llvm-dev/2020-April/141338.html Differential Revision: https://reviews.llvm.org/D80344/new/	2021-05-17 22:42:17 -07:00
Alex Zinenko	1417ddafdb	[llvm][doc] fix header for read/write_register intrinsics in LangRef Multi-line headers are not allowed in RST, reformat the header to be a single wide line.	2021-05-17 18:38:16 +02:00
Tim Northover	82a0e808bb	IR/AArch64/X86: add "swifttailcc" calling convention. Swift's new concurrency features are going to require guaranteed tail calls so that they don't consume excessive amounts of stack space. This would normally mean "tailcc", but there are also Swift-specific ABI desires that don't naturally go along with "tailcc" so this adds another calling convention that's the combination of "swiftcc" and "tailcc". Support is added for AArch64 and X86 for now.	2021-05-17 10:48:34 +01:00
Arthur Eubanks	341902672c	Revert "[TargetLowering] Only inspect attributes in the arguments for ArgListEntry" This reverts commit `16748bd2fb`. Causes https://crbug.com/1209013	2021-05-16 22:02:10 -07:00
Stanislav Mekhanoshin	6fb02596a2	[AMDGPU] Add support for architected flat scratch Add support for the readonly flat Scratch register initialized by the SPI. Differential Revision: https://reviews.llvm.org/D102432	2021-05-14 10:53:48 -07:00
Dmitry Preobrazhensky	434b278cde	[AMDGPU][MC][NFC][DOC] Updated AMD GPU assembler syntax description. Summary of changes: - added description of GFX90A; - minor bugfixing and improvements.	2021-05-14 16:13:30 +03:00
Tim Northover	ea0eec69f1	IR+AArch64: add a "swiftasync" argument attribute. This extends any frame record created in the function to include that parameter, passed in X22. The new record looks like [X22, FP, LR] in memory, and FP is stored with 0b0001 in bits 63:60 (CodeGen assumes they are 0b0000 in normal operation). The effect of this is that tools walking the stack should expect to see one of three values there: * 0b0000 => a normal, non-extended record with just [FP, LR] * 0b0001 => the extended record [X22, FP, LR] * 0b1111 => kernel space, and a non-extended record. All other values are currently reserved. If compiling for arm64e this context pointer is address-discriminated with the discriminator 0xc31a and the DB (process-specific) key. There is also an "i8** @llvm.swift.async.context.addr()" intrinsic providing front-ends access to this slot (and forcing its creation initialized to nullptr if necessary).	2021-05-14 11:43:58 +01:00
Pooja Yadav	4763c8c9e3	[docs] Added llvm/cmake section Added information about the cmake inside llvm. Reviewed By: xgupta, jroelofs Differential Revision: https://reviews.llvm.org/D101925	2021-05-14 14:10:56 +05:30
Arthur Eubanks	2155dc51d7	[IR] Introduce the opaque pointer type The opaque pointer type is essentially just a normal pointer type with a null pointee type. This also adds support for the opaque pointer type to the bitcode reader/writer, as well as to textual IR. To avoid confusion with existing pointer types, we disallow creating a pointer to an opaque pointer. Opaque pointer types should not be widely used at this point since many parts of LLVM still do not support them. The next steps are to add some very simple use cases of opaque pointers to make sure they work, then start pretending that all pointers are opaque pointers and see what breaks. https://lists.llvm.org/pipermail/llvm-dev/2021-May/150359.html Reviewed By: dblaikie, dexonsmith, pcc Differential Revision: https://reviews.llvm.org/D101704	2021-05-13 15:22:27 -07:00
Arthur Eubanks	772bdef6af	[docs] Add page on opaque pointer types Reviewed By: dblaikie, dexonsmith Differential Revision: https://reviews.llvm.org/D102292	2021-05-13 15:10:27 -07:00
Martin Storsjö	b42fb6811e	[llvm-nm] Support the -V option, print that the tool is compatible with GNU nm This unlocks some codepaths in libtool. Differential Revision: https://reviews.llvm.org/D102321	2021-05-13 22:36:25 +03:00
Aakanksha Patil	464e4dc50f	[AMDGPU] Add gfx1034 target Differential Revision: https://reviews.llvm.org/D102306	2021-05-13 14:25:18 -04:00
Krzysztof Parzyszek	2b20dee59b	Fix section title underlining in the release notes	2021-05-13 08:37:06 -05:00
Krzysztof Parzyszek	4dea348731	Add entry about Hexagon V68 support to the release notes	2021-05-13 08:28:55 -05:00
Shoaib Meenai	56f7e5a822	[cmake] Add support for multiple distributions LLVM's build system contains support for configuring a distribution, but it can often be useful to be able to configure multiple distributions (e.g. if you want separate distributions for the tools and the libraries). Add this support to the build system, along with documentation and usage examples. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D89177	2021-05-12 11:13:18 -07:00
Tony Tye	d6a228cba4	[NFC][AMDGPU] Correct product name for gfx908 The product name for gfx908 is "AMD Instinct MI100 Accelerator". Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D102209	2021-05-11 15:17:04 +00:00
Alex Orlov	05d1ae4e18	* Add support for JSON output style to llvm-symbolizer This patch adds JSON output style to llvm-symbolizer to better support CLI automation by providing a machine readable output. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D96883	2021-05-11 13:10:54 +04:00
Arthur Eubanks	16748bd2fb	[TargetLowering] Only inspect attributes in the arguments for ArgListEntry Parameter attributes are considered part of the function [1], and like mismatched calling conventions [2], we can't have the verifier check for mismatched parameter attributes. [1] https://llvm.org/docs/LangRef.html#parameter-attributes [2] https://llvm.org/docs/FAQ.html#why-does-instcombine-simplifycfg-turn-a-call-to-a-function-with-a-mismatched-calling-convention-into-unreachable-why-not-make-the-verifier-reject-it Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D101806	2021-05-10 12:35:11 -07:00
gbreynoo	2aa5f9b45a	[llvm-symbolizer] Update Command Guide The option --use-symbol-table is now a noop and does not appear in the help text, however it still appears in the command guide. This change removes it from the command guide and updates the description of --output-style . Differential Revision: https://reviews.llvm.org/D102078	2021-05-10 17:21:34 +01:00
Fraser Cormack	2e0ee68dc8	[LangRef][VP] Fix typos in VP sdiv/udiv examples	2021-05-06 16:37:18 +01:00
Matt Arsenault	e723b511e6	GlobalISel: Update documentation	2021-05-05 17:35:02 -04:00
Pooja Yadav	0b9447157b	[docs] Update the llvm/example section Added details about the llvm/example section. Reviewed By: xgupta Differential Revision: https://reviews.llvm.org/D101284	2021-05-05 21:33:14 +05:30
Sushma Unnibhavi	e4eec51937	[DOCS] Added example for G_EXTRACT and G_INSERT Reviewed By: xgupta, gargaroff Differential Revision: https://reviews.llvm.org/D101227	2021-05-05 15:47:35 +05:30
Fangrui Song	e510860656	[llvm-objdump] Add -M {att,intel} & deprecate --x86-asm-syntax={att,intel} The internal `cl::opt` option --x86-asm-syntax sets the AsmParser and AsmWriter dialect. The option is used by llc and llvm-mc tests to set the AsmWriter dialect. This patch adds -M {att,intel} as GNU objdump compatible aliases (PR43413). Note: the dialect is initialized when the MCAsmInfo is constructed. `MCInstPrinter::applyTargetSpecificCLOption` is called too late and its MCAsmInfo reference is const, so changing the `cl::opt` in `MCInstPrinter::applyTargetSpecificCLOption` is not an option, at least without large amount of refactoring. Reviewed By: hoy, jhenderson, thakis Differential Revision: https://reviews.llvm.org/D101695	2021-05-05 00:20:41 -07:00
Alina Sbirlea	b14c8f5f6e	Add cal entry for MemorySSA syncs.	2021-05-04 12:56:06 -07:00
Alina Sbirlea	974ff623aa	Add monthly MemorySSA sync.	2021-05-04 11:23:36 -07:00
Arthur Eubanks	0172b1389e	[docs] Fix some wording	2021-05-04 10:21:38 -07:00
gbreynoo	3273f27692	[llvm-objdump] Remove --cfg option from command guide The llvm-objdump command guide has the option --cfg which was removed from the tool by `888320e9fa` in 2014. This change updates the command guide to reflect this. Differential Revision: https://reviews.llvm.org/D101648	2021-05-04 16:42:13 +01:00
Fraser Cormack	2d480abd9a	[LangRef] Fix a typo in the vector-type memory layout section	2021-05-04 15:40:53 +01:00
Arthur Eubanks	9779b664b6	[docs][NewPM] Add section on analyses Reviewed By: asbirlea, ychen Differential Revision: https://reviews.llvm.org/D100912	2021-05-03 10:15:02 -07:00
Christian Kühnel	91607dce61	[doc] typo fixes as proposed by @FlashSheridan in https://reviews.llvm.org/rG7f9717b922d4	2021-05-03 10:59:51 +02:00
Nick Desaulniers	dde24a87c5	[llvm-objdump] add -v alias for --version Used by the Linux kernel's CONFIG_X86_DECODER_SELFTEST. Link: https://github.com/ClangBuiltLinux/linux/issues/1130 Reviewed By: MaskRay, jhenderson, rupprecht Differential Revision: https://reviews.llvm.org/D101483	2021-04-30 11:26:36 -07:00
Pooja Yadav	cfb95f6f91	[docs]Added llvm/bindings section Added information about language bindings provided by LLVM. Reviewed By: xgupta, gandhi21299 Differential Revision: https://reviews.llvm.org/D101295	2021-04-30 19:05:22 +05:30
Jonas Devlieghere	625bd94c6d	[dsymutil] Add flag to force a static variable to keep its enclosing function Add a flag to change dsymutil's behavior and force a static variable to keep its enclosing function. The test shows a situation where that could be useful. I'm not convinced this behavior makes sense as a default, which is why it's behind a flag. rdar://74918374 Differential revision: https://reviews.llvm.org/D101337	2021-04-28 11:33:04 -07:00
Paul C. Anagnostopoulos	952c6ddd8b	[TableGen] Add the !find bang operator !find searches a source string for a target string and returns the position. Differential Revision: https://reviews.llvm.org/D101318	2021-04-28 09:51:00 -04:00
Ahmed Bougacha	6a2e298517	[docs] Replace Apple representative to security group. Differential Revision: https://reviews.llvm.org/D100864	2021-04-27 11:00:49 -07:00
Christian Kühnel	4dc6763289	[doc] added documentation for pre-merge testing fixes https://github.com/google/llvm-premerge-checks/issues/275 Differential Revision: https://reviews.llvm.org/D100936	2021-04-27 16:53:16 +02:00
Pooja Yadav	0764c8af76	[Docs] Updated LLVM_TARGETS_TO_BUILD section in GettingStarted.rst Updated LLVM_TARGETS_TO_BUILD under https://llvm.org/docs/GettingStarted.html#local-llvm-configuration. Differential Revision: https://reviews.llvm.org/D101101	2021-04-24 00:31:43 +05:30
Paul C. Anagnostopoulos	d9187f50b9	[TableGen] [docs] Improve BNF for the 'multiclass' statement [NFC]	2021-04-23 12:05:52 -04:00
Paul C. Anagnostopoulos	6a067cdb06	[TableGen] [docs] Improve description of NAME in Programmer's Reference Also use "parent class" consistently and add a note about the term. Differential Revision: https://reviews.llvm.org/D100867	2021-04-23 09:49:17 -04:00
Thomas Preud'homme	2fdedf905a	[doc] Clarify constrained fcmps behavior Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D101053	2021-04-23 11:55:20 +01:00
Fangrui Song	2786e673c7	[IR][sanitizer] Add module flag "frame-pointer" and set it for cc1 -mframe-pointer={non-leaf,all} The Linux kernel objtool diagnostic `call without frame pointer save/setup` arises in multiple instrumentation passes (asan/tsan/gcov). With the mechanism introduced in D100251, it's trivial to respect the command line -m[no-]omit-leaf-frame-pointer/-f[no-]omit-frame-pointer, so let's do it. Fix: https://github.com/ClangBuiltLinux/linux/issues/1236 (tsan) Fix: https://github.com/ClangBuiltLinux/linux/issues/1238 (asan) Also document the function attribute "frame-pointer", which is long overdue. Differential Revision: https://reviews.llvm.org/D101016	2021-04-22 18:07:30 -07:00
Keith Smiley	86b98c60c5	llvm-objdump: add --rpaths to macho support This prints the rpaths for the given binary Reviewed By: kastiglione Differential Revision: https://reviews.llvm.org/D100681	2021-04-22 16:01:10 -07:00
Evgeniy Brevnov	b9e9e2eef1	Wordsmith the semantics of invariant.load Don't phrase the semantics in terms of the optimizer. Instead have a more straightforward execution based semantic. Reviewed By: ebrevnov Differential Revision: https://reviews.llvm.org/D63439	2021-04-22 10:06:13 +07:00
Christian Kühnel	cf61cf0724	[NFC] fixed link in documentation	2021-04-21 10:17:03 +02:00
Christian Kühnel	7f9717b922	added section on CI system Add documentation for working with the CI systems. This is based on the discussion in the Infrastructure Working Group: https://github.com/ChristianKuehnel/iwg-workspace/issues/37 Differential Revision: https://reviews.llvm.org/D97389	2021-04-21 09:59:41 +02:00
David Sherwood	eecb4b478f	[Docs] Fix formatting issue for llvm.experimental.stepvector in LangRef The llvm.experimental.stepvector section was missing the '^^^' line underneath the intrinsic name.	2021-04-21 08:42:40 +01:00
Nico Weber	1a3f88658a	[llvm-objdump] Add an llvm-otool tool This implements an LLVM tool that's flag- and output-compatible with macOS's `otool` -- except for bugs, but from testing with both `otool` and `xcrun otool-classic`, llvm-otool matches vanilla otool's behavior very well already. It's not 100% perfect, but it's a very solid start. This uses the same approach as llvm-objcopy: llvm-objdump uses a different OptTable when it's invoked as llvm-otool. This is possible thanks to D100433. Differential Revision: https://reviews.llvm.org/D100583	2021-04-20 08:24:58 -04:00
Luo, Yuanke	519cf6e807	[X86][AMX] Add description of x86_amx to LangRef. Differential Revision: https://reviews.llvm.org/D100032	2021-04-20 14:29:17 +08:00
xgupta	a637b8eac0	[Docs] Mention LLVM_EXPERIMENTAL_TARGETS_TO_BUILD variable in CMake.rst Beginners who want to try a new experimental target might not be aware of this variable. Although the variable is mentioned in the Writing a Backend documentation, it is easier to find when listed in the cmake.rst doc, where most variables are listed. Reviewed By: myhsu Differential Revision: https://reviews.llvm.org/D100729	2021-04-20 09:27:57 +05:30
Paul C. Anagnostopoulos	a5aaec8f4e	[TableGen] Add support for the 'assert' statement in multiclasses This is step 3 of adding the 'assert' statement. Differential Revision: https://reviews.llvm.org/D99751	2021-04-19 09:01:42 -04:00
xgupta	2cb8ec8f38	[Docs] Correct Boehm collector weblink in GarbageCollection.rst	2021-04-18 17:30:17 +05:30
Philip Reames	ff55d01a8e	[nofree] Restrict semantics to memory visible to caller This patch clarifies the semantics of the nofree function attribute to make clear that it provides an "as if" semantic. That is, a nofree function is guaranteed not to free memory which existed before the call, but might allocate and then deallocate that same memory within the lifetime of the callee. This is the result of the discussion on llvm-dev under the thread "Ambiguity in the nofree function attribute". The most important part of this change is the LangRef wording. The rest is minor comment changes to emphasize the new semantics where code was accidentally consistent, and fix one place which wasn't consistent. That one place is currently narrowly used as it is primarily part of the ongoing (and not yet enabled) deref-at-point semantics work. Differential Revision: https://reviews.llvm.org/D100141	2021-04-16 11:38:55 -07:00
Kristof Beyls	a7bbd670aa	[docs] Add Pointer Authentication call info	2021-04-16 15:18:21 +02:00
Simon Moll	fda078bffb	[docs] Add vector predication call Add the syncup call to the table Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100474	2021-04-16 10:49:34 +02:00
Juneyoung Lee	085423282d	[LangRef] formatting	2021-04-16 10:41:30 +09:00
Juneyoung Lee	25e96dffac	[LangRef] fix unexpected unindent error	2021-04-16 09:58:55 +09:00
Juneyoung Lee	1bcadb0984	[LangRef] clarify the semantics of nocapture This patch clarifies the semantics of nocapture attribute. A 'Pointer Capture' subsection is added to describe the semantics of pointer capture first. For the nocapture example with two same pointer arguments, it is consistent with the semantics that Alive2 used to run lit tests. Reviewed By: nlopes Differential Revision: https://reviews.llvm.org/D97924	2021-04-16 09:48:42 +09:00
Jon Roelofs	0bae93771d	s/setGenerator/addGenerator/ in the JIT docs. NFC	2021-04-15 15:54:28 -07:00
Momchil Velikov	f9d932e673	[clang][AArch64] Correctly align HFA arguments when passed on the stack When we pass an AArch64 Homogeneous Floating-Point Aggregate (HFA) argument with increased alignment requirements, for example struct S { __attribute__ ((__aligned__(16))) double v[4]; }; Clang uses `[4 x double]` for the parameter, which is passed on the stack at alignment 8, whereas it should be at alignment 16, following Rule C.4 in AAPCS (https://github.com/ARM-software/abi-aa/blob/master/aapcs64/aapcs64.rst#642parameter-passing-rules) Currently we don't have a way to express in LLVM IR the alignment requirements of the function arguments. The align attribute is applicable to pointers only, and only for some special ways of passing arguments (e.g. byval). When implementing AAPCS32/AAPCS64, clang resorts to dubious hacks of coercing to types, which naturally have the needed alignment. We don't have enough types to cover all the cases, though. This patch introduces a new use of the stackalign attribute to control stack slot alignment, when and if an argument is passed in memory. The attribute align is left as an optimizer hint - it still applies to pointer types only and pertains to the content of the pointer, whereas the alignment of the pointer itself is determined by the stackalign attribute. For byval arguments, the stackalign attribute assumes the role previously performed by align, falling back to align if stackalign is absent. On the clang side, when passing arguments using the "direct" style (cf. `ABIArgInfo::Kind`), now we can optionally specify an alignment, which is emitted as the new `stackalign` attribute. Patch by Momchil Velikov and Lucas Prates. Differential Revision: https://reviews.llvm.org/D98794	2021-04-15 22:58:14 +01:00
Paul C. Anagnostopoulos	9345f9fa5d	[TableGen] [docs] Correct a reference in the TableGen Overview document Differential Revision: https://reviews.llvm.org/D100382	2021-04-15 09:25:09 -04:00
Kostya Kortchinsky	4acdac081d	[docs][scudo] Update Scudo documentation Update the Scudo document to align with the standalone version. Add some more verbiage about the various components of the allocator, and rework things a bit. The build instructions have been updated. The options and their default values have been updated, and the `mallopt` ones have been added. Differential Revision: https://reviews.llvm.org/D100230	2021-04-13 08:41:56 -07:00
Gulfem Savrun Yeniceri	e96df3e531	[Passes] Add relative lookup table converter pass Lookup tables generate non PIC-friendly code, which requires dynamic relocation as described in: https://bugs.llvm.org/show_bug.cgi?id=45244 This patch adds a new pass that converts lookup tables to relative lookup tables to make them PIC-friendly. Differential Revision: https://reviews.llvm.org/D94355	2021-04-13 01:29:41 +00:00
Kristof Beyls	28dc50c4b7	[docs] Add Windows/COFF call info	2021-04-12 17:11:25 +02:00
Sushma Unnibhavi	002c6c1187	Typo fix Reviewed By: dsanders Differential Revision: https://reviews.llvm.org/D100254	2021-04-11 12:24:27 +05:30
Sushma Unnibhavi	e8b0542078	Missing syntax highlighting for LLVM IR in Langref Added syntax highlighting Differential Revision: https://reviews.llvm.org/D100125	2021-04-11 12:19:58 +05:30
Paul C. Anagnostopoulos	175b8819f2	[TableGen] [docs] Change title of tblgen.rst to fix man page filename	2021-04-09 09:37:56 -04:00
Konstantin Zhuravlyov	4fae63c612	AMDGPU: Add gfx90c support to code object v2 for backwards compatibility Differential Revision: https://reviews.llvm.org/D100126	2021-04-08 16:42:43 -04:00
Paul C. Anagnostopoulos	3f919ff250	Revert "[TableGen] Add support for the 'assert' statement in multiclasses" This reverts commit `3b9a15d910`.	2021-04-08 13:58:58 -04:00
Paul C. Anagnostopoulos	3b9a15d910	[TableGen] Add support for the 'assert' statement in multiclasses	2021-04-08 08:36:03 -04:00
Philip Reames	0918f44e26	[docs] Document our norms around reverts This has come up a few times recently, and I was surprised to notice that we don't have anything in the docs. This patch deliberately sticks to stuff that is uncontroversial in the community. Everything herein is thought to be widely agreed to by a large majority of the community. A few things were noted and removed in review which failed this standard, if you spot anything else, please point it out. Differential Revision: https://reviews.llvm.org/D99305	2021-04-07 21:02:19 -07:00
Tony Tye	2e9465ce2e	[NFC][AMDGPU] Correct indentation in AMDGPUUsage.rst Correct indentation that results in rST syntax error.	2021-04-08 01:00:13 +00:00
Tony Tye	4658cd4c18	[AMDGPU] Update gfx90a memory model support Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D100070	2021-04-07 22:17:58 +00:00
Paul C. Anagnostopoulos	13a84f21d7	[TableGen] [docs] Correct a couple of mistakes; use 'true' and 'false' in examples Differential Revision: https://reviews.llvm.org/D99800	2021-04-05 09:15:58 -04:00
Nikita Popov	665065821e	[FastISel] Remove kill tracking This is a followup to D98145: As far as I know, tracking of kill flags in FastISel is just a compile-time optimization. However, I'm not actually seeing any compile-time regression when removing the tracking. This probably used to be more important in the past, before FastRA was switched to allocate instructions in reverse order, which means that it discovers kills as a matter of course. As such, the kill tracking doesn't really seem to serve a purpose anymore, and just adds additional complexity and potential for errors. This patch removes it entirely. The primary changes are dropping the hasTrivialKill() method and removing the kill arguments from the emitFast methods. The rest is mechanical fixup. Differential Revision: https://reviews.llvm.org/D98294	2021-04-03 15:50:13 +02:00
Paul C. Anagnostopoulos	7f7f5e2543	[TableGen] [Docs] Add lldb-tblgen to command guide; add 4 guide stubs Differential Revision: https://reviews.llvm.org/D99605	2021-04-02 09:52:16 -04:00
Tony	4c70f56ec6	[NFC][AMDGPU] Add product names for gfx908 and gfx10 processors Reviewed By: msearles Differential Revision: https://reviews.llvm.org/D99781	2021-04-02 00:58:11 +00:00
Jon Roelofs	58cbb222eb	[docs] Fix up dead clang-format links after monorepo move. NFC	2021-03-30 14:29:35 -07:00
oToToT	1363fb8ca6	[Docs] Update googletest docs link. The Google Test documentation on GitHub has been moved to the top-level docs directory, so the original link is now invalid. Reviewed By: Pavel Labath Differential Revision: https://reviews.llvm.org/D99559	2021-03-30 23:20:23 +08:00
Krasimir Georgiev	c51e91e046	Revert "[Passes] Add relative lookup table converter pass" This reverts commit `5178ffc7cf`. Compiling `llvm-profdata` with a compiler built from this produces a crashing binary.	2021-03-30 14:13:37 +02:00
Nuno Lopes	ad613b1497	[docs] remove references to checking out svn repos	2021-03-30 10:00:31 +01:00
Tim Renouf	083b0f1b40	[AMDGPU] Update AMDGPU PAL usage documentation Change-Id: I65f3edcfe5063551cad5aab0da1374c3a6ccd3a2	2021-03-30 08:33:18 +01:00
Gulfem Savrun Yeniceri	5178ffc7cf	[Passes] Add relative lookup table converter pass Lookup tables generate non PIC-friendly code, which requires dynamic relocation as described in: https://bugs.llvm.org/show_bug.cgi?id=45244 This patch adds a new pass that converts lookup tables to relative lookup tables to make them PIC-friendly. Differential Revision: https://reviews.llvm.org/D94355	2021-03-29 21:53:32 +00:00
Paul C. Anagnostopoulos	5f473a04af	[TableGen] Add support for the 'assert' statement in class definitions. Differential Revision: https://reviews.llvm.org/D99275	2021-03-29 09:20:29 -04:00
Matt Arsenault	9a0c9402fa	Reapply "OpaquePtr: Turn inalloca into a type attribute" This reverts commit `07e46367ba`.	2021-03-29 08:55:30 -04:00
Oliver Stannard	07e46367ba	Revert "Reapply "OpaquePtr: Turn inalloca into a type attribute"" Reverting because test 'Bindings/Go/go.test' is failing on most buildbots. This reverts commit `fc9df30991`.	2021-03-29 11:32:22 +01:00
Matt Arsenault	fc9df30991	Reapply "OpaquePtr: Turn inalloca into a type attribute" This reverts commit `20d5c42e0e`.	2021-03-28 13:35:21 -04:00
Nico Weber	20d5c42e0e	Revert "OpaquePtr: Turn inalloca into a type attribute" This reverts commit `4fefed6563`. Broke check-clang everywhere.	2021-03-28 13:02:52 -04:00
Zakk Chen	821547cabb	[RISCV][Clang] Update new overloading rules for RVV intrinsics. RVV intrinsics have a new overloading rule; please see `82aac7dad4`. Changed: 1. Rename `generic` to `overloaded` because the new rule does not use C11 generic. 2. Change HasGeneric to HasNoMaskedOverloaded because all masked operations support the overloading API. 3. Add more overloaded tests because the overloading rule changed. Differential Revision: https://reviews.llvm.org/D99189	2021-03-28 09:04:35 -07:00
Matt Arsenault	4fefed6563	OpaquePtr: Turn inalloca into a type attribute I think byval/sret and the others are close to being able to rip out the code to support the missing type case. A lot of this code is shared with inalloca, so catch this up to the others so that can happen.	2021-03-28 11:12:23 -04:00
George Burgess IV	5079bc8a23	docs: Adding Google representative to the security group This adds me as a Google representative for the LLVM security group. This was proposed, discussed, and voted on in the differential revision linked below; please see it for more information. Differential Revision: https://reviews.llvm.org/D99232	2021-03-26 18:55:37 -07:00
Tony	850fcedb27	[NFC][AMDGPU] Corrections to AMD GPU initial kernel launch documentation Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D99223	2021-03-26 02:05:45 +00:00
Amara Emerson	55533203d7	[GlobalISel] Add G_ROTR and G_ROTL opcodes for rotates. Differential Revision: https://reviews.llvm.org/D99383	2021-03-25 17:23:30 -07:00
Djordje Todorovic	8420a53324	[Debugify] Expose original debug info preservation check as CC1 option In order to test the preservation of the original Debug Info metadata in your projects, a front end option could be very useful, since users usually report that a concrete entity (e.g. variable x, or function fn2()) is missing debug info. The [0] is an example of running the utility on GDB Project. This depends on: D82546 and D82545. Differential Revision: https://reviews.llvm.org/D82547	2021-03-25 05:29:42 -07:00

8904 Commits