llvm-project

Commit Graph

Author	SHA1	Message	Date
Tony	ce74e97d9b	[AMDGPU] Correct missing sram-ecc target feature for gfx906 Differential Revision: https://reviews.llvm.org/D85476	2020-08-06 22:12:25 +00:00
Stanislav Mekhanoshin	ea7d0e2996	[AMDGPU] gfx1031 target Differential Revision: https://reviews.llvm.org/D85337	2020-08-05 12:36:26 -07:00
Tony	e24f5f3149	[AMDGPU] DWARF proposal changes - Clarify that these are extensions to DWARF 5 and not as yet a proposal. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D70523	2020-07-30 05:07:09 +00:00
Tony	5aa2fd88cf	[AMDGPU] DWARF proposal changes for expression context - Clarify what context is used in DWARF expression evaluation. - Define location descriptions to fully resolve the context and so include the context in their result. - As a consequence of location descriptions being fully resolved, change address spaces so only a swizzled and unswizzled private address space is defined. The lane is now part of the location description context. - Clarify how call frame information is used to fully resolve expressions that specify registers. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D70523	2020-07-30 01:59:22 +00:00
Matt Arsenault	31f4e43f3f	AMDGPU: Remove .value_type from kernel metadata This doesn't appear used for anything, and is emitted incorrectly based on the description. This also depends on the IR type, and pointee element type.	2020-07-10 18:16:31 -04:00
Tony	76b2d9cbeb	[AMDGPU] Correct AMDGPUUsage.rst DW_AT_LLVM_lane_pc example - Correct typo of DW_OP_xaddr to DW_OP_addrx in AMDGPUUsage.rst for DW_AT_LLVM_lane_pc example. Change-Id: I1b0ee2b24362a0240388e4c2f044c1d4883509b9	2020-07-01 08:23:15 +00:00
Tony	990f8702c9	[AMDGPU] Define DWARF encoding for condition code registers Summary: - Define DWARF register numbers for vector and scalar condition codes. - Document intended purpose of reserved DWARF register numbers. Reviewers: yaxunl, kzhuravl, arsenm, rampitec, b-sumner Subscribers: jvesely, wdng, nhaehnle, aprantl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82519	2020-06-26 17:53:55 -04:00
Tony	ea6df2fb8f	[AMDGPU] Update AMD GPU processor information Summary: - Add product names for some processors. - Correct XNACK support for a processor. Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82348	2020-06-23 18:47:56 -04:00
Matt Arsenault	ae5adb8da5	AMDGPU: Update private null pointer value in documentation Private pointers used to work around IR semantics by artificially reserving an object at offset 0 so no user object would be allocated there. Since alloca now uses a non-0 address space, that workaround is unnecessary and 0 can be treated as a valid pointer.	2020-06-18 17:27:19 -04:00
Stanislav Mekhanoshin	9ee272f13d	[AMDGPU] Add gfx1030 target Differential Revision: https://reviews.llvm.org/D81886	2020-06-15 16:18:05 -07:00
madhur13490	bca413b036	Fix a typo in AMDGPU docs Reviewers: t-tye, arsenm Reviewed By: arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81247	2020-06-05 13:30:17 +00:00
Tony	7318e24000	[AMDGPU] Add loaded code object path URI definition to AMDGPUUsage Differential Revision: https://reviews.llvm.org/D80407	2020-05-29 19:52:52 -04:00
Tony	e36be90c82	[AMDGPU] Correct formatting typos in documentation Summary: - Correct missing space in some "note" and "TODO" directives in AMDGPUUsage.rst - Correct warning for heading underline being too short in BitCodeFormat.rst Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80407	2020-05-21 20:36:46 -04:00
Jinsong Ji	628f008b20	[docs] Fix buildbot failures Buildbot has been failing since http://lab.llvm.org:8011/builders/llvm-sphinx-docs/builds/44711 This patch fixes the minor issues that cause warnings.	2020-05-21 22:07:33 +00:00
Christudasan Devadasan	7c4e711ef8	[AMDGPU] Enable base pointer. When the callee requires a dynamic stack realignment, it is not possible to correctly access the incoming stack arguments using the stack pointer. We reserve a base pointer in such cases to access the function arguments inside the callee. The base pointer will hold the incoming stack pointer value before any kind of delta added to it. Reviewed By: arsenm, scott.linder Differential Revision: https://reviews.llvm.org/D78811	2020-05-17 16:13:55 +05:30
Christudasan Devadasan	375cec4b6c	[AMDGPU] Introduce more scratch registers in the ABI. The AMDGPU target has a convention that defined all VGPRs (except the initial 32 argument registers) as callee-saved. This convention is not efficient always, esp. when the callee requiring more registers, ended up emitting a large number of spills, even though its caller requires only a few. This patch revises the ABI by introducing more scratch registers that a callee can freely use. The 256 vgpr registers now become: 32 argument registers 112 scratch registers and 112 callee saved registers. The scratch registers and the CSRs are intermixed at regular intervals (a split boundary of 8) to obtain a better occupancy. Reviewers: arsenm, t-tye, rampitec, b-sumner, mjbedy, tpr Reviewed By: arsenm, t-tye Differential Revision: https://reviews.llvm.org/D76356	2020-05-05 23:02:58 +05:30
Kazuaki Ishizaki	0312b9f550	[llvm] NFC: Fix trivial typo in rst and td files Differential Revision: https://reviews.llvm.org/D77469	2020-04-23 14:26:32 +09:00
Tony	1eac2c55d8	[AMDGPU] Move DWARF proposal to separate file - Move DWARF proposal for heterogeneous debugging to a separate file. - Add references. Differential Revision: https://reviews.llvm.org/D70523	2020-04-15 17:19:39 -04:00
Tony	b436124010	[AMDGPU] Update DWARF proposal - Unify the sections on DWARF expression and location lists. - Allow a location description to have one or more single location descriptions. - Define context of DWARF expression that includes an initial stack. Allow initial stack to be used when evaluating location list expression with overlapping PC ranges. - Reorganize the DWARF proposal in AMDGPUUsage so suitable for submission to the DWARF site. - Replace CFI instruction DW_CFA_LLVM_def_cfa_aspace with DW_CFA_def_aspace_cfa and DW_CFA_def_aspace_cfa_sf. This is to avoid the problem that DW_CFA_def_cfa and DW_CFA_def_cfa_sf cannot use a register that is not the size of an address in the CFA address space. - Clarify DWARF address class and DWARF address space. Define language values for DWARF address classes and specify how they are used by some common source languages. - Define rules for accessing registers and dereferencing memory when the type size and register size or byte size operand do not match. - Numerous cleanups for consistency. Differential Revision: https://reviews.llvm.org/D70523	2020-04-14 20:05:15 -04:00
Sylvestre Ledru	72fd1033ea	Doc: Links should use https	2020-03-22 22:49:33 +01:00
Scott Linder	0e9368cc8c	[AMDGPU] Move frame pointer from s34 to s33 Remove the gap left between the stack pointer (s32) and frame pointer (s34) now that the scratch wave offset is no longer a part of the calling convention ABI. Update llvm/docs/AMDGPUUsage.rst to reflect the change. Tags: #llvm Differential Revision: https://reviews.llvm.org/D75657	2020-03-19 15:35:16 -04:00
Scott Linder	60b1967c39	[AMDGPU] Add Scratch Wave Offset to Scratch Buffer Descriptor in entry functions Add the scratch wave offset to the scratch buffer descriptor (SRSrc) in the entry function prologue. This allows us to remove the scratch wave offset register from the calling convention ABI. As part of this change, allow the use of an inline constant zero for the SOffset of MUBUF instructions accessing the stack in entry functions when a frame pointer is not requested/required. Entry functions with calls still need to set up the calling convention ABI stack pointer register, and reference it in order to address arguments of called functions. The ABI stack pointer register remains unswizzled, but is now wave-relative instead of queue-relative. Non-entry functions also use an inline constant zero SOffset for wave-relative scratch access, but continue to use the stack and frame pointers as before. When the stack or frame pointer is converted to a swizzled offset it is now scaled directly, as the scratch wave offset no longer needs to be subtracted first. Update llvm/docs/AMDGPUUsage.rst to reflect these changes to the calling convention. Tags: #llvm Differential Revision: https://reviews.llvm.org/D75138	2020-03-19 15:35:16 -04:00
Tony	788e74ce29	[AMDGPU] AMDGPUUsage define call convention ABI Reviewers: scott.linder, arsenm, b-sumner Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74861	2020-02-19 15:56:19 -05:00
Tony	f5678d4a6a	[AMDGPU] Update AMDGPUUsage with DWARF proposal Summary: - Add AMDGPU DWARF proposal. - Add references for gfx10 ISA and SemVer. Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, aprantl, dstuttard, tpr, jfb, dmgreen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70523	2020-02-19 15:30:53 -05:00
Dmitry Preobrazhensky	2de2275cbd	[AMDGPU][MC][DOC] Updated AMD GPU assembler syntax description. Summary of changes: - updated description of gfx906 and gfx908; - added description of gfx1011 and gfx1012 subtargets.	2020-02-07 16:23:46 +03:00
Hans Wennborg	e334a3a60f	[docs] NFC: Fix typos in documents "the the" -> "the" "an" -> "a" Patch by Kazuaki Ishizaki <ishizaki@jp.ibm.com>! Differential revision: https://reviews.llvm.org/D72091	2020-01-07 16:06:14 +01:00
Dmitry Preobrazhensky	80c45e49c3	[AMDGPU][MC][DOC] Updated AMD GPU assembler syntax description. Summary of changes: - added description of GFX9 subtargets: - gfx900; - gfx902; - gfx904; - gfx906; - gfx908; - gfx909.	2019-12-25 17:51:53 +03:00
Tony	7a54f727a2	[AMDGPU] AMDGPUUsage clarify address space information and other typo and formatting fixes Summary: - Clarify AMDGPU address spaces. - Correct path to AMDGPU backend since now in the mono-repo. - Fix numerous text style and typo issues. - Correct reStructure text formatting warnings. - Made reStructure directive usage more consistent. - Add references for gfx10 ISA specification. Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71392	2019-12-12 14:51:27 -05:00
Nico Weber	761dd780ea	Fix a few doc typos, to cycle bots.	2019-12-08 18:51:48 -05:00
Sameer Sahasrabuddhe	52c5014da0	[AMDGPU] add support for hostcall buffer pointer as hidden kernel argument Hostcall is a service that allows a kernel to submit requests to the host using shared buffers, and block until a response is received. This will eventually replace the shared buffer currently used for printf, and repurposes the same hidden kernel argument. This change introduces a new ValueKind in the HSA metadata to represent the hostcall buffer. Differential Revision: https://reviews.llvm.org/D70038	2019-11-20 15:53:55 +05:30
Stanislav Mekhanoshin	22b2c3d651	[AMDGPU] gfx908 target Differential Revision: https://reviews.llvm.org/D64429 llvm-svn: 365525	2019-07-09 18:10:06 +00:00
Dmitry Preobrazhensky	463b87ae88	[AMDGPU][MC][DOC] Updated AMD GPU assembler syntax description. Corrected a typo. llvm-svn: 365353	2019-07-08 17:09:09 +00:00
Dmitry Preobrazhensky	cef9d42157	[AMDGPU][MC][DOC] Updated AMD GPU assembler syntax description. Summary of changes: - added description of GFX10; - added description of operands sccz, vccz, lds_direct, etc; - minor bugfixing and improvements. llvm-svn: 365347	2019-07-08 16:50:11 +00:00
Yaxun Liu	a62413526d	[AMDGPU] Added a new metadata for multi grid sync implicit argument Patch by Christudasan Devadasan. Differential Revision: https://reviews.llvm.org/D63886 llvm-svn: 365217	2019-07-05 16:05:17 +00:00
Nicolai Haehnle	08e8cb5760	AMDGPU/MC: Add .amdgpu_lds directive Summary: The directive defines a symbol as a group/local memory (LDS) symbol. LDS symbols behave similar to common symbols for the purposes of ELF, using the processor-specific SHN_AMDGPU_LDS as section index. It is the linker and/or runtime loader's job to "instantiate" LDS symbols and resolve relocations that reference them. It is not possible to initialize LDS memory (not even zero-initialize as for .bss). We want to be able to link together objects -- starting with relocatable objects, but possibly expanding to shared objects in the future -- that access LDS memory in a flexible way. LDS memory is in an address space that is entirely separate from the address space that contains the program image (code and normal data), so having program segments for it doesn't really make sense. Furthermore, we want to be able to compile multiple kernels in a compilation unit which have disjoint use of LDS memory. In that case, we may want to place LDS symbols differently for different kernels to save memory (LDS memory is very limited and physically private to each kernel invocation), so we can't simply place LDS symbols in a .lds section. Hence this solution where LDS symbols always stay undefined. Change-Id: I08cbc37a7c0c32f53f7b6123aa0afc91dbc1748f Reviewers: arsenm, rampitec, t-tye, b-sumner, jsjodin Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61493 llvm-svn: 364296	2019-06-25 11:51:35 +00:00
Stanislav Mekhanoshin	4336a9496d	[AMDGPU] gfx10 documentation update. NFC. llvm-svn: 363332	2019-06-13 22:18:47 +00:00
Matt Arsenault	4fb580c314	AMDGPU: Remove amdgpu-max-work-group-size attribute This has been deprecated for a long time, and mesa recently switched to amdgpu-flat-work-group-size. llvm-svn: 362641	2019-06-05 20:32:32 +00:00
Zachary Turner	6eb7ab97a5	Try to fix Sphinx bot. llvm-svn: 357790	2019-04-05 18:06:42 +00:00
Matt Arsenault	055e4dce45	AMDGPU: Remove dx10-clamp from subtarget features Since this can be set with s_setreg*, it should not be a subtarget property. Set a default based on the calling convention, and introduce a new amdgpu-dx10-clamp attribute to override this if desired. Also introduce a new amdgpu-ieee attribute to match. The values need to match to allow inlining. I think it is OK for the caller's dx10-clamp attribute to override the callee, but there doesn't appear to be the infrastructure to do this currently without defining the attribute in the generic Attributes.td. Eventually the calling convention lowering will need to insert a mode switch somewhere for these. llvm-svn: 357302	2019-03-29 19:14:54 +00:00
Scott Linder	0bc9f15ddd	[AMDGPU] Add an additional Code Object V3 assembler example Document the intended use of the `.amdgcn.next_free_{s,v}gpr` in the context of multiple kernels and functions. Differential Revision: https://reviews.llvm.org/D59949 llvm-svn: 357289	2019-03-29 17:49:51 +00:00
Konstantin Zhuravlyov	2b766ed774	AMDGPU: Make sram-ecc off by default for Vega20 Differential Revision: https://reviews.llvm.org/D59718 llvm-svn: 357247	2019-03-29 12:04:18 +00:00
Scott Linder	ac20b74573	[AMDGPU] Clarify Code Object V2/V3 differences in AMDGPUUsage Ensure Code Object V2 documentation is complete, but always contains a warning and a link to the equivalent Code Object V3 documentation. Explicitly indicate that any note records present in a code object that are not documented must be considered deprecated and ignored. Differential Revision: https://reviews.llvm.org/D59782 llvm-svn: 357176	2019-03-28 15:08:52 +00:00
Konstantin Zhuravlyov	51809cbc98	AMDGPU: Add support for cross address space synchronization scopes Differential Revision: https://reviews.llvm.org/D59517 llvm-svn: 356946	2019-03-25 20:50:21 +00:00
Neil Henning	523dab0788	[AMDGPU] Add an experimental buffer fat pointer address space. Add an experimental buffer fat pointer address space that is currently unhandled in the backend. This commit reserves address space 7 as a non-integral pointer representing the 160-bit fat pointer (128-bit buffer descriptor + 32-bit offset) that is heavily used in graphics workloads using the AMDGPU backend. Differential Revision: https://reviews.llvm.org/D58957 llvm-svn: 356373	2019-03-18 14:44:28 +00:00
Dmitry Preobrazhensky	62a0318dff	[AMDGPU][MC][CODEOBJECT] Added predefined symbols to access GPU minor and stepping numbers Added the following Code Object v3 symbols: .amdgcn.gfx_generation_minor .amdgcn.gfx_generation_stepping Reviewers: artem.tamazov, kzhuravl Differential Revision: https://reviews.llvm.org/D57826 llvm-svn: 353515	2019-02-08 13:51:31 +00:00
Dmitry Preobrazhensky	47eb63684d	[AMDGPU][MC][DOC] Updated AMD GPU assembler description Stage 2: added detailed description of operands See bug 36572: https://bugs.llvm.org/show_bug.cgi?id=36572 llvm-svn: 349368	2018-12-17 17:38:11 +00:00
Scott Linder	8d5a36a839	[AMDGPU] Update code object metadata format documentation * Add amdhsa prefix to names to allow other tools to use the metadata without collision. * Make names consistent. * Simplify structure. * Change note record ID. * Switch from YAML to MsgPack format. * Document metadata assembler directive. Patch By: t-tye (Tony Tye) Differential Revision: https://reviews.llvm.org/D53445 llvm-svn: 346992	2018-11-15 20:46:55 +00:00
Konstantin Zhuravlyov	3c5d23912b	AMDGPU/Docs: Add product names for Vega20 Differential Revision: https://reviews.llvm.org/D54178 llvm-svn: 346354	2018-11-07 20:54:16 +00:00
Konstantin Zhuravlyov	b44b890100	AMDGPU/Docs: Fix the processor table llvm-svn: 346263	2018-11-06 20:23:53 +00:00
Konstantin Zhuravlyov	108927b944	AMDGPU: Add sram-ecc feature Differential Revision: https://reviews.llvm.org/D53222 llvm-svn: 346177	2018-11-05 22:44:19 +00:00
Tim Renouf	2a1b1d94b6	[AMDGPU] Defined gfx909 Raven Ridge 2 Differential Revision: https://reviews.llvm.org/D53418 Change-Id: Ie3d054f2e956c2768988c0f4c0ffd29a47294eef llvm-svn: 345120	2018-10-24 08:14:07 +00:00
Chandler Carruth	343a87ac8d	[docs] Turn off `nasm` highlighting for a code block. This appears to produce a warning on the docs build bot. It doesn't reproduce for me, likely because I have a newer (or more full featured) pygments install. llvm-svn: 338978	2018-08-06 01:19:43 +00:00
Konstantin Zhuravlyov	dd6b05c34c	AMDHSA: Put old assembler docs back Until we switch to code object v3 by default. Follow up for https://reviews.llvm.org/D47736. Differential Revision: https://reviews.llvm.org/D48497 llvm-svn: 335378	2018-06-22 19:23:18 +00:00
Scott Linder	1e8c2c705d	[AMDGPU] Update assembler for HSA Code Object v3 Update AMDGPU assembler syntax behind the code-object-v3 feature: * Replace/rename most AMDGPU assembler directives/symbols and document them. * Provide more diagnostics (e.g. values out of range, missing values, repeated values). * Provide path for backwards compatibility, even with underlying descriptor changes. Differential Revision: https://reviews.llvm.org/D47736 llvm-svn: 335281	2018-06-21 19:38:56 +00:00
Konstantin Zhuravlyov	766c77efd7	AMDGPU/AMDHSA: Remove GridWorkGroupCountX/Y/Z and everything that comes with it from implementation and v3 header files. Leave definition in v2 header files for backwards compatibility. Differential Revision: https://reviews.llvm.org/D48191 llvm-svn: 335267	2018-06-21 18:36:04 +00:00
Tony Tye	e2f3e10913	[AMDGPU] Document the AMDGPU LLVM attributes Differential Revision: https://reviews.llvm.org/D48101 llvm-svn: 334733	2018-06-14 16:40:10 +00:00
Konstantin Zhuravlyov	00f2cb1116	AMDHSA: Code object v3 updates - Do not emit following assembler directives: - .hsa_code_object_version - .hsa_code_object_isa - .amd_amdgpu_isa - .amd_amdgpu_hsa_metadata - .amd_amdgpu_pal_metadata - Do not emit .note entries - Cleanup and bring in sync kernel descriptor header file - Emit kernel descriptor into .rodata with appropriate relocations and alignments llvm-svn: 334519	2018-06-12 18:02:46 +00:00
Konstantin Zhuravlyov	2ca6b1f2ba	AMDGPU: Always set COMPUTE_PGM_RSRC2.ENABLE_TRAP_HANDLER to zero for AMDHSA as it is set by CP Differential Revision: https://reviews.llvm.org/D47392 llvm-svn: 333451	2018-05-29 19:09:13 +00:00
Tony Tye	43259df44a	[AMDGPU] Change llvm.debugtrap to be a debug breakpoint that can resume execution. No longer require the queue pointer to be passed in in fixed SGPRs. Differential Revision: https://reviews.llvm.org/D46769 llvm-svn: 332485	2018-05-16 16:19:34 +00:00
Matt Arsenault	0084adc516	AMDGPU: Add Vega12 and Vega20 Changes by Matt Arsenault Konstantin Zhuravlyov llvm-svn: 331215	2018-04-30 19:08:16 +00:00
Tony Tye	b6efb90717	[AMDGPU] Add gfx902 product names Differential Revision: https://reviews.llvm.org/D45609 llvm-svn: 330081	2018-04-14 01:58:10 +00:00
Tony Tye	223f4c7c99	[AMDGPU] Update relocation record description Document which relocation records are static and dynamic. Differential Revision: https://reviews.llvm.org/D45587 llvm-svn: 329981	2018-04-13 01:01:27 +00:00
Hiroshi Inoue	bcadfee2ad	[NFC] fix trivial typos in documents and comments "is is" -> "is", "if if" -> "if", "or or" -> "or" llvm-svn: 329878	2018-04-12 05:53:20 +00:00
Tim Corringham	af2dfc697b	Add AMDPAL Code Conventions section to AMD docs Summary: This is a first version of the AMDPAL code conventions. Further updates will undoubtedly be required to fully document AMDPAL. Subscribers: nhaehnle, llvm-commits Differential Revision: https://reviews.llvm.org/D45246 llvm-svn: 329188	2018-04-04 13:02:09 +00:00
Tony Tye	01bfd6c4e5	[AMDGPU] Define code object identification string used in AMDHSA runtimes. Differential Revision: https://reviews.llvm.org/D44718 llvm-svn: 328669	2018-03-27 21:20:46 +00:00
Tony Tye	88441a3d1e	[AMDGPU] Update OpenCL to use 48 bytes of implicit arguments for AMDGPU Add two additional implicit arguments for OpenCL for the AMDGPU target using the AMDHSA runtime to support device enqueue. Differential Revision: https://reviews.llvm.org/D44697 llvm-svn: 328351	2018-03-23 18:58:47 +00:00
Tony Tye	7a893d4e34	[AMDGPU] Remove use of OpenCL triple environment and replace with function attribute for AMDGPU - Remove use of the opencl and amdopencl environment member of the target triple for the AMDGPU target. - Use function attribute to communicate to the AMDGPU backend to add implicit arguments for OpenCL kernels for the AMDHSA OS. Differential Revision: https://reviews.llvm.org/D43736 llvm-svn: 328349	2018-03-23 18:45:18 +00:00
Eugene Zelenko	3507b0489f	[Documentation] Fix markup problem in AMDGPUUsage.rst. llvm-svn: 328116	2018-03-21 17:09:35 +00:00
Craig Topper	b5ed275025	[TableGen] Pass result of std::unique to vector::erase instead of calculating a size and calling resize. llvm-svn: 328031	2018-03-20 20:24:10 +00:00
Dmitry Preobrazhensky	c6d31e6f4e	[AMDGPU][MC][DOC] Updated AMD GPU assembler description See bug 36572: https://bugs.llvm.org/show_bug.cgi?id=36572 Differential Revision: https://reviews.llvm.org/D44020 Reviewers: artem.tamazov, vpykhtin llvm-svn: 327288	2018-03-12 15:55:08 +00:00
Tony Tye	5bbcca6967	[AMDGPU] Update AMDGPUUsage.rst descriptions - Improve description of XNACK ELF flag. - Rename all uses of wave to wavefront to be consistent. Differential Revision: https://reviews.llvm.org/D43983 llvm-svn: 326989	2018-03-08 05:46:01 +00:00
Scott Linder	16c7bdaf32	[DebugInfo] Support DWARF v5 source code embedding extension In DWARF v5 the Line Number Program Header is extensible, allowing values with new content types. In this extension a content type is added, DW_LNCT_LLVM_source, which contains the embedded source code of the file. Add new optional attribute for !DIFile IR metadata called source which contains source text. Use this to output the source to the DWARF line table of code objects. Analogously extend METADATA_FILE in Bitcode and .file directive in ASM to support optional source. Teach llvm-dwarfdump and llvm-objdump about the new values. Update the output format of llvm-dwarfdump to make room for the new attribute on file_names entries, and support embedded sources for the -source option in llvm-objdump. Differential Revision: https://reviews.llvm.org/D42765 llvm-svn: 325970	2018-02-23 23:01:06 +00:00
Konstantin Zhuravlyov	9122a63143	AMDGPU: Bring elf flags in sync with the spec - Add MACH flags - Add XNACK flag - Add reserved flags - Minor cleanups in docs Differential Revision: https://reviews.llvm.org/D43356 llvm-svn: 325399	2018-02-16 22:33:59 +00:00
Yaxun Liu	0124b5484c	[AMDGPU] Change constant addr space to 4 Differential Revision: https://reviews.llvm.org/D43170 llvm-svn: 325030	2018-02-13 18:00:25 +00:00
Matt Arsenault	923712b6b5	Reapply "AMDGPU: Add 32-bit constant address space" This reverts r324494 and reapplies r324487. llvm-svn: 324747	2018-02-09 16:57:57 +00:00
Yaxun Liu	976f317f0c	[AMDGPU] Update documentation about address space llvm-svn: 324617	2018-02-08 15:41:19 +00:00
Rafael Espindola	f4e3f3e31c	Revert "AMDGPU: Add 32-bit constant address space" This reverts commit r324487. It broke clang tests. llvm-svn: 324494	2018-02-07 18:09:35 +00:00
Marek Olsak	871c30e540	AMDGPU: Add 32-bit constant address space Note: This is a candidate for LLVM 6.0, because it was planned to be in that release but was delayed due to a long review period. Merge conflict in release_60 - resolution: Add "-p6:32:32" into the second (non-amdgiz) string. Only scalar loads support 32-bit pointers. An address in a VGPR will fail to compile. That's OK because the results of loads will only be used in places where VGPRs are forbidden. Updated AMDGPUAliasAnalysis and used SReg_64_XEXEC. The tests cover all use cases we need for Mesa. Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D41651 llvm-svn: 324487	2018-02-07 16:01:00 +00:00
Tony Tye	db6c993faf	[AMDGPU] Update relocation documentation and elf flag machine architecture numbers Differential Revision: https://reviews.llvm.org/D42714 llvm-svn: 323835	2018-01-30 23:59:43 +00:00
Tony Tye	e039d0ee12	[AMDGPU] Clarify ReqdWorkGroupSize and MaxFlatWorkGroupSize metadata - If ReqdWorkGroupSize is present it must have all elements >=1. - If MaxFlatWorkGroupSize must be consistent with ReqdWorkGroupSize. - Remove FixedWorkGroupSize as now equivalent to ReqdWorkGroupSize. llvm-svn: 323829	2018-01-30 23:07:10 +00:00
Tim Hammerquist	680671eb26	fix invalid footnote syntax llvm-svn: 321839	2018-01-05 00:24:54 +00:00
Tony Tye	a697880b38	[AMDGPU] Rename Bonaire target to be gfx704; remove gfx800 and make Iceland and Tonga both use gfx802; update target feature handling Correct committed version to match intended accepted review D40051 id=123417 - Rename Bonaire target to be gfx704. - Eliminate gfx800 and make Iceland and Tonga both use gfx802 as they use the same code. - List target features supported by each processor in the processor table together with the default value. - Add xnack flag to e_flags. - Remove xnack from kernel metadata and kernel descriptor since it is now a whole code object property. Differential Revision: https://reviews.llvm.org/D40051 llvm-svn: 320457	2017-12-12 05:47:00 +00:00
Tony Tye	31105cc997	[AMDGPU] Rename Bonaire target to be gfx704; update target feature handling - Rename Bonaire target to be gfx704. - Eliminate gfx800 and make Iceland and Tonga both use gfx802 as they use the same code. - List target features supported by each processor in the processor table together with the default value. - Add xnack flag to e_flags. - Remove xnack from kernel metadata and kernel descriptor since it is now a whole code object property. Differential Revision: https://reviews.llvm.org/D40051 llvm-svn: 320378	2017-12-11 15:35:27 +00:00
Mark Searles	095d4ea4bf	[AMDGPU] Fix typo in Kernel Descriptor for GFX6-GFX9 Differential Revision: https://reviews.llvm.org/D40981 llvm-svn: 320087	2017-12-07 21:24:27 +00:00
Konstantin Zhuravlyov	06ae4ec78e	AMDGPU: Add num spilled s/vgprs to metadata This was requested by tools. Differential Revision: https://reviews.llvm.org/D40321 llvm-svn: 319192	2017-11-28 17:51:08 +00:00
Tony Tye	3507750063	[AMDGPU] Correct targets that support XNACK Differential Revision: https://reviews.llvm.org/D39887 llvm-svn: 317955	2017-11-11 00:50:32 +00:00
Tony Tye	f59d0715b1	[AMDGPU] AMDGPUUsage.rst minor corrections Differential Revision: https://reviews.llvm.org/D39887 llvm-svn: 317924	2017-11-10 20:51:43 +00:00
Tony Tye	07d9f10374	[AMDGPU] Update code object description - Use ELF header flags to identify processor. - Remove isa note record. - Add target feature section. - Make metadata for NumVGPRs, NumSGPRs and MaxFlatWorkGroupSize required. - Add FixedWorkGroupSize to CodeProps metadata. - Add ReqdWorkGroupSize* to kernel descriptor and move MaxFlatWorkGroupSize to be adjacent. - Move IsXNACKEnabled in the kernel descriptor to be at the end of the unused flags. - Remove IsDynamicCallStack from the metadata and kernel descriptor. - Remove legacy debugger metadata. - Remove old XNACK enabled processor names. Differential Revision: https://reviews.llvm.org/D39828 llvm-svn: 317855	2017-11-10 01:00:54 +00:00
Yaxun Liu	c928f2a6d4	[AMDGPU] Emit metadata for hidden arguments for kernel enqueue Identifies kernels which perform device side kernel enqueues and emits metadata for the associated hidden kernel arguments. Such kernels are marked with calls-enqueue-kernel function attribute by AMDGPUOpenCLEnqueueKernelLowering pass and later on hidden kernel arguments metadata HiddenDefaultQueue and HiddenCompletionAction are emitted for them. Differential Revision: https://reviews.llvm.org/D39255 llvm-svn: 316907	2017-10-30 14:30:28 +00:00
Konstantin Zhuravlyov	ea35e46b71	AMDGPU/Docs: Fix unreadable characters llvm-svn: 316171	2017-10-19 17:12:55 +00:00
Tony Tye	6baa6d21e8	[AMDGPU] Corrections to memory model description. - Add description on nontemporal support. - Correct OpenCL sequentially consistent and fence code sequences. - Minor test cleanup. Differential Revision: https://reviews.llvm.org/D39073 llvm-svn: 316131	2017-10-18 22:16:55 +00:00
Konstantin Zhuravlyov	265d253aae	AMDGPU/Docs: Make target naming consistent - R600 Arch: Use Radeon HD XXXX Series - GCN Arch: Use GFXX Differential Revision: https://reviews.llvm.org/D39019 llvm-svn: 316100	2017-10-18 17:59:20 +00:00
Konstantin Zhuravlyov	8d5e9e110c	AMDGPU: Rename MaxFlatWorkgroupSize to MaxFlatWorkGroupSize for consistency Differential Revision: https://reviews.llvm.org/D38957 llvm-svn: 316097	2017-10-18 17:31:09 +00:00
Tony Tye	d288430c3e	Add base relative relocation record that can be used for the following case (OpenCL example): static __global int Var = 0; __global int* Ptr[] = {&Var}; ... In this case Var is a non preemptible symbol and so its address can be used as the value of Ptr, with a base relative relocation that will add the delta between the ELF address and the actual load address. Such relocations do not require a symbol. Differential Revision: https://reviews.llvm.org/D38909 llvm-svn: 315935	2017-10-16 20:44:29 +00:00
Konstantin Zhuravlyov	13376a4bdf	AMDGPU: Add AMDGPU HSA Kernel Descriptor - Update docs to match llvm coding style - Add missing FP16_OVFL bit for gfx9 - Fix the size of the kernel descriptor in the docs Differential Revision: https://reviews.llvm.org/D38902 llvm-svn: 315822	2017-10-14 19:17:08 +00:00
Konstantin Zhuravlyov	a01d8b0b63	AMDGPU: Bring HSA metadata on par with the specification Differential Revision: https://reviews.llvm.org/D38753 llvm-svn: 315821	2017-10-14 19:03:51 +00:00
Yaxun Liu	de4b88d9a1	[AMDGPU] Lower enqueued blocks and generate runtime metadata This patch adds a post-linking pass which replaces the function pointer of enqueued block kernel with a global variable (runtime handle) and adds runtime-handle attribute to the enqueued block kernel. In LLVM CodeGen the runtime-handle metadata will be translated to RuntimeHandle metadata in code object. Runtime allocates a global buffer for each kernel with RuntimeHandle metadata and saves the kernel address required for the AQL packet into the buffer. __enqueue_kernel function in device library knows that the invoke function pointer in the block literal is actually runtime handle and loads the kernel address from it and puts it into AQL packet for dispatching. This cannot be done in FE since FE cannot create a unique global variable with external linkage across LLVM modules. The global variable with internal linkage does not work since optimization passes will try to replace loads of the global variable with its initialization value. Differential Revision: https://reviews.llvm.org/D38610 llvm-svn: 315352	2017-10-10 19:39:48 +00:00
Konstantin Zhuravlyov	3696352d85	AMDGPU/Docs: Follow up on review feedback in https://reviews.llvm.org/D38387 llvm-svn: 314848	2017-10-03 21:18:03 +00:00
Konstantin Zhuravlyov	0aa94d314c	AMDGPU: Add ELFOSABI_AMDGPU_MESA3D Differential Revision: https://reviews.llvm.org/D38387 llvm-svn: 314846	2017-10-03 21:14:14 +00:00
Konstantin Zhuravlyov	a952b44ed5	AMDGPU: Add ELFOSABI_AMDGPU_PAL llvm-svn: 314843	2017-10-03 20:54:07 +00:00

174 Commits