llvm-project

Commit Graph

Author	SHA1	Message	Date
Ivan Kosarev	4b9dbbdb09	[AMDGPU][MC][NFC] Refine SMEM load definitions. Reviewed By: dp Differential Revision: https://reviews.llvm.org/D130009	2022-07-21 14:56:56 +01:00
Ivan Kosarev	432cbd7827	[AMDGPU][CodeGen] Support (register + immediate) SMRD offsets. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D129381	2022-07-18 11:29:31 +01:00
Ivan Kosarev	60d6fbb621	[AMDGPU][GFX9][GFX10] Support base+soffset+offset SMEM atomics. Resolves a part of https://github.com/llvm/llvm-project/issues/38652 Reviewed By: dp Differential Revision: https://reviews.llvm.org/D127314	2022-06-10 13:22:41 +01:00
Ivan Kosarev	facbfb121a	[AMDGPU][GFX9+] Support base+soffset+offset s_atc_probe's. Resolves part of https://github.com/llvm/llvm-project/issues/38652 Reviewed By: dp Differential Revision: https://reviews.llvm.org/D126791	2022-06-06 16:46:22 +01:00
Ivan Kosarev	79ec1e8fd6	[AMDGPU][GFX9][GFX10] Support base+soffset+offset s_dcache_discard's. Resolves part of https://github.com/llvm/llvm-project/issues/38652 Reviewed By: dp Differential Revision: https://reviews.llvm.org/D126766	2022-06-06 16:32:16 +01:00
Ivan Kosarev	f199b2b00f	[AMDGPU][NFC] Refine defining the offset field for GFX10+ SMEM instructions. Reviewed By: dp Differential Revision: https://reviews.llvm.org/D126662	2022-05-31 09:54:51 +01:00
Ivan Kosarev	b4dbcba3b7	[AMDGPU][GFX9][NFC] Rename the base class for SMEM stores.	2022-05-30 10:31:59 +01:00
Ivan Kosarev	082822b381	[AMDGPU][GFX9] Support base+soffset+offset SMEM stores. Reviewed By: dp Differential Revision: https://reviews.llvm.org/D126388	2022-05-30 10:27:57 +01:00
Ivan Kosarev	b0ccf38b01	[AMDGPU][GFX9] Support base+soffset+offset SMEM loads. Resolves part of https://github.com/llvm/llvm-project/issues/38652 Reviewed By: dp Differential Revision: https://reviews.llvm.org/D125700	2022-05-26 12:42:33 +01:00
Ivan Kosarev	1586e1dc95	[AMDGPU][MC][GFX11] Support base+soffset+offset SMEM loads. Reviewed By: dp Differential Revision: https://reviews.llvm.org/D126207	2022-05-24 15:13:14 +01:00
Jay Foad	9af56c676e	[AMDGPU] Mark SMEM cache invalidations as not reading memory This brings the MachineInstrs in line with the corresponding intrinsics which have side effects but do not access memory. It also matches how BUF cache invalidation instructions are defined. The lit test changes are just because the machine scheduler previously treated them like loads, and added an artificial scheduling edge from them to the exit SU, which caused them to be scheduled earlier. Differential Revision: https://reviews.llvm.org/D126074	2022-05-20 17:18:03 +01:00
Jay Foad	9ece051847	[AMDGPU] Mark s_get_waveid_in_workgroup as not reading memory It is already marked as having side effects, at least in MIR. It does not interact with anything else that is modelled as a memory access either in IR or MachineIR. Differential Revision: https://reviews.llvm.org/D125985	2022-05-19 21:25:46 +01:00
Joe Nash	ac2ff258d6	[AMDGPU] gfx11 scalar memory instructions Contributors: Mirko Brkusanin <Mirko.Brkusanin@amd.com> Patch 9/N for upstreaming of AMDGPU gfx11 architecture. Depends on D125820 Reviewed By: kosarev, #amdgpu, arsenm Differential Revision: https://reviews.llvm.org/D125822	2022-05-19 10:27:47 -04:00
Dmitry Preobrazhensky	169416c64a	[AMDGPU][MC][GFX7] Disable cache policy modifiers with SMRD Differential Revision: https://reviews.llvm.org/D125799	2022-05-18 15:17:49 +03:00
Joe Nash	c70259405c	[AMDGPU] gfx11 BUF Instructions Includes MachineCode layer support and tests, and MIR tests not requiring CodeGen pass changes. Includes a small change in SMInstructions.td to correct encoded bits. Contributors: Petar Avramovic <Petar.Avramovic@amd.com> Dmitry Preobrazhensky <dmitry.preobrazhensky@amd.com> Depends on D125316 Patch 6/N for upstreaming of AMDGPU gfx11 architecture. Reviewed By: dp, Petar.Avramovic Differential Revision: https://reviews.llvm.org/D125319	2022-05-16 09:41:40 -04:00
Ivan Kosarev	cb67b2ccc4	[AMDGPU][GFX10] Support base+soffset+offset SMEM stores. Also makes another step towards resolving https://github.com/llvm/llvm-project/issues/38652 Reviewed By: foad, dp Differential Revision: https://reviews.llvm.org/D125380	2022-05-12 08:48:05 +01:00
Ivan Kosarev	88f04bdbd8	[AMDGPU][GFX10] Support base+soffset+offset SMEM loads. Also makes a step towards resolving https://github.com/llvm/llvm-project/issues/38652 Reviewed By: foad, dp Differential Revision: https://reviews.llvm.org/D125117	2022-05-10 16:17:14 +01:00
Jay Foad	077a14e00b	[AMDGPU] Mark time intrinsics as nomem, hassideeffects Adding IntrHasSideEffects to @llvm.amdgcn.s.memtime and @llvm.amdgcn.s.memrealtime means that we can stop pretending they read and write memory, and similarly for the corresponding pseudo instructions. This should stop these intrinsics from being rescheduled past all other instructions, even ones which don't load or store. See also https://reviews.llvm.org/D58635. Differential Revision: https://reviews.llvm.org/D115227	2021-12-07 16:24:06 +00:00
Jessica Clarke	3ee56eed2f	[AMDGPU][NFC] Alter ComplexPattern types to be consistent with their uses When used as a non-leaf node, TableGen does not currently use the type of a ComplexPattern for type inference, which also means it does not check it doesn't conflict with the use. This differs from when used as a leaf value, where the type is used for inference. Fixing that discrepancy is something I intend to upstream as a subsequent review. AMDGPU currently has several ComplexPatterns that are used in contexts where they're expected to be an iPTR, and where using an iPTR instead of a fixed-width integer type matters. With my locally-patched TableGen, none of these mismatches result in type contradictions, but do change the patterns and cause various failures to select. These changes to the ComplexPatterns' types reflect how they are actually used, result in bit-for-bit identical TableGen output (without my local TableGen patch), and ensure that with improved type inference AMDGPU's backend will continue to work. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D109032	2021-12-03 07:04:59 +00:00
Jay Foad	24ffc343f9	[AMDGPU] Set IsAtomicRet and IsAtomicNoRet on Real instructions This does not affect codegen but might benefit llvm-mca.	2021-06-16 12:23:29 +01:00
Jay Foad	6f778fed8e	[AMDGPU] Set more flags on Real instructions This does not affect codegen, which only tests these flags on Pseudo instructions, but might help llvm-mca which has to work with Real instructions. In particular setting LGKM_CNT on DS instructions helps with the problem identified in D104149. Differential Revision: https://reviews.llvm.org/D104293	2021-06-16 09:58:50 +01:00
Dmitry Preobrazhensky	3eadcb86ab	[AMDGPU][MC][GFX9] Corrected SMEM decoding Corrected SMEM decoding when IMM=0 and OFFSET>127 Fixed bug 49819 (https://bugs.llvm.org/show_bug.cgi?id=49819) Differential Revision: https://reviews.llvm.org/D99804	2021-04-06 14:10:46 +03:00
Jay Foad	fc7e3e7dd9	[AMDGPU] Set SchedRW on real instructions Coyp SchedRW from pseudos to real instructions so that llvm-mca has access to it. This is NFC for normal compiler codegen, which schedules pseudos not real instructions. Add an llvm-mca test for some high latency double-precision instructions as a smoke test. Differential Revision: https://reviews.llvm.org/D99187	2021-03-23 15:38:11 +00:00
Stanislav Mekhanoshin	3bffb1cd0e	[AMDGPU] Use single cache policy operand Replace individual operands GLC, SLC, and DLC with a single cache_policy bitmask operand. This will reduce the number of operands in MIR and I hope the amount of code. These operands are mostly 0 anyway. Additional advantage that parser will accept these flags in any order unlike now. Differential Revision: https://reviews.llvm.org/D96469	2021-03-15 13:00:59 -07:00
Jay Foad	99682bc039	Revert "Revert "[AMDGPU] Restore the s_memtime instruction in gfx1030"" This reverts commit `e58d68fcd0`. This reinstates commit `fc28f600e5` with a fix to initialize HasShaderCyclesRegister. See https://reviews.llvm.org/D97928.	2021-03-06 09:00:01 +00:00
Mitch Phillips	e58d68fcd0	Revert "[AMDGPU] Restore the s_memtime instruction in gfx1030" Broke the ASan/MSan buildbots. See more comments in the original patch, https://reviews.llvm.org/D97928. Build failure at http://lab.llvm.org:8011/#/builders/5/builds/5327 This reverts commit `fc28f600e5`.	2021-03-05 18:24:59 -08:00
Jay Foad	fc28f600e5	[AMDGPU] Restore the s_memtime instruction in gfx1030 gfx1030 added a new way to implement readcyclecounter using the SHADER_CYCLES hardware register, but the s_memtime instruction still exists, so the MC layer should still accept it and the llvm.amdgcn.s.memtime intrinsic should still work. Differential Revision: https://reviews.llvm.org/D97928	2021-03-05 20:19:11 +00:00
Stanislav Mekhanoshin	48d2e04152	[AMDGPU] Mark SMRD atomics We did not have atomic flags on SMRD, did not copy TSFlags to real instructions, and did not have ret/noret atomic map. At the moment it is NFC, but needed for D96469. Differential Revision: https://reviews.llvm.org/D96823	2021-02-17 16:47:02 -08:00
Jay Foad	07e92e6b60	[AMDGPU] Make use of HasSMemRealTime predicate. NFC. We have this subtarget feature so it makes sense to use it here. This is NFC because it's always defined by default on GFX8+. Differential Revision: https://reviews.llvm.org/D93202	2020-12-14 16:34:57 +00:00
Matt Arsenault	779cba79ec	AMDGPU: Remove mayLoad/mayStore from some side effecting intrinsics These don't really modify any memory, and should not expect memory operands.	2020-06-18 14:12:19 -04:00
Stanislav Mekhanoshin	3f0c9c1634	Fix ubsan error in tblgen with signed left shift UBSAN complains when tblgen performs SHL of a negative value. Differential Revision: https://reviews.llvm.org/D81952	2020-06-16 11:15:09 -07:00
Stanislav Mekhanoshin	576fa5a50c	[AMDGPU] make ubsan happy with unsigned left shift Fixes UBSAN error after rG9ee272f13d88f090817235ef4f91e56bb2a153d6 A trivial signed/unsigned shift.	2020-06-15 17:21:10 -07:00
Stanislav Mekhanoshin	9ee272f13d	[AMDGPU] Add gfx1030 target Differential Revision: https://reviews.llvm.org/D81886	2020-06-15 16:18:05 -07:00
Matt Arsenault	27f8bd94cb	AMDGPU/GlobalISel: Fix select of <8 x s64> scalar load	2020-06-11 19:09:43 -04:00
Dmitry Preobrazhensky	5998baccb9	[AMDGPU][MC][GFX9+] Enabled 21-bit signed offsets for SMEM instructions Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D79288	2020-05-06 14:13:10 +03:00
Kazuaki Ishizaki	0312b9f550	[llvm] NFC: Fix trivial typo in rst and td files Differential Revision: https://reviews.llvm.org/D77469	2020-04-23 14:26:32 +09:00
Matt Arsenault	26ebc51a34	AMDGPU/GlobalISel: Fix smrd loads of v4i64	2020-03-24 13:44:41 -04:00
Matt Arsenault	86813e2768	AMDGPU/GlobalISel: Select llvm.amdgcn.s.buffer.load Doesn't try to fail on the dlc bit pre-gfx10 like the DAG lowering does.	2020-02-17 08:02:40 -08:00
Matt Arsenault	75fcdfa1fc	AMDGPU: Cleanup SMRD buffer selection The usage of the Imm out argument from SelectSMRDOffset is pretty confusing. Stop trying to reject CI immediates in the case where the offset field can be used. It's not an illegal way to encode the immediate, so just prefer the better encoding pattern with AddedComplexity. We probably don't even really need the different opcodes for the different offset types anymore, but that will be more work to cleanup. The SMRD non-buffer load patterns could also use a cleanup to be done separately.	2020-02-04 10:28:08 -08:00
Matt Arsenault	97a1d4bc02	AMDGPU: Don't use separate cache arguments for s_buffer_load node There's not much value to this separate node from the intrinsic. Make the operand structure the same as the intrinsic, so we can reuse the same pattern for GlobalISel.	2020-01-30 14:15:26 -08:00
Matt Arsenault	7f3280ecdd	AMDGPU/GlobalISel: Select permlane16/permlanex16	2020-01-29 17:55:31 -05:00
Stanislav Mekhanoshin	4312c4afd4	[AMDGPU] deduplicate tablegen predicates We are duplicating predicates if several parts of the combined predicate list contain the same condition. Added code to deduplicate the list. We have AssemblerPredicates and AssemblerPredicate in the PredicateControl, but we never use AssemblerPredicates with an actual list, so this one is dropped. This addresses the first part of the llvm bug 43886: https://bugs.llvm.org/show_bug.cgi?id=43886 Differential Revision: https://reviews.llvm.org/D69815	2019-11-04 12:19:17 -08:00
Matt Arsenault	bc8de8a8da	AMDGPU/GlobalISel: Select SMRD loads for more types llvm-svn: 371954	2019-09-16 00:54:07 +00:00
Stanislav Mekhanoshin	818d748a45	[AMDGPU] Always use s_memtime for readcyclecounter Differential Revision: https://reviews.llvm.org/D64369 llvm-svn: 365431	2019-07-09 03:10:18 +00:00
Nicolai Haehnle	490e83cd43	AMDGPU/GFX10: Support DLC bit in llvm.amdgcn.s.buffer.load intrinsic Summary: Change-Id: Ie4c971462a7749740938c687144e77441dac2539 Reviewers: rampitec, arsenm Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62486 Change-Id: Iae59523edd75c74918d2118df6571a7b671717a0 llvm-svn: 363514	2019-06-16 17:14:12 +00:00
Stanislav Mekhanoshin	a6322941ff	[AMDGPU] gfx1010 VMEM and SMEM implementation Differential Revision: https://reviews.llvm.org/D61330 llvm-svn: 359621	2019-04-30 22:08:23 +00:00
Stanislav Mekhanoshin	5182302a37	[AMDGPU] Sort out and rename multiple CI/VI predicates Differential Revision: https://reviews.llvm.org/D60346 llvm-svn: 357835	2019-04-06 09:20:48 +00:00
Stanislav Mekhanoshin	7895c03232	[AMDGPU] predicate and feature refactoring We have done some predicate and feature refactoring lately but did not upstream it. This is to sync. Differential revision: https://reviews.llvm.org/D60292 llvm-svn: 357791	2019-04-05 18:24:34 +00:00
Matt Arsenault	f97ace5639	AMDGPU: Remove IntrReadMem from memtime/memrealtime intrinsics EarlyCSE with MemorySSA was able to use this to merge multiple calls with no intervening store. llvm-svn: 354814	2019-02-25 20:16:11 +00:00
Konstantin Zhuravlyov	9a278bf6b5	Revert "AMDGPU/NFC: Cleanup subtarget predicates" It breaks one of our downstream merges, so revert it temporarily while investigating failures downstream llvm-svn: 354700	2019-02-22 23:21:06 +00:00

1 2

85 Commits