llvm-project

Commit Graph

Author	SHA1	Message	Date
Louis Gerbarg	67474e3755	Make sure no loads resulting from load->switch DAGCombine are marked invariant Currently when DAGCombine converts loads feeding a switch into a switch of addresses feeding a load the new load inherits the isInvariant flag of the left side. This is incorrect since invariant loads can be reordered in cases where it is illegal to reoarder normal loads. This patch adds an isInvariant parameter to getExtLoad() and updates all call sites to pass in the data if they have it or false if they don't. It also changes the DAGCombine to use that data to make the right decision when creating the new load. llvm-svn: 214449	2014-07-31 21:45:05 +00:00
Chandler Carruth	3de980d2ff	[SDAG] Enable the new assert for out-of-range result numbers in SDValues, fixing the two bugs left in the regression suite. The key for both of these was the use a single value type rather than a VTList which caused an unintentionally single-result merge-value node. Fix this by getting the appropriate VTList in place. Doing this exposed that the comments in x86's code abouth how MUL_LOHI operands are handle is wrong. The bug with the use of out-of-range result numbers was hiding the bug about the order of operands here (as best i can tell). There are more places where the code appears to get this backwards still... llvm-svn: 213931	2014-07-25 09:19:23 +00:00
Matt Arsenault	83e60581c3	R600: Add new functions for splitting vector loads and stores. These will be used in future patches and shouldn't change anything yet. llvm-svn: 213877	2014-07-24 17:10:35 +00:00
Matt Arsenault	0daeb63f03	R600: Fix LowerSDIV24 Use ComputeNumSignBits instead of checking for i8 / i16 which only worked when AMDIL was lying about having legal i8 / i16. If an integer is known to fit in 24-bits, we can do division faster with float ops. llvm-svn: 213843	2014-07-24 06:59:20 +00:00
Tom Stellard	067c81567b	R600/SI: Store constant initializer data in constant memory This implements a solution for constant initializers suggested by Vadim Girlin, where we store the data after the shader code and then use the S_GETPC instruction to compute its address. This saves use the trouble of creating a new buffer for constant data and then having to pass the pointer to the kernel via user SGPRs or the input buffer. llvm-svn: 213530	2014-07-21 14:01:14 +00:00
Tim Northover	00fdbbbf60	R600: support fpext/fptrunc operations to and from f16. llvm-svn: 213376	2014-07-18 13:01:37 +00:00
Tim Northover	f861de3d7b	R600: support f16 -> f64 conversion intrinsic. Unfortunately, we don't seem to have a direct truncation, but the extension can be legally split into two operations so we should support that. llvm-svn: 213357	2014-07-18 08:43:24 +00:00
Jan Vesely	6ddb8dd442	R600: Implement zero undef variants of ctlz/cttz v2: use ffbh/l if available v3: Rebase on top of Matt's SI patches Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 213072	2014-07-15 15:51:09 +00:00
Matt Arsenault	ca3976f7ae	R600: Add dag combine for copy of an illegal type. This helps avoid redundant instructions to unpack, and repack the vectors. Ideally we could recognize that pattern and eliminate it. Currently v4i8 and other small element type vectors are scalarized, so this has the added bonus of avoiding that. llvm-svn: 213031	2014-07-15 02:06:31 +00:00
Jan Vesely	2cb62ce2a0	R600: Implement float to long/ulong Use alg. from LegalizeDAG.cpp Move Expand setting to SIISellowering v2: Extend existing tests instead of creating new ones v3: use separate LowerFPTOSINT function v4: use TargetLowering::expandFP_TO_SINT add comment about using FP_TO_SINT for uints Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 212773	2014-07-10 22:40:21 +00:00
Matt Arsenault	d2c9e08b63	R600: Fix mishandling of load / store chains. Fixes various bugs with reordering loads and stores. Scalarized vector loads weren't collecting the chains at all. llvm-svn: 212473	2014-07-07 18:34:45 +00:00
Tom Stellard	e9219e0026	R600: Add a comment that llvm.AMDGPU.trunc is a legacy intrinsic llvm-svn: 212218	2014-07-02 20:53:57 +00:00
Tom Stellard	10ae6a0e6a	R600: Promote i64 loads to v2i32 llvm-svn: 212216	2014-07-02 20:53:54 +00:00
Matt Arsenault	c324b95c77	R600: Fix crashes when an illegal type load or store is not handled. I don't think anything hits this now, but will be exposed in future patches. llvm-svn: 212197	2014-07-02 17:44:53 +00:00
Matt Arsenault	d0e0f0aea0	R600: Move mul combine to separate function llvm-svn: 212052	2014-06-30 17:55:48 +00:00
Matt Arsenault	961ca43180	R600: Move load/store ReplaceNodeResults to common code. Future patches will want to custom lower loads on SI. llvm-svn: 211848	2014-06-27 02:33:47 +00:00
Aaron Ballman	3c81e46b57	Silencing a warning about isZExtFree hiding an inherited virtual function. No functional change intended. llvm-svn: 211783	2014-06-26 13:45:47 +00:00
Matt Arsenault	c6f8fdb4e5	R600: Fix vector FMA llvm-svn: 211757	2014-06-26 01:28:05 +00:00
Tom Stellard	9b3816b5ee	R600: Promote i64 stores to v2i32 Now we need only one 64-bit pattern for stores. llvm-svn: 211643	2014-06-24 23:33:04 +00:00
Matt Arsenault	257d48d22c	R600: Fix inconsistency in rsq instructions. R600 was using a clamped version of rsq, but SI was not. Add a new rsq_clamped intrinsic and use them consistently. It's unclear to me from the documentation what behavior the R600 instructions have, so I assume they have the legacy behavior described by the SI documents. For R600, use RECIPSQRT_IEEE for both llvm.AMDGPU.rsq.legacy and llvm.AMDGPU.rsq. R600 also has RECIPSQRT_FF, which I'm not sure how it fits in here. llvm-svn: 211637	2014-06-24 22:13:39 +00:00
Matt Arsenault	d40b970616	R600: Remove DIV_INF This corresponded to an amdil instruction which there is a 2 instruction equivalent for. llvm-svn: 211616	2014-06-24 17:42:16 +00:00
Matt Arsenault	f2b0aebb8a	R600/SI: Fix div_scale intrinsic. The operand that must match one of the others does matter, and implement selecting for it. llvm-svn: 211523	2014-06-23 18:28:28 +00:00
Matt Arsenault	1d555c4e91	R600: Remove AMDILISelLowering llvm-svn: 211519	2014-06-23 18:00:55 +00:00
Matt Arsenault	d5f91fd883	R600: Select is not expensive. llvm-svn: 211518	2014-06-23 18:00:52 +00:00
Matt Arsenault	c4d3d3a16e	R600: Move add/sub with overflow out of AMDILISelLowering Add more tests for these. llvm-svn: 211517	2014-06-23 18:00:49 +00:00
Matt Arsenault	e54e1c3a21	R600: Move more out of AMDILISelLowering llvm-svn: 211516	2014-06-23 18:00:44 +00:00
Matt Arsenault	b8b5153935	R600/SI: Handle i64 sub. We can handle it the same way as add llvm-svn: 211514	2014-06-23 18:00:38 +00:00
Matt Arsenault	c791f39912	R600: Rename AMDIL file llvm-svn: 211512	2014-06-23 18:00:31 +00:00
Jan Vesely	343cd6f056	R600: Use LowerSDIVREM for i64 node replace v2: move div/rem node replacement to R600ISelLowering make lowerSDIVREM protected Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211478	2014-06-22 21:43:01 +00:00
Jan Vesely	109efdff6a	R600: Implement custom SDIVREM. Instead of separate SDIV/SREM. SDIV used UDIV which in turn used UDIVREM anyway. SREM used SDIV(UDIV->UDIVREM)+MUL+SUB, using UDIVREM directly is more efficient. v2: Don't use all caps names Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211477	2014-06-22 21:43:00 +00:00
Tom Stellard	9c603ebca4	R600/SI: Add a pattern for f32 ftrunc llvm-svn: 211377	2014-06-20 17:06:09 +00:00
Tom Stellard	a79e9f0f6d	R600: Expand vector flog2 llvm-svn: 211376	2014-06-20 17:06:07 +00:00
Tom Stellard	5222a88653	R600: Expand vector fexp2 llvm-svn: 211375	2014-06-20 17:06:05 +00:00
Matt Arsenault	a0050b0961	R600/SI: Add intrinsics for various math instructions. These will be used for custom lowering and for library implementations of various math functions, so it's useful to expose these as builtins. llvm-svn: 211247	2014-06-19 01:19:19 +00:00
Matt Arsenault	2b0fa433a0	Use stdint macros for specifying size of constants llvm-svn: 211231	2014-06-18 22:11:03 +00:00
Matt Arsenault	692bd5ec2f	R600: Handle fnearbyint The difference from rint isn't really relevant here, so treat them as equivalent. OpenCL doesn't have nearbyint, so this is sort of pointless other than for completeness. llvm-svn: 211229	2014-06-18 22:03:45 +00:00
Matt Arsenault	b55c68f171	Use LL suffix for literal that should be 64-bits. This hopefully fixes Windows llvm-svn: 211225	2014-06-18 21:40:43 +00:00
Jan Vesely	85f0dbce5c	R600: Expand vector fceil Move fp64 fceil tests to fceil64.ll v2: rebase Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211194	2014-06-18 17:57:29 +00:00
Matt Arsenault	d22626f6bb	Work around ridiculous warning. Apparently C++ doesn't really have hex floating point constants. llvm-svn: 211192	2014-06-18 17:45:58 +00:00
Matt Arsenault	43160e7af2	R600/SI: Add intrinsics for brev instructions llvm-svn: 211187	2014-06-18 17:13:57 +00:00
Matt Arsenault	4601093267	R600: Implement f64 ftrunc, ffloor and fceil. CI has instructions for these, so this fixes them for older hardware. llvm-svn: 211183	2014-06-18 17:05:30 +00:00
Matt Arsenault	e8208ec95b	R600: Custom lower f64 frint for pre-CI llvm-svn: 211182	2014-06-18 17:05:26 +00:00
Matt Arsenault	8579601050	R600/SI: Match ctlz_zero_undef llvm-svn: 211115	2014-06-17 17:36:24 +00:00
Tom Stellard	880a80ad07	R600: Use LDS and vectors for private memory llvm-svn: 211110	2014-06-17 16:53:14 +00:00
Tom Stellard	aad4659470	SelectionDAG: Expand i64 = FP_TO_SINT i32 llvm-svn: 211108	2014-06-17 16:53:07 +00:00
Matt Arsenault	2a60de548a	Fix copy paste error llvm-svn: 211003	2014-06-15 21:22:52 +00:00
Matt Arsenault	717c1d0319	R600: Remove a few more things from AMDILISelLowering Try to keep all the setOperationActions for integer ops together. llvm-svn: 211001	2014-06-15 21:08:58 +00:00
Matt Arsenault	b5dff9ab50	R600: Fix assert on vector sdiv llvm-svn: 211000	2014-06-15 21:08:54 +00:00
Matt Arsenault	14d4645e46	R600: Move / cleanup more leftover AMDIL stuff. llvm-svn: 210998	2014-06-15 20:23:38 +00:00
Matt Arsenault	1578aa78d4	R600: Move division custom lowering out of AMDILISelLowering llvm-svn: 210997	2014-06-15 20:08:02 +00:00
Matt Arsenault	cf9a9a148e	R600: Report that integer division is expensive. Divides by weird constants now emit much better code. llvm-svn: 210995	2014-06-15 19:48:16 +00:00
Matt Arsenault	e682a19a1c	R600: Fix asserts related to constant initializers This would assert if a constant address space was extern and therefore didn't have an initializer. If the initializer was undef, it would hit the unreachable unhandled initializer case. An extern global should never really occur since we don't have machine linking, but bugpoint likes to remove initializers. llvm-svn: 210967	2014-06-14 04:26:05 +00:00
Matt Arsenault	41aa27c96b	R600: Use address space enum instead of value llvm-svn: 210966	2014-06-14 04:26:01 +00:00
Matt Arsenault	fd8c24ede8	R600: Cleanup some old AMDIL stuff. Move / delete some of the more obviously wrong setOperationAction calls. Most of these are setting Expand for types that aren't legal which is the default anyway. Leave stuff that might require more thought on whether it's junk or not as it is. No functionality change. llvm-svn: 210922	2014-06-13 17:20:53 +00:00
Matt Arsenault	825fb0b094	R600/SI: Fix selection error on i64 rotl / rotr. Evergreen is still broken due to missing shl_parts. llvm-svn: 210885	2014-06-13 04:00:30 +00:00
Matt Arsenault	5d47d4ac7e	R600: Mostly remove remaining AMDIL intrinsics. Delete all unused ones, and add new AMDGPU named intrinsics for the ones that are. Handle the old AMDIL names for comptability (although remove their GCCBuiltin names) and add tests since there weren't any for these before. llvm-svn: 210827	2014-06-12 21:15:44 +00:00
Matt Arsenault	364a6747aa	R600/SI: Use v_cvt_f32_ubyte* instructions This eliminates extra extract instructions when loading an i8 vector to a float vector. llvm-svn: 210666	2014-06-11 17:50:44 +00:00
Rafael Espindola	ace0080a4a	Try to fix the msvc build. llvm-svn: 210636	2014-06-11 04:41:37 +00:00
Matt Arsenault	10da3b2516	Use cast instead of assert + dyn_cast llvm-svn: 210628	2014-06-11 03:30:06 +00:00
Matt Arsenault	c9df794042	R600: Add helper functions. Extract these from some of my other patches, since this is the only thing really making them dependent on each other. llvm-svn: 210627	2014-06-11 03:29:54 +00:00
Matt Arsenault	6042506b5c	R600: Use BCNT_INT for evergreen llvm-svn: 210569	2014-06-10 19:18:28 +00:00
Matt Arsenault	b5b5110b5c	R600/SI: Use bcnt instruction for ctpop llvm-svn: 210567	2014-06-10 19:18:21 +00:00
Matt Arsenault	6e43965fbc	R600: Handle fcopysign llvm-svn: 210564	2014-06-10 19:00:20 +00:00
Matt Arsenault	13ccc8f1bc	R600: Fix selection failure for vector bswap llvm-svn: 210475	2014-06-09 16:20:25 +00:00
Matt Arsenault	616a8e42b1	R600: Set all float vector expands in the same place llvm-svn: 209988	2014-06-01 07:38:21 +00:00
Matt Arsenault	05e96f4444	R600: Try to convert BFE back to standard bit ops when possible. This allows existing DAG combines to work on them, and then we can re-match to BFE if necessary during instruction selection. llvm-svn: 209462	2014-05-22 18:09:12 +00:00
Matt Arsenault	5565f65e13	R600: Add dag combine for BFE llvm-svn: 209461	2014-05-22 18:09:07 +00:00
Matt Arsenault	bf8694d36d	R600: Implement ComputeNumSignBitsForTargetNode for BFE llvm-svn: 209460	2014-05-22 18:09:03 +00:00
Matt Arsenault	af6df9d943	R600: Implement computeMaskedBitsForTargetNode for BFE llvm-svn: 209459	2014-05-22 18:09:00 +00:00
Matt Arsenault	eb260206c3	R600: Add intrinsics for mad24 llvm-svn: 209456	2014-05-22 18:00:15 +00:00
Matt Arsenault	40100887b6	R600: Add comment describing problems with LowerConstantInitializer llvm-svn: 209333	2014-05-21 22:59:17 +00:00
Matt Arsenault	6a57fd8b47	R600: Partially fix constant initializers for structs and vectors. This should extend the current workaround to work with structs that only contain legal, scalar types. llvm-svn: 209331	2014-05-21 22:42:42 +00:00
Matt Arsenault	03df7eeda1	Use cast<> instead of unchecked dyn_cast llvm-svn: 209310	2014-05-21 18:03:59 +00:00
Matt Arsenault	d504a74e3c	Use range for llvm-svn: 208922	2014-05-15 21:44:05 +00:00
Jay Foad	a0653a3e6c	Rename ComputeMaskedBits to computeKnownBits. "Masked" has been inappropriate since it lost its Mask parameter in r154011. llvm-svn: 208811	2014-05-14 21:14:37 +00:00
Matt Arsenault	62b1737081	R600: Add mul24 intrinsics llvm-svn: 208604	2014-05-12 17:49:57 +00:00
Matt Arsenault	46013d903f	Fix return before else llvm-svn: 208510	2014-05-11 21:24:41 +00:00
Tom Stellard	a2acad785a	R600: Expand i64 SELECT_CC llvm-svn: 208430	2014-05-09 16:42:19 +00:00
Tom Stellard	afa8b532b1	R600: Move MIN/MAX matching from LowerOperation() to PerformDAGCombine() llvm-svn: 208429	2014-05-09 16:42:16 +00:00
Matt Arsenault	e8a076a253	R600: Promote f64 vector load/stores to i64 for consistency llvm-svn: 208344	2014-05-08 18:01:56 +00:00
Tom Stellard	45b3dcd35b	R600: Expand i64 ISD:SUB llvm-svn: 208005	2014-05-05 21:47:15 +00:00
Tom Stellard	3dbf1f8df0	R600: Expand vector sin and cos. v2: move code to AMDGPUISelLowering.cpp squash with tests (both EG and SI) Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 207845	2014-05-02 15:41:47 +00:00
Tom Stellard	605e116e8e	R600: Expand TruncStore i64 -> {i16,i8} llvm-svn: 207844	2014-05-02 15:41:46 +00:00
Tom Stellard	676f571999	R600: optimize the UDIVREM 64 algorithm This is a squash of several optimization commits: - calculate DIV_Lo and DIV_Hi separately - use BFE_U32 if we are operating on 32bit values - use precomputed constants instead of shifting in UDVIREM - skip the first 32 iterations of udivrem v2: Check whether BFE is supported before using it Patch by: Jan Vesely Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 207589	2014-04-29 23:12:46 +00:00
Tom Stellard	bcd318fc76	R600: Implement iterative algorithm for udivrem Initial implementation, rather slow Patch by: Jan Vesely Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 207588	2014-04-29 23:12:45 +00:00
Tom Stellard	5f3378879f	R600: Change UDIV/UREM to UDIVREM when legalizing types When legalizing ops, with UDIV/UREM set to expand, they automatically expand to UDIVREM (if legal or custom). We need to do this manually for legalize types. v2: SI should be set to Expand because the type is legal, and it is automatically lowered to UDIVREM if UDIVREM is Legal/Custom R600 should set to UDIV/UREM to Custom because it needs to lower them during type legalization Patch by: Jan Vesely Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 207587	2014-04-29 23:12:43 +00:00
Tom Stellard	df780303ef	R600: remove unused variable Patch by: Jan Vesely Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 207586	2014-04-29 23:12:38 +00:00
Craig Topper	8c0b4d0791	Convert more SelectionDAG functions to use ArrayRef. llvm-svn: 207397	2014-04-28 05:57:50 +00:00
Craig Topper	64941d9786	Convert SelectionDAG::getMergeValues to use ArrayRef. llvm-svn: 207374	2014-04-27 19:20:57 +00:00
Craig Topper	48d114bed1	Convert SelectionDAG::getNode methods to use ArrayRef<SDValue>. llvm-svn: 207327	2014-04-26 18:35:24 +00:00
Matt Arsenault	de1c3410c3	R600: Fix function name printing in LowerCall v2: Check both ExternalSymbol and GlobalAddress Patch by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 207282	2014-04-25 22:22:01 +00:00
Craig Topper	062a2baef0	[C++] Use 'nullptr'. Target edition. llvm-svn: 207197	2014-04-25 05:30:21 +00:00
Matt Arsenault	16353871c3	R600: Emit error instead of unreachable on function call llvm-svn: 206904	2014-04-22 16:42:00 +00:00
Matt Arsenault	a3c8cde77b	R600: Change how vector truncating stores are packed. Don't introduce new operations on an illegal sub 32-bit type. Do the operations on a 32-bit value, and then use a truncating store. llvm-svn: 206864	2014-04-22 04:11:14 +00:00
Matt Arsenault	5dbd5db518	R600: Make sign_extend_inreg legal. Don't know why I didn't just do this in the first place. llvm-svn: 206862	2014-04-22 03:49:30 +00:00
Tom Stellard	aeeea8a864	R600: Add comment clariying use of sext for result of MUL_U24 llvm-svn: 206501	2014-04-17 21:00:13 +00:00
Matt Arsenault	4e46665a80	R600: Expand sign extension of vectors. Setting vector types to expand will result in scalarization on pre SI hw, as those gpus don't have vector shifts either. Expand also i32 vectors, this helps llvm make the correct decision about scalarizing the vector ops. v2: move setOperation() calls to R600ISelLowering.cpp. cleanup the SI code to make it obvious that this patch does is nop for SI Patch by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 206348	2014-04-16 01:41:30 +00:00
Matt Arsenault	470acd81a8	R600/SI: Fix loads of i1 llvm-svn: 206330	2014-04-15 22:28:39 +00:00
Nick Lewycky	aad475b324	Break PseudoSourceValue out of the Value hierarchy. It is now the root of its own tree containing FixedStackPseudoSourceValue (which you can use isa/dyn_cast on) and MipsCallEntry (which you can't). Anything that needs to use either a PseudoSourceValue* and Value* is strongly encouraged to use a MachinePointerInfo instead. llvm-svn: 206255	2014-04-15 07:22:52 +00:00
Matt Arsenault	9ec3cf2c8a	Move ExtractVectorElements to SelectionDAG. This seems generally useful, and makes sense to go along with SplitVector. llvm-svn: 206041	2014-04-11 17:47:30 +00:00

1 2 3 4 5

230 Commits