llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	46645fa102	R600/SI: Implement getOptimalMemOpType The default guess uses i32. This needs an address space argument to really do the right thing in all cases. llvm-svn: 214104	2014-07-28 17:49:26 +00:00
Matt Arsenault	86033cae84	R600/SI: Make argument loads invariant llvm-svn: 214101	2014-07-28 17:31:39 +00:00
Matt Arsenault	6f2a526101	Add alignment value to allowsUnalignedMemoryAccess Rename to allowsMisalignedMemoryAccess. On R600, 8 and 16 byte accesses are mostly OK with 4-byte alignment, and don't need to be split into multiple accesses. Vector loads with an alignment of the element type are not uncommon in OpenCL code. llvm-svn: 214055	2014-07-27 17:46:40 +00:00
Matt Arsenault	a5789bb4e1	R600: Move intrinsic lowering to separate functions llvm-svn: 214023	2014-07-26 06:23:37 +00:00
Matt Arsenault	83e60581c3	R600: Add new functions for splitting vector loads and stores. These will be used in future patches and shouldn't change anything yet. llvm-svn: 213877	2014-07-24 17:10:35 +00:00
Tom Stellard	e812f2fdd8	R600/SI: Clean up some of the unused REGISTER_{LOAD,STORE} code There are a few more cleanups to do, but I ran into some problems with ext loads and trunc stores, when I tried to change some of the vector loads and stores from custom to legal, so I wasn't able to get rid of everything. llvm-svn: 213552	2014-07-21 15:45:06 +00:00
Tom Stellard	b02094e115	R600/SI: Use scratch memory for large private arrays llvm-svn: 213551	2014-07-21 15:45:01 +00:00
Tom Stellard	067c81567b	R600/SI: Store constant initializer data in constant memory This implements a solution for constant initializers suggested by Vadim Girlin, where we store the data after the shader code and then use the S_GETPC instruction to compute its address. This saves use the trouble of creating a new buffer for constant data and then having to pass the pointer to the kernel via user SGPRs or the input buffer. llvm-svn: 213530	2014-07-21 14:01:14 +00:00
NAKAMURA Takumi	45e0a83141	SIISelLowering.cpp: Define _USE_MATH_DEFINES to let M_PI provided on MS <cmath>. FIXME: Would it be better to move it into configure? llvm-svn: 213477	2014-07-20 11:15:07 +00:00
Matt Arsenault	e261b6e853	R600/SI: Remove dead code and add missing tests. This probably was killed by some generic DAGCombiner improvements in checking the TargetBooleanContents instead of just 1. llvm-svn: 213471	2014-07-20 06:11:02 +00:00
Matt Arsenault	ad14ce84b7	R600/SI: implement range reduction for sin/cos These instructions can only take a limited input range, and return the constant value 1 out of range. We should do range reduction to be able to process arbitrary values. Use a FRACT instruction after normalization to achieve this. Also add a test for constant folding with the lowered code with unsafe-fp-math enabled. v2: use DAG lowering instead of intrinsic, adapt test v3: calculate constant, fold pattern into instruction definition v4: misc style fixes, add sin-fold testcase, cosmetics Patch by Grigori Goronzy llvm-svn: 213458	2014-07-19 18:44:39 +00:00
Matt Arsenault	22ca3f8860	R600/SI: Allow using f32 rcp / rsq when denormals not handled. These are precise enough to use for OpenCL unless denormals are handled. llvm-svn: 213107	2014-07-15 23:50:10 +00:00
Matt Arsenault	0d89e849bd	R600/SI: Fix select on i1 llvm-svn: 213096	2014-07-15 21:44:37 +00:00
Matt Arsenault	e9fa3b8e6b	R600/SI: Implement less wrong f32 fdiv Assuming single precision denormals and accurate sqrt/div are not reported, this passes the OpenCL conformance test. llvm-svn: 213089	2014-07-15 20:18:31 +00:00
Matt Arsenault	762af96f46	R600: Make ShaderType private llvm-svn: 212896	2014-07-13 03:06:39 +00:00
Jan Vesely	2cb62ce2a0	R600: Implement float to long/ulong Use alg. from LegalizeDAG.cpp Move Expand setting to SIISellowering v2: Extend existing tests instead of creating new ones v3: use separate LowerFPTOSINT function v4: use TargetLowering::expandFP_TO_SINT add comment about using FP_TO_SINT for uints Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 212773	2014-07-10 22:40:21 +00:00
Matt Arsenault	d2c9e08b63	R600: Fix mishandling of load / store chains. Fixes various bugs with reordering loads and stores. Scalarized vector loads weren't collecting the chains at all. llvm-svn: 212473	2014-07-07 18:34:45 +00:00
Chandler Carruth	9d010fffe1	[codegen,aarch64] Add a target hook to the code generator to control vector type legalization strategies in a more fine grained manner, and change the legalization of several v1iN types and v1f32 to be widening rather than scalarization on AArch64. This fixes an assertion failure caused by scalarizing nodes like "v1i32 trunc v1i64". As v1i64 is legal it will fail to scalarize v1i32. This also provides a foundation for other targets to have more granular control over how vector types are legalized. Patch by Hao Liu, reviewed by Tim Northover. I'm committing it to allow some work to start taking place on top of this patch as it adds some really important hooks to the backend that I'd like to immediately start using. =] http://reviews.llvm.org/D4322 llvm-svn: 212242	2014-07-03 00:23:43 +00:00
Tom Stellard	10ae6a0e6a	R600: Promote i64 loads to v2i32 llvm-svn: 212216	2014-07-02 20:53:54 +00:00
Tom Stellard	9b3816b5ee	R600: Promote i64 stores to v2i32 Now we need only one 64-bit pattern for stores. llvm-svn: 211643	2014-06-24 23:33:04 +00:00
Matt Arsenault	e54e1c3a21	R600: Move more out of AMDILISelLowering llvm-svn: 211516	2014-06-23 18:00:44 +00:00
Matt Arsenault	b8b5153935	R600/SI: Handle i64 sub. We can handle it the same way as add llvm-svn: 211514	2014-06-23 18:00:38 +00:00
Matt Arsenault	c791f39912	R600: Rename AMDIL file llvm-svn: 211512	2014-06-23 18:00:31 +00:00
Matt Arsenault	dbc9aae1fb	R600/SI: Prettier operand printing for 64-bit ops. Copy what is done for 32-bit already so the order is about the same. llvm-svn: 211186	2014-06-18 17:13:51 +00:00
Matt Arsenault	7aeb813b2a	R600/SI: Temporary fix for f64 fneg This should be a source modifier, but this unblocks most of my math patches. llvm-svn: 211181	2014-06-18 17:05:22 +00:00
Matt Arsenault	364a6747aa	R600/SI: Use v_cvt_f32_ubyte* instructions This eliminates extra extract instructions when loading an i8 vector to a float vector. llvm-svn: 210666	2014-06-11 17:50:44 +00:00
Matt Arsenault	6042506b5c	R600: Use BCNT_INT for evergreen llvm-svn: 210569	2014-06-10 19:18:28 +00:00
Matt Arsenault	8333e4378e	R600/SI: Implement i64 ctpop llvm-svn: 210568	2014-06-10 19:18:24 +00:00
Matt Arsenault	b5b5110b5c	R600/SI: Use bcnt instruction for ctpop llvm-svn: 210567	2014-06-10 19:18:21 +00:00
Matt Arsenault	b2cbf799d1	R600/SI: Handle sign_extend and zero_extend to i64 with patterns. llvm-svn: 210563	2014-06-10 18:54:59 +00:00
Tom Stellard	3ca1bfc728	SelectionDAG: Expand SELECT_CC to SELECT + SETCC This consolidates code from the Hexagon, R600, and XCore targets. No functionality change intended. llvm-svn: 210539	2014-06-10 16:01:22 +00:00
Matt Arsenault	c6f338dd5e	Use nullptr llvm-svn: 210222	2014-06-05 00:01:12 +00:00
Matt Arsenault	08d84943af	Fix typos llvm-svn: 210135	2014-06-03 23:06:13 +00:00
Matt Arsenault	5565f65e13	R600: Add dag combine for BFE llvm-svn: 209461	2014-05-22 18:09:07 +00:00
Tom Stellard	f719ee9e76	R600/SI: Promote f32 SELECT to i32 llvm-svn: 209024	2014-05-16 20:56:41 +00:00
Matt Arsenault	d504a74e3c	Use range for llvm-svn: 208922	2014-05-15 21:44:05 +00:00
Tom Stellard	436780bebb	R600/SI: Stop using VSrc_* as the default register class for types. We now use SReg_* for integer types and VReg_* for floating-point types. This should help simplify the SIFixSGPRCopies pass and no longer causes ISel to insert a COPY after termiator instuctions that output a value. This change is covered by exisitng tests. llvm-svn: 208888	2014-05-15 14:41:57 +00:00
Vincent Lejeune	29c0c210fc	R600/SI: Fold fabs/fneg into src input modifier llvm-svn: 208480	2014-05-10 19:18:39 +00:00
Vincent Lejeune	94af31fbe8	R600/SI: Prettier display of input modifiers llvm-svn: 208479	2014-05-10 19:18:33 +00:00
Vincent Lejeune	79a5834647	R600/SI: Use pseudo instruction for fabs/clamp/fneg llvm-svn: 208478	2014-05-10 19:18:25 +00:00
Tom Stellard	afa8b532b1	R600: Move MIN/MAX matching from LowerOperation() to PerformDAGCombine() llvm-svn: 208429	2014-05-09 16:42:16 +00:00
Tom Stellard	1bd80725b3	R600/SI: Use VALU instructions for copying i1 values We can't use SALU instructions for this since they ignore the EXEC mask and are always executed. This fixes several OpenCV tests. llvm-svn: 207661	2014-04-30 15:31:33 +00:00
Tom Stellard	919bb6b83f	R600/SI: Custom lower SI_IF and SI_ELSE to avoid machine verifier errors SI_IF and SI_ELSE are terminators which also produce a value. For these instructions ISel always inserts a COPY to move their value to another basic block. This COPY ends up between SI_(IF\|ELSE) and the S_BRANCH* instruction at the end of the block. This breaks MachineBasicBlock::getFirstTerminator() and also the machine verifier which assumes that terminators are grouped together at the end of blocks. To solve this we coalesce the copy away right after ISel to make sure there are no instructions in between terminators at the end of blocks. llvm-svn: 207591	2014-04-29 23:12:53 +00:00
Tom Stellard	5f3378879f	R600: Change UDIV/UREM to UDIVREM when legalizing types When legalizing ops, with UDIV/UREM set to expand, they automatically expand to UDIVREM (if legal or custom). We need to do this manually for legalize types. v2: SI should be set to Expand because the type is legal, and it is automatically lowered to UDIVREM if UDIVREM is Legal/Custom R600 should set to UDIV/UREM to Custom because it needs to lower them during type legalization Patch by: Jan Vesely Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 207587	2014-04-29 23:12:43 +00:00
Craig Topper	8c0b4d0791	Convert more SelectionDAG functions to use ArrayRef. llvm-svn: 207397	2014-04-28 05:57:50 +00:00
Craig Topper	131de82adb	Convert SelectionDAG::MorphNodeTo to use ArrayRef. llvm-svn: 207378	2014-04-27 19:21:16 +00:00
Craig Topper	64941d9786	Convert SelectionDAG::getMergeValues to use ArrayRef. llvm-svn: 207374	2014-04-27 19:20:57 +00:00
Craig Topper	206fcd450a	Convert getMemIntrinsicNode to take ArrayRef of SDValue instead of pointer and size. llvm-svn: 207329	2014-04-26 19:29:41 +00:00
Craig Topper	48d114bed1	Convert SelectionDAG::getNode methods to use ArrayRef<SDValue>. llvm-svn: 207327	2014-04-26 18:35:24 +00:00
Craig Topper	062a2baef0	[C++] Use 'nullptr'. Target edition. llvm-svn: 207197	2014-04-25 05:30:21 +00:00
Matt Arsenault	1018c897f6	R600/SI: Use address space in allowsUnalignedMemoryAccesses llvm-svn: 207126	2014-04-24 17:08:26 +00:00
Matt Arsenault	5dbd5db518	R600: Make sign_extend_inreg legal. Don't know why I didn't just do this in the first place. llvm-svn: 206862	2014-04-22 03:49:30 +00:00
Matt Arsenault	27cc958dff	R600/SI: Match sign_extend_inreg to s_sext_i32_i8 and s_sext_i32_i16 llvm-svn: 206547	2014-04-18 01:53:18 +00:00
Tom Stellard	868fd92e54	R600/SI: Stop using i128 as the resource descriptor type Having i128 as a legal type complicates the legalization phase. v4i32 is already a legal type, so we will use that instead. This fixes several piglit tests. llvm-svn: 206500	2014-04-17 21:00:11 +00:00
Tom Stellard	334b29c7f6	R600/SI: Change default register class for i32 to SReg_32 SIFixSGPRCopies is smart enough to handle this now. llvm-svn: 206499	2014-04-17 21:00:09 +00:00
Matt Arsenault	a90d22fad5	R600/SI: f64 frint is legal on CI llvm-svn: 206475	2014-04-17 17:06:37 +00:00
Matt Arsenault	51df0c1965	R600/SI: Fix zext from i1 to i64 llvm-svn: 206437	2014-04-17 02:03:08 +00:00
Craig Topper	abb4ac7f87	Convert SelectionDAG::getVTList to use ArrayRef llvm-svn: 206357	2014-04-16 06:10:51 +00:00
Matt Arsenault	4e46665a80	R600: Expand sign extension of vectors. Setting vector types to expand will result in scalarization on pre SI hw, as those gpus don't have vector shifts either. Expand also i32 vectors, this helps llvm make the correct decision about scalarizing the vector ops. v2: move setOperation() calls to R600ISelLowering.cpp. cleanup the SI code to make it obvious that this patch does is nop for SI Patch by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 206348	2014-04-16 01:41:30 +00:00
Matt Arsenault	470acd81a8	R600/SI: Fix loads of i1 llvm-svn: 206330	2014-04-15 22:28:39 +00:00
Matt Arsenault	e1f030ca66	R600: Check if a sextload should be used for parameter loads. Through some oddity where truncate (sextload x) isn't folded into an anyextload for vectors, the sextload remains if the vector isn't immediately scalarized. This keeps the expected zextload instructions in the kernel-args test when small type vectors aren't scalarized. llvm-svn: 206070	2014-04-11 20:59:54 +00:00
Tom Stellard	50122a5890	R600: Match 24-bit arithmetic patterns in a Target DAGCombine Moving these patterns from TableGen files to PerformDAGCombine() should allow us to generate better code by eliminating unnecessary shifts and extensions earlier. This also fixes a bug where the MAD pattern was calling SimplifyDemandedBits with a 24-bit mask on the first operand even when the full pattern wasn't being matched. This occasionally resulted in some instructions being incorrectly deleted from the program. v2: - Fix bug with 64-bit mul llvm-svn: 205731	2014-04-07 19:45:41 +00:00
Matt Arsenault	4be76e99fe	Use std::swap llvm-svn: 205723	2014-04-07 16:44:26 +00:00
Tom Stellard	7ed0b5235a	R600/SI: Lower 64-bit immediates using REG_SEQUENCE llvm-svn: 205561	2014-04-03 20:19:27 +00:00
Matt Arsenault	f751d6272d	Change shouldSplitVectorElementType to better match the description. Pass the entire vector type, and not just the element. llvm-svn: 205247	2014-03-31 20:54:58 +00:00
Matt Arsenault	d7bdcc46a6	R600/SI: Implement shouldConvertConstantLoadToIntImm llvm-svn: 205244	2014-03-31 19:54:27 +00:00
Tom Stellard	7ea3d6d420	R600/SI: Lower i64 SELECT by bitcasting to a vector type This allows allows us to replace ISD::EXTRACT_ELEMENT, which is lowered using shifts, with ISD::EXTRACT_VECTOR_ELT, which is a no-op. llvm-svn: 205187	2014-03-31 14:01:55 +00:00
Matt Arsenault	ad41d7b531	R600/SI: Fix 64-bit private loads. llvm-svn: 204630	2014-03-24 17:50:46 +00:00
Tom Stellard	da99c6eff5	R600/SI: Promote fp64 SELECT to i64 This type promotion is replacing a Tablegen pattern and it is already covered by existing tests. llvm-svn: 204617	2014-03-24 16:07:30 +00:00
Tom Stellard	1583409e33	R600/SI: Handle MUBUF instructions in SIInstrInfo::moveToVALU() llvm-svn: 204476	2014-03-21 15:51:57 +00:00
Tom Stellard	def38c567d	R600/SI: Use SGPR_(32\|64) reg clases when lowering SI_ADDR64_RSRC The SReg_(32\|64) register classes contain special registers in addition to the numbered SGPRs. This can lead to machine verifier errors when these register classes are used as sub-registers for SReg_128, since SReg_128 only uses the numbered SGPRs. Replacing SReg_(32\|64) with SGPR_(32\|64) fixes this problem, since the SGPR_(32\|64) register classes contain only numbered SGPRs. Tests cases for this are comming in a later commit. llvm-svn: 204474	2014-03-21 15:51:53 +00:00
Tom Stellard	1c8788ef5a	R600/SI: Custom lower i1 stores These are sometimes created by the shrink to boolean optimization in the globalopt pass. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 203280	2014-03-07 20:12:33 +00:00
Matt Arsenault	f9a995d68c	R600: Fix extloads from i8 / i16 to i64. This appears to only be working for global loads. Private and local break for other reasons. llvm-svn: 203135	2014-03-06 17:34:12 +00:00
Tom Stellard	d61a1c3360	R600/SI: Expand all v16[if]32 operations llvm-svn: 202543	2014-02-28 21:36:37 +00:00
Tom Stellard	1f15bff0df	R600/SI: Custom select 64-bit ADD llvm-svn: 202194	2014-02-25 21:36:18 +00:00
Matt Arsenault	a81aee8277	Fix unused variable llvm-svn: 202080	2014-02-24 21:16:50 +00:00
Matt Arsenault	41e2f2bacd	R600/SI - Add new CI arithmetic instructions. Does not yet include larger part required to match v_mad_i64_i32 / v_mad_u64_u32. llvm-svn: 202077	2014-02-24 21:01:28 +00:00
Tom Stellard	967bf5813f	R600/SI: Expand all v8[if]32 operations llvm-svn: 201371	2014-02-13 23:34:15 +00:00
Tom Stellard	80be9650e3	R600/SI: Split global vector loads with more than 4 elements llvm-svn: 201368	2014-02-13 23:34:10 +00:00
Matt Arsenault	25793a3f22	Add address space argument to allowsUnalignedMemoryAccess. On R600, some address spaces have more strict alignment requirements than others. llvm-svn: 200887	2014-02-05 23:15:53 +00:00
Tom Stellard	0ec134f3d6	R600/SI: Custom lower i64 ISD::SELECT llvm-svn: 200774	2014-02-04 17:18:40 +00:00
Alp Toker	cb40291100	Fix known typos Sweep the codebase for common typos. Includes some changes to visible function names that were misspelt. llvm-svn: 200018	2014-01-24 17:20:08 +00:00
Tom Stellard	04c0e9851b	R600: Add support for global addresses with constant initializers llvm-svn: 199825	2014-01-22 19:24:21 +00:00
Tom Stellard	e93736057f	R600/SI: Add support for i8 and i16 private loads/stores llvm-svn: 199823	2014-01-22 19:24:14 +00:00
Matt Arsenault	a98cd6a56e	R600/SI: Make private pointers be 32-bit. Different sized address spaces should theoretically work most of the time now, and since 64-bit add is currently disabled, using more 32-bit pointers fixes some cases. llvm-svn: 197659	2013-12-19 05:32:55 +00:00
Tom Stellard	c0845334da	R600/SI: Fixing handling of condition codes We were ignoring the ordered/onordered bits and also the signed/unsigned bits of condition codes when lowering the DAG to MachineInstrs. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195514	2013-11-22 23:07:58 +00:00
Matt Arsenault	fb826fa6e1	R600/SI: Implement add i64, but do not yet enable. Test doesn't actually check the output. I need to fix add i64 being matched for the addressing calculations. llvm-svn: 195040	2013-11-18 20:09:47 +00:00
Matt Arsenault	e8d214662a	R600/SI: addc / adde i32 are legal llvm-svn: 195038	2013-11-18 20:09:40 +00:00
Tom Stellard	81d871dee3	R600/SI: Add support for private address space load/store Private address space is emulated using the register file with MOVRELS and MOVRELD instructions. llvm-svn: 194626	2013-11-13 23:36:50 +00:00
Tom Stellard	03a5c08de6	R600/SI: Replace ffs(x) - 1 with countTrailingZeros(x) ffs(x) broke the mingw buildbot. llvm-svn: 193225	2013-10-23 03:50:25 +00:00
Tom Stellard	54774e5681	R600/SI: fix MIMG writemask adjustement This fixes piglit: - shaders/glsl-fs-texture2d-masked - shaders/glsl-fs-texture2d-masked-4 Patch by: Marek Olšák Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 193222	2013-10-23 02:53:47 +00:00
Tom Stellard	af77543244	R600: Fix handling of vector kernel arguments The SelectionDAGBuilder was promoting vector kernel arguments to legal types, but this won't work for R600 and SI since kernel arguments are stored in memory and can't be promoted. In order to handle vector arguments correctly we need to look at the original types from the LLVM IR function. llvm-svn: 193215	2013-10-23 00:44:32 +00:00
Vincent Lejeune	5d6c2c318b	R600/SI: Remove some leftover MI dump call llvm-svn: 192743	2013-10-15 22:48:51 +00:00
Vincent Lejeune	d623644d17	R600/SI: Support byval arguments llvm-svn: 192555	2013-10-13 17:56:16 +00:00
Matt Arsenault	1408b60291	Fix typo llvm-svn: 192406	2013-10-10 23:05:37 +00:00
Tom Stellard	682bfbc43d	R600/SI: Define a separate MIMG instruction for each possible output value type During instruction selection, we rewrite the destination register class for MIMG instructions based on their writemasks. This creates machine verifier errors since the new register class does not match the register class in the MIMG instruction definition. We can avoid this by defining different MIMG instructions for each possible destination type and then switching to the correct instruction when we change the register class. llvm-svn: 192365	2013-10-10 17:11:24 +00:00
Tom Stellard	afcf12f33a	R600/SI: expose TBUFFER_STORE_FORMAT_* for OpenGL transform feedback For _XYZ, the type of VDATA is v4i32, because v3i32 doesn't exist. The ADDR64 bit is not exposed. A simpler intrinsic that doesn't take a resource descriptor might be nicer. The maximum number of input SGPRs is bumped to 17. Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 190575	2013-09-12 02:55:14 +00:00
Matt Arsenault	6f24379974	R600: Fix i64 to i32 trunc on SI llvm-svn: 190091	2013-09-05 19:41:10 +00:00
Tom Stellard	35bb18c2a7	R600: Add support for vector local memory loads llvm-svn: 189226	2013-08-26 15:06:04 +00:00
Tom Stellard	fd155828ed	SelectionDAG: Use correct pointer size when lowering function arguments v2 This adds minimal support to the SelectionDAG for handling address spaces with different pointer sizes. The SelectionDAG should now correctly lower pointer function arguments to the correct size as well as generate the correct code when lowering getelementptr. This patch also updates the R600 DataLayout to use 32-bit pointers for the local address space. v2: - Add more helper functions to TargetLoweringBase - Use CHECK-LABEL for tests llvm-svn: 189221	2013-08-26 15:05:36 +00:00
Benjamin Kramer	a8eecee121	R600: Allocate memoperand in the MachienFunction so it doesn't leak. llvm-svn: 188555	2013-08-16 14:48:09 +00:00
Tom Stellard	d86003e31f	R600/SI: Improve legalization of vector operations This should fix hangs in the OpenCL piglit tests. llvm-svn: 188431	2013-08-14 23:25:00 +00:00
Tom Stellard	6785065ace	R600/SI: Replace v1i32 type with i32 in imageload and sample intrinsics llvm-svn: 188430	2013-08-14 23:24:53 +00:00
Tom Stellard	9fa1791a1b	R600/SI: Convert v16i8 resource descriptors to i128 Now that compute support is better on SI, we can't continue using v16i8 for descriptors since this is also a legal type in OpenCL. This patch fixes numerous hangs with the piglit OpenCL test and since we now use a target specific DAG node for LOAD_CONSTANT with the correct MemOperandFlags, this should also fix: https://bugs.freedesktop.org/show_bug.cgi?id=66805 llvm-svn: 188429	2013-08-14 23:24:45 +00:00
Tom Stellard	16a9a205c8	R600/SI: Assign a register class to the $vaddr operand for MIMG instructions The previous code declared the operand as unknown:$vaddr, which made it possible for scalar registers to be used instead of vector registers. llvm-svn: 188425	2013-08-14 23:24:17 +00:00
Niels Ole Salscheider	d3a039fed2	R600/SI: FMA is faster than fmul and fadd for f64 llvm-svn: 188136	2013-08-10 10:38:54 +00:00
Niels Ole Salscheider	719fbc9ae7	R600/SI: Implement fp32<->fp64 conversions llvm-svn: 187988	2013-08-08 16:06:15 +00:00
Tom Stellard	2f7cdda57e	R600/SI: Use VSrc_* register classes as the default classes for types Since the VSrc_* register classes contain both VGPRs and SGPRs, copies that used be emitted by isel like this: SGPR = COPY VGPR Will now be emitted like this: VSrC = COPY VGPR This patch also adds a pass that tries to identify and fix situations where a VGPR to SGPR copy may occur. Hopefully, these changes will make it impossible for the compiler to generate illegal VGPR to SGPR copies. llvm-svn: 187831	2013-08-06 23:08:28 +00:00
Tom Stellard	4c0ffccbbf	R600/SI: Add more special cases for opcodes to ensureSRegLimit() Also factor out the register class lookup to its own function. llvm-svn: 187830	2013-08-06 23:08:18 +00:00
Tom Stellard	98f675a994	R600/SI: Custom lower i64 ZERO_EXTEND llvm-svn: 187580	2013-08-01 15:23:26 +00:00
Tom Stellard	9f95033d33	R600: Improve support for < 32-bit loads Reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 186921	2013-07-23 01:48:35 +00:00
Tom Stellard	8374720aad	R600/SI: Fix crash with VSELECT https://bugs.freedesktop.org/show_bug.cgi?id=66175 llvm-svn: 186616	2013-07-18 21:43:53 +00:00
Tom Stellard	31209cc8eb	R600/SI: Add support for 64-bit loads https://bugs.freedesktop.org/show_bug.cgi?id=65873 llvm-svn: 186339	2013-07-15 19:00:09 +00:00
Tom Stellard	2a6a610516	R600/SI: Add double precision fsub pattern for SI Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186179	2013-07-12 18:15:08 +00:00
Tom Stellard	7512c0803c	R600/SI: Add initial double precision support for SI Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186177	2013-07-12 18:14:56 +00:00
Michel Danzer	49812b5bbd	R600/SI: Initial local memory support Enough for the radeonsi driver to use it for calculating derivatives. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186012	2013-07-10 16:37:07 +00:00
Aaron Watry	0a794a4612	R600: Consolidate expansion of v2i32/v4i32 ops for EG/SI By default, we expand these operations for both EG and SI. Move the duplicated code into a common space for now. If the targets ever actually implement these operations as instructions, we can override that in the relevant target. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184848	2013-06-25 13:55:57 +00:00
Aaron Watry	daabb20e1b	R600/SI: Expand xor v2i32/v4i32 Add test cases for both vector sizes on SI and also add v2i32 test for EG. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184846	2013-06-25 13:55:52 +00:00
Aaron Watry	83fa6006bc	R600/SI: Expand urem of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Note: I followed the guidance of the v4i32 EG check... UREM produces really complex code, so let's just check that the instruction was lowered successfully. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184844	2013-06-25 13:55:46 +00:00
Aaron Watry	5527b6c6b6	R600/SI: Expand udiv v[24]i32 for SI and v2i32 for EG Also add lit test for both cases on SI, and v2i32 for evergreen. Note: I followed the guidance of the v4i32 EG check... UDIV produces really complex code, so let's just check that the instruction was lowered successfully. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184843	2013-06-25 13:55:43 +00:00
Aaron Watry	16d80c0529	R600/SI: Expand ashr of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184842	2013-06-25 13:55:40 +00:00
Aaron Watry	f63791e778	R600/SI: Expand srl of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184841	2013-06-25 13:55:37 +00:00
Aaron Watry	5584553984	R600/SI: Expand shl of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184840	2013-06-25 13:55:32 +00:00
Aaron Watry	2fa162e88e	R600/SI: Expand or of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184839	2013-06-25 13:55:29 +00:00
Aaron Watry	265eef5efe	R600/SI: Expand mul of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184838	2013-06-25 13:55:26 +00:00
Aaron Watry	00aeb119db	R600/SI: Expand and of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184837	2013-06-25 13:55:23 +00:00
Tom Stellard	0125f2a6e4	R600/SI: Report unaligned memory accesses as legal for > 32-bit types In reality, some unaligned memory accesses are legal for 32-bit types and smaller too, but it all depends on the address space. Allowing unaligned loads/stores for > 32-bit types is mainly to prevent the legalizer from splitting one load into multiple loads of smaller types. https://bugs.freedesktop.org/show_bug.cgi?id=65873 llvm-svn: 184822	2013-06-25 02:39:35 +00:00
Tom Stellard	96d38760fc	R600/SI: Expand sub for v2i32 and v4i32 for SI Also add a v2i32 test to the existing v4i32 test. Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry<awatry@gmail.com> llvm-svn: 184482	2013-06-20 21:55:37 +00:00
Tom Stellard	043795e818	R600/SI: Expand add for v2i32 and v4i32 Also add SI tests to existing file and a v2i32 test for both R600 and SI. Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> llvm-svn: 184481	2013-06-20 21:55:30 +00:00
Tom Stellard	a6c6e1bfc2	R600: Rework subtarget info and remove AMDILDevice classes This should simplify the subtarget definitions and make it easier to add new ones. Reviewed-by: Vincent Lejeune <vljn@ovi.com> llvm-svn: 183566	2013-06-07 20:37:48 +00:00
Bill Wendling	37e9adb091	Don't cache the instruction and register info from the TargetMachine, because the internals of TargetMachine could change. No functionality change intended. llvm-svn: 183561	2013-06-07 20:28:55 +00:00
Tom Stellard	acec99c948	R600: Replace predicate loop with predicate function llvm-svn: 183351	2013-06-05 23:39:50 +00:00
Tom Stellard	94593ee8c3	R600/SI: Add support for work item and work group intrinsics llvm-svn: 183138	2013-06-03 17:40:18 +00:00
Tom Stellard	ed882c2f1b	R600/SI: Add a calling convention for compute shaders llvm-svn: 183137	2013-06-03 17:40:11 +00:00
Tom Stellard	046039e81b	R600/SI: Custom lower i64 sign_extend llvm-svn: 183136	2013-06-03 17:40:03 +00:00
Tom Stellard	0518ff89ba	R600/SI: Adjust some instructions' out register class after ISel This is necessary to avoid generating VGPR to SGPR copies in some cases. llvm-svn: 183135	2013-06-03 17:39:58 +00:00
Tom Stellard	bad1f59212	R600/SI: Handle REG_SEQUENCE in fitsRegClass() llvm-svn: 183134	2013-06-03 17:39:54 +00:00
Tom Stellard	b5a97004fb	R600/SI: Handle nodes with glue results correctly SITargetLowering::foldOperands() llvm-svn: 183133	2013-06-03 17:39:50 +00:00
Tom Stellard	556d9aa841	R600/SI: Rework MUBUF store instructions The lowering of stores is now mostly handled in the tablegen files. No more BUFFER_STORE nodes I generated during legalization. llvm-svn: 183130	2013-06-03 17:39:37 +00:00
Andrew Trick	ef9de2a739	Track IR ordering of SelectionDAG nodes 2/4. Change SelectionDAG::getXXXNode() interfaces as well as call sites of these functions to pass in SDLoc instead of DebugLoc. llvm-svn: 182703	2013-05-25 02:42:55 +00:00
Benjamin Kramer	d78bb468bd	Move passes from namespace llvm into anonymous namespaces. Sort includes while there. llvm-svn: 182594	2013-05-23 17:10:37 +00:00
Benjamin Kramer	635e368e33	R600: Hide symbols of implementation details. Also removes an unused function. llvm-svn: 182587	2013-05-23 15:43:05 +00:00
Rafael Espindola	21ea01d132	Attempt to fix the mingw32 bot. This should hopefully fix http://lab.llvm.org:8011/builders/clang-x86_64-darwin11-self-mingw32 llvm-svn: 182446	2013-05-22 02:30:47 +00:00
Tom Stellard	b35efba4d9	R600/SI: Make fitsRegClass() operands const Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182282	2013-05-20 15:02:01 +00:00
Matt Arsenault	75865923c9	Add LLVMContext argument to getSetCCResultType llvm-svn: 182180	2013-05-18 00:21:46 +00:00
Christian Konig	b7be72df5b	R600/SI: return undef instead of null for skipped arguments This is a candidate for the stable branch. Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=64694 Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182084	2013-05-17 09:46:48 +00:00
Tom Stellard	e363dbf7eb	R600/SI: Handle arbitrary destination type in SITargetLowering::adjustWritemask Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 181268	2013-05-06 23:02:15 +00:00
Michael Liao	b53d8963ce	ArrayRefize getMachineNode(). No functionality change. llvm-svn: 179901	2013-04-19 22:22:57 +00:00
Christian Konig	8b1ed28ef1	R600/SI: dynamical figure out the reg class of MIMG Depending on the number of bits set in the writemask. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 179166	2013-04-10 08:39:16 +00:00
Christian Konig	8e06e2a8c4	R600/SI: adjust writemask to only the used components Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 179165	2013-04-10 08:39:08 +00:00

1 2 3 4 5 ...

282 Commits