llvm-project

Commit Graph

Author	SHA1	Message	Date
Qiu Chaofan	7315d221a2	[PowerPC] Exploit vnmsubfp instruction On PowerPC, we have vnmsubfp Altivec instruction for fnmsub operation on v4f32 type. Default pattern for this instruction never works since we don't have legal fneg for v4f32 when VSX disabled. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D80617	2020-06-14 23:19:17 +08:00
Masoud Ataei	2d038370bb	DAGCombiner optimization for pow(x,0.75) and pow(x,0.25) on double and single precision even in case massv function is asked Here, I am proposing to add an special case for massv powf4/powd2 function (SIMD counterpart of powf/pow function in MASSV library) in MASSV pass to get later optimizations like conversion from pow(x,0.75) and pow(x,0.25) for double and single precision to sequence of sqrt's in the DAGCombiner in vector float case. My reason for doing this is: the optimized pow(x,0.75) and pow(x,0.25) for double and single precision to sequence of sqrt's is faster than powf4/powd2 on P8 and P9. In case MASSV functions is called, and if the exponent of pow is 0.75 or 0.25, we will get the sequence of sqrt's and if exponent is not 0.75 or 0.25 we will get the appropriate MASSV function. Reviewed By: steven.zhang Tags: #LLVM #PowerPC Differential Revision: https://reviews.llvm.org/D80744	2020-06-12 10:02:16 -04:00
Chen Zheng	9b6e86a1a5	[PowerPC] refactor convertToImmediateForm - NFC This is a NFC patch to make convertToImmediateForm a light wrapper for converting xform and imm form instructions on PowerPC. Reviewed By: Steven.zhang Differential Revision: https://reviews.llvm.org/D80907	2020-06-12 03:57:54 -04:00
diggerlin	c6be3ea524	[NFC] clean up the AsmPrinter::emitLinkage for AIX part SUMMARY: Since we deal with aix emitLinkage in the PPCAIXAsmPrinter::emitLinkage() in the patch https://reviews.llvm.org/D75866. It do not go to AsmPrinter::emitLinkage() any more, we clean up some aix related code in the AsmPrinter::emitLinkage() Reviewers: Jason liu Differential Revision: https://reviews.llvm.org/D81613	2020-06-11 13:33:51 -04:00
Sam Parker	fa8bff0cd1	[CostModel] Unify getArithmeticInstrCost Add the remaining arithmetic opcodes into the generic implementation of getUserCost and then call this from getInstructionThroughput. Most of the backends have been modified to return the base implementation for cost kinds other RecipThroughput. The outlier here is AMDGPU which already uses getArithmeticInstrCost for all the cost kinds. This change means that most of the opcodes can be removed from that backends implementation of getUserCost. Differential Revision: https://reviews.llvm.org/D80992	2020-06-10 09:08:45 +01:00
diggerlin	edd819c757	[AIX] supporting the visibility attribute for aix assembly SUMMARY: in the aix assembly , it do not have .hidden and .protected directive. in current llvm. if a function or a variable which has visibility attribute, it will generate something like the .hidden or .protected , it can not recognize by aix as. in aix assembly, the visibility attribute are support in the pseudo-op like .extern Name [ , Visibility ] .globl Name [, Visibility ] .weak Name [, Visibility ] in this patch, we implement the visibility attribute for the global variable, function or extern function . for example. extern __attribute__ ((visibility ("hidden"))) int bar(int* ip); __attribute__ ((visibility ("hidden"))) int b = 0; __attribute__ ((visibility ("hidden"))) int foo(int* ip){ return (*ip)++; } the visibility of .comm linkage do not support , we will have a separate patch for it. we have the unsupported cases ("default" and "internal") , we will implement them in a a separate patch for it. Reviewers: Jason Liu ,hubert.reinterpretcast,James Henderson Differential Revision: https://reviews.llvm.org/D75866	2020-06-09 16:15:06 -04:00
Sam Parker	37289615c0	[NFCI][CostModel] Unify getCmpSelInstrCost Add cases for icmp, fcmp and select into the switch statement of the generic getUserCost implementation with getInstructionThroughput then calling into it. The BasicTTI and backend implementations have be set to return a default value (1) when a cost other than throughput is being queried. Differential Revision: https://reviews.llvm.org/D80550	2020-06-09 07:41:22 +01:00
Kang Zhang	e3546c78ca	[NFC][PowerPC] Remove the redundant InstAlias for OR instruction Summary: We have handle the InstAlias for OR instructions, but we handle it agagin in PPCInstPrinter.cpp. This patch is to Remove the redundant InstAlias for OR instruction. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D80502	2020-06-09 03:32:27 +00:00
Chen Zheng	8aa52b19a7	[APInt] set all bits for getBitsSetWithWrap if loBit == hiBit differentiate getBitsSetWithWrap & getBitsSet when loBit == hiBit getBitsSetWithWrap sets all bits; getBitsSet does nothing. Reviewed By: lkail, RKSimon, lebedev.ri Differential Revision: https://reviews.llvm.org/D81325	2020-06-08 22:55:24 -04:00
Anil Mahmud	246d106094	[PowerPC] Fix pattern for DCBFL/DCBFLP instrinsics. The previous implementation used "asm parser only" pseudo-instructions in their output patterns. Those are not meant to emit code and will caused crashes when built with -filetype=obj. Differential Revision: https://reviews.llvm.org/D80151	2020-06-08 20:54:59 -05:00
Anil Mahmud	c9790d54f8	[PowerPC] Remove extra instruction left by emitRLDICWhenLoweringJumpTables The function emitRLDICWhenLoweringJumpTables in PPCMIPeephole.cpp was supposed to convert a pair of RLDICL and RLDICR to a single RLDIC, but it was leaving out the RLDICL instruction. This PR fixes the bug. Differential Revision: https://reviews.llvm.org/D78063	2020-06-08 20:43:56 -05:00
Stefan Pintilie	b4036329f1	[PowerPC] Fix incorrect PC Relative relocations for Big Endian Fix the incorrect PC Relative relocations for Big Endian for 34 bit offsets. The offset should be zero for both BE and LE in this situation. Differential Revision: https://reviews.llvm.org/D81033	2020-06-08 20:29:43 -05:00
Sam Parker	772349de88	[PPC] Try to fix builbots Attempt to handle unsupported types, such as structs, in getMemoryOpCost. The backend now checks for a supported type and calls into BasicTTI as a fallback. BasicTTI will now also perform the same check and will default to an expensive cost of 4 for 'Other' MVTs. Differential Revision: https://reviews.llvm.org/D80984	2020-06-08 09:13:37 +01:00
Guillaume Chatelet	1778564f91	[Alignment][NFC] Migrate the rest of backends Summary: This is a followup on D81196 Reviewers: courbet Subscribers: arsenm, dschuff, jyknight, dylanmckay, sdardis, nemanjai, jvesely, nhaehnle, sbc100, jgravelle-google, hiraditya, aheejin, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81278	2020-06-08 07:17:20 +00:00
QingShan Zhang	3f0cc7ac5e	[NFC] Remove the extra ; to avoid the warning of build compiler	2020-06-08 03:51:05 +00:00
Nemanja Ivanovic	a56d057dfe	[PowerPC] Do not assume operand of ADDI is an immediate After pseudo-expansion, we may end up with ADDI (add immediate) instructions where the operand is not an immediate but a relocation. For such instructions, attempts to get the immediate result in assertion failures for obvious reasons. Fixes: https://bugs.llvm.org/show_bug.cgi?id=45432	2020-06-07 22:18:31 -05:00
QingShan Zhang	f8eabd6d01	[Power9] Add addi post-ra scheduling heuristic The instruction addi is usually used to post increase the loop indvar, which looks like this: label_X: load x, base(i) ... y = op x ... i = addi i, 1 goto label_X However, for PowerPC, if there are too many vsx instructions that between y = op x and i = addi i, 1, it will use all the hw resource that block the execution of i = addi, i, 1, which result in the stall of the load instruction in next iteration. So, a heuristic is added to move the addi as early as possible to have the load hide the latency of vsx instructions, if other heuristic didn't apply to avoid the starve. Reviewed By: jji Differential Revision: https://reviews.llvm.org/D80269	2020-06-08 01:31:07 +00:00
Stefan Pintilie	8dbf5a9501	[PowerPC] Remove extra nop after notoc call Calls that are marked as @notoc do not require the extra nop after the call for the TOC restore. Differential Revision: https://reviews.llvm.org/D81081	2020-06-05 06:47:44 -05:00
Sam Parker	9303546b42	[CostModel] Unify getMemoryOpCost Use getMemoryOpCost from the generic implementation of getUserCost and have getInstructionThroughput return the result of that for loads and stores. This also means that the X86 implementation of getUserCost can be removed with the functionality folded into its getMemoryOpCost. Differential Revision: https://reviews.llvm.org/D80984	2020-06-05 10:13:38 +01:00
Qiu Chaofan	7a001a2d92	[PowerPC] Require nsz flag for c-ab to FNMSUB On PowerPC, FNMSUB (both VSX and non-VSX version) means -(ab-c). But the backend used to generate these instructions regardless whether nsz flag exists or not. If a*b-c==0, such transformation changes sign of zero. This patch introduces PPC specific FNMSUB ISD opcode, which may help improving combined FMA code sequence. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D76585	2020-06-04 16:41:27 +08:00
David Tenty	d20fdcabf8	[AIX] Update data directives for AIX assembly Summary: The standard data emission directives (e.g. .short, .long) in the AIX assembler have the unintended consequence of aligning their output to the natural byte boundary. This cause problems because we aren't expecting behavior from the DatabitsDirectives, so the final alignment of data isn't correct in some cases on AIX. This patch updated the DatabitsDirectives to use .vbyte pseudo-ops instead to emit the data, since we will emit the .align directives as needed. We update the existing testcases and add a test for emission of struct data. Reviewers: hubert.reinterpretcast, Xiangling_L, jasonliu Reviewed By: hubert.reinterpretcast, jasonliu Subscribers: wuzish, nemanjai, hiraditya, kbarton, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80934	2020-06-03 10:55:59 -04:00
QingShan Zhang	a462561cee	[NFC][PowerPC] Remove unused node PPCISD::VMADDFP and PPCISD::VNMSUBFP These two nodes were added by `69caef2b78` in 2005 and they are not used by PowerPC backend anymore. And the ISD::FMA is a prefer way for VMADDFP if we really want to create that node. For VNMSUBFP, we will also add a more generic node FNMSUB in D76585 if we really want it. Reviewed By: qiucf Differential Revision: https://reviews.llvm.org/D80429	2020-06-03 06:36:30 +00:00
Li Rong Yi	3101601b54	[PowerPC] Exploit vabsd on P9 Summary: Exploit vabsd* for for absolute difference of vectors on P9, for example: void foo (char restrict p, char restrict q, char *restrict t) { for (int i = 0; i < 16; i++) t[i] = abs (p[i] - q[i]); } this case should be matched to the HW instruction vabsdub. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D80271	2020-06-01 02:30:27 +00:00
Zequan Wu	80e107ccd0	Add NoMerge MIFlag to avoid MIR branch folding Let the codegen recognized the nomerge attribute and disable branch folding when the attribute is given Differential Revision: https://reviews.llvm.org/D79537	2020-05-29 12:31:06 -07:00
Xiangling Liao	26604d06b6	[AIX] Emit AvailableExternally Linkage on AIX Since on AIX, our strategy is to not use -u to suppress any undefined symbols, we need to emit .extern for the symbols with AvailableExternally linkage. Differential Revision: https://reviews.llvm.org/D80642	2020-05-29 13:12:59 -04:00
Lei Huang	2368bf52cd	[PowerPC] Add support for -mcpu=pwr10 in both clang and llvm Summary: This patch simply adds support for the new CPU in anticipation of Power10. There isn't really any functionality added so there are no associated test cases at this time. Reviewers: stefanp, nemanjai, amyk, hfinkel, power-llvm-team, #powerpc Reviewed By: stefanp, nemanjai, amyk, #powerpc Subscribers: NeHuang, steven.zhang, hiraditya, llvm-commits, wuzish, shchenz, cfe-commits, kbarton, echristo Tags: #clang, #powerpc, #llvm Differential Revision: https://reviews.llvm.org/D80020	2020-05-27 13:14:25 -05:00
Lei Huang	559845f8fe	Revert "[PowerPC] Add support for -mcpu=pwr10 in both clang and llvm" This reverts commit `7eb666b155`.	2020-05-27 09:40:21 -05:00
Lei Huang	7eb666b155	[PowerPC] Add support for -mcpu=pwr10 in both clang and llvm Summary: This patch simply adds support for the new CPU in anticipation of Power10. There isn't really any functionality added so there are no associated test cases at this time. Reviewers: stefanp, nemanjai, amyk, hfinkel, power-llvm-team, #powerpc Reviewed By: stefanp, nemanjai, amyk, #powerpc Subscribers: NeHuang, steven.zhang, hiraditya, llvm-commits, wuzish, shchenz, cfe-commits, kbarton, echristo Tags: #clang, #powerpc, #llvm Differential Revision: https://reviews.llvm.org/D80020	2020-05-26 13:48:22 -05:00
Sean Fertile	3e62289f42	[PowerPC][NFC] Add colon to TODO's and fix indentation.	2020-05-26 13:33:32 -04:00
Sean Fertile	d6c8736287	[PowerPC][AIX] Spill CSRs to the ABI specified stack offsets. Extend the CSR save/restore insertion code to support both 32-bit and 64-bit AIX. Differential Revision: https://reviews.llvm.org/D79252	2020-05-26 12:24:29 -04:00
Nemanja Ivanovic	099a875f28	[PowerPC] Unaligned FP default should apply to scalars only As reported in PR45186, we could be in a situation where we don't want to handle unaligned memory accesses for FP scalars but still have VSX (which allows unaligned access for vectors). Change the default to only apply to scalars. Fixes: https://bugs.llvm.org/show_bug.cgi?id=45186	2020-05-26 10:19:06 -05:00
Sam Parker	8aaabadece	[CostModel] Unify getCastInstrCost Add the remaining cast instruction opcodes to the base implementation of getUserCost and directly return the result. This allows getInstructionThroughput to return getUserCost for the casts. This has required changes to PPC and SystemZ because they implement getUserCost and/or getCastInstrCost with adjustments for vector operations. Adjusts have also been made in the remaining backends that implement the method so that they still produce a cost of zero or one for cost kinds other than throughput. Differential Revision: https://reviews.llvm.org/D79848	2020-05-26 11:29:57 +01:00
Nemanja Ivanovic	793cc518b9	[PowerPC] Prevent legalization loop from promoting SELECT_CC from v4i32 to v4i32 As reported in https://bugs.llvm.org/show_bug.cgi?id=45709 we can hit an infinite loop in legalization since we set the legalization action for ISD::SELECT_CC for all fixed length vector types to Promote. Without some different legalization action for the type being promoted to, the legalizer simply loops. Since we don't have patterns to match the node, the right legalization action should be Expand. Differential revision: https://reviews.llvm.org/D79854	2020-05-25 20:09:07 -05:00
Stefan Pintilie	5a4bcec8db	[PowerPC][NFC] Split PPCELFStreamer::emitInstruction Split off PPCELFStreamer::emitPrefixedInstruction from PPCELFStreamer::emitInstruction. Differential Revision: https://reviews.llvm.org/D79626	2020-05-25 06:48:58 -05:00
Kang Zhang	86e3abc9e6	[PowerPC] Add some InstAlias definitions Summary: This patch add the InstAlias definitions for below instructions. ADDI ADDIS ADDI8 ADDIS8 RLWINM8 ISEL ISEL8 OR OR_rec ORI ORI8 XORI8 CNTLZW8 CNTLZW8_rec TEND TSR RFEBB NOR NOR_rec MTCRF SUBF SUBF_rec SUBFC SUBFC_rec RLDICL_32_64 TW Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D77559	2020-05-24 14:05:28 +00:00
Amy Kwan	b631f86ac5	[TLI][PowerPC] Introduce TLI query to check if MULH is cheaper than MUL + SHIFT This patch introduces a TargetLowering query, isMulhCheaperThanMulShift. Currently in DAG Combine, it will transform mulhs/mulhu into a wider multiply and a shift if the wide multiply is legal. This TLI function is implemented on 64-bit PowerPC, as it is more desirable to have multiply-high over multiply + shift for words and doublewords. Having multiply-high can also aid in further transformations that can be done. Differential Revision: https://reviews.llvm.org/D78271	2020-05-23 16:47:12 -05:00
Craig Topper	7392820f98	[Align] Remove operations on MaybeAlign that asserted that it had a defined value. If the caller needs to reponsible for making sure the MaybeAlign has a value, then we should just make the caller convert it to an Align with operator*. I explicitly deleted the relational comparison operators that were being inherited from Optional. It's unclear what the meaning of two MaybeAligns were one is defined and the other isn't should be. So make the caller reponsible for defining the behavior. I left the ==/!= operators from Optional. But now that exposed a weird quirk that ==/!= between Align and MaybeAlign required the MaybeAlign to be defined. But now we use the operator== from Optional that takes an Optional and the Value. Differential Revision: https://reviews.llvm.org/D80455	2020-05-22 21:54:28 -07:00
Fangrui Song	0840d725c4	[MC] Change MCCFIInstruction::createDefCfaOffset to cfiDefCfaOffset which does not negate Offset The negative Offset has caused a bunch of problems and confused quite a few call sites. Delete the unneeded negation and fix all call sites.	2020-05-22 17:07:11 -07:00
Fangrui Song	7e49dc6184	[MC] Change MCCFIInstruction::createDefCfa to cfiDefCfa which does not negate Offset The negative Offset has caused a bunch of problems and confused quite a few call sites. Delete the unneeded negation and fix all call sites.	2020-05-22 15:47:26 -07:00
Ahsan Saghir	a28e9f1208	[PowerPC] Add support for vmsumudm This patch adds support for Vector Multiply-Sum Unsigned Doubleword Modulo instruction; vmsumudm. Differential Revision: https://reviews.llvm.org/D80294	2020-05-22 14:35:13 -05:00
Nemanja Ivanovic	1a493b0fa5	[PowerPC] Add missing handling for half precision The fix for PR39865 took care of some of the handling for half precision but it missed a number of issues that still exist. This patch fixes the remaining issues that cause crashes in the PPC back end. Fixes: https://bugs.llvm.org/show_bug.cgi?id=45776 Differential revision: https://reviews.llvm.org/D79283	2020-05-22 07:50:11 -05:00
Chen Zheng	8086cdd1b0	[PowerPC] add more high latency opcodes for machine combiner pass Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D80097	2020-05-21 02:39:20 -04:00
Sam Parker	fb3ba38021	[CostModel] Remove getExtCost This has not been implemented by any backends which appear to cover the functionality through getCastInstrCost. Sink what there is in the default implementation into BasicTTI. Differential Revision: https://reviews.llvm.org/D78922	2020-05-21 07:18:06 +01:00
Sam Parker	8cc911fa5b	[NFCI][CostModel] Refactor getIntrinsicInstrCost Combine the two API calls into one by introducing a structure to hold the relevant data. This has the added benefit of moving the boiler plate code for arguments and flags, into the constructors. This is intended to be a non-functional change, but the complicated web of logic involved here makes it very hard to guarantee. Differential Revision: https://reviews.llvm.org/D79941	2020-05-20 11:59:08 +01:00
Florian Hahn	bcbd26bfe6	[SCEV] Move ScalarEvolutionExpander.cpp to Transforms/Utils (NFC). SCEVExpander modifies the underlying function so it is more suitable in Transforms/Utils, rather than Analysis. This allows using other transform utils in SCEVExpander. This patch was originally committed as `b8a3c34eee`, but broke the modules build, as LoopAccessAnalysis was using the Expander. The code-gen part of LAA was moved to lib/Transforms recently, so this patch can be landed again. Reviewers: sanjoy.google, efriedma, reames Reviewed By: sanjoy.google Differential Revision: https://reviews.llvm.org/D71537	2020-05-20 10:53:40 +01:00
Kang Zhang	3f376ecad0	[PowerPC] Enable machine verification for 3 passes Summary: For PowerPC, there are 3 passes has disabled the machine verification. ``` PPCTargetMachine.cpp: addPass(&LiveVariablesID, false); PPCTargetMachine.cpp: addPass(createPPCEarlyReturnPass(), false); PPCTargetMachine.cpp: addPass(createPPCBranchSelectionPass(), false); ``` This patch is to enable machine verification for above three passes. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D79840	2020-05-20 09:40:25 +00:00
Matt Arsenault	4dad4914f7	CodeGen: Use Register	2020-05-19 17:56:55 -04:00
Lei Huang	2e6e27583c	[PowerPC][NFC] Cleanup load/store spilling code Summary: Cleanup and commonize code used for spilling to the stack. Reviewers: stefanp, nemanjai, #powerpc, kamaub Reviewed By: nemanjai, #powerpc, kamaub Subscribers: kamaub, hiraditya, wuzish, shchenz, llvm-commits, kbarton Tags: #llvm, #powerpc Differential Revision: https://reviews.llvm.org/D79736	2020-05-19 14:57:32 -05:00
Simon Pilgrim	cdafe59f95	TargetLoweringObjectFile.h - remove unnecessary includes. NFCI. Replace with forward declarations and move includes down to source files where required. I also needed to move the TargetLoweringObjectFile::SectionForGlobal wrapper implementation down into TargetLoweringObjectFile.cpp	2020-05-19 09:28:13 +01:00
Chen Zheng	a6be4d17e3	[PowerPC-QPX] adjust operands order of qpx fma instructions. convert %3 = QVFMADD %2, %0, %1, implicit $rm to %3 = QVFMADD %2, %1, %0, implicit $rm Reviewed By: hfinkel, steven.zhang Differential Revision: https://reviews.llvm.org/D78986	2020-05-18 22:59:51 -04:00
Chen Zheng	9971839942	fix build failure due to commit rGddcb3cf213e8	2020-05-18 21:47:40 -04:00
Chen Zheng	ddcb3cf213	[TargetInstrInfo] add override function setSpecialOperandAttr - NFC	2020-05-18 21:20:52 -04:00
Christopher Tetreault	0d5d5a75e2	[SVE] Remove usages of VectorType::getNumElements() from PowerPC Reviewers: efriedma, sdesmalen, c-rhodes, hfinkel Reviewed By: c-rhodes Subscribers: wuzish, nemanjai, tschuett, hiraditya, kbarton, rkruppe, psnobl, shchenz, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79821	2020-05-15 12:30:56 -07:00
Li Rong Yi	80173566f4	[PowerPC] Add an intrinsic for Popcntb Summary: This patch adds the intrinsic llvm.ppc.popcntb for the HW instruction POPCNTB Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D79703	2020-05-15 15:19:12 +08:00
Sean Fertile	ce4ebc14a8	[PowerPC] Remove support for SplitCSR. SplitCSR was only suppored for functions with CXX_FAST_TLS calling convention. Clang only emits that calling convention for Darwin which is no longer supported by the PowerPC backend. Another IR producer could use the calling convention, but considering the calling convention is meant to be an optimization and the codegen for SplitCSR can be attrocious on Power (see the modifed lit test) it is best to remove it and codegen CXX_FAST_TLS same as the C calling convention. Differential Revision: https://reviews.llvm.org/D79018	2020-05-14 10:32:17 -04:00
Qiu Chaofan	8ffe8891cd	[PowerPC] Exploit VSX neg, abs and nabs for f32 xsnegdp, xsabsdp and xsnabsdp can be used to operate on f32 operand. This patch adds the missing patterns since we prefer VSX instructions when available. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D75344	2020-05-13 14:28:50 +08:00
Qiu Chaofan	e9753822b5	[PowerPC] Respect SDNodeFlags in lowering SELECT_CC Legalizer should respect both command-line options or SDNode-level fast-math flags. Also, this patch propagates other flags during custom simplifying. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D79074	2020-05-13 14:05:47 +08:00
Kang Zhang	782a4dd1a4	[PowerPC] Use add instead of addReg in ppc-early-ret pass Summary: The ppc-early-ret pass use the addReg() to add operand to the new instruction, it can't reserve the flag of old operand. This has caused machine verfications failed. This patch use add() to instead of addReg(). Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D77997	2020-05-13 05:59:52 +00:00
Justin Hibbits	0138cc0125	PowerPC: Treat llvm.fma.f* intrinsic as using CTR with SPE Summary: The SPE doesn't have a 'fma' instruction, so the intrinsic becomes a libcall. It really should become an expansion to two instructions, but for some reason the compiler doesn't think that's as optimal as a branch. Since this lowering is done after CTR is allocated for loops, tell the optimizer that CTR may be used in this case. This prevents a "Invalid PPC CTR loop!" assertion in the case that a fma() function call is used in a C/C++ file, and clang converts it into an intrinsic. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D78668	2020-05-12 17:19:43 -05:00
Kamau Bridgeman	cd83333fc8	[PowerPC] Fold redundant load immediates of zero and delete if possible This patch folds redundant load immediates into a zero for instructions which recognise this as the value zero and not the register. If the load immediate is no longer in use it is then deleted. This is already done in earlier passes but the ppc-mi-peephole allows for a more general implementation. Differential Revision: https://reviews.llvm.org/D69168	2020-05-12 13:15:06 -05:00
Craig Topper	8c72b0271b	[CodeGen] Use Align in MachineConstantPool.	2020-05-12 10:06:40 -07:00
Qiu Chaofan	e8d2ff22f0	[PowerPC] Add fma/fsqrt/fmax strict-fp intrinsics This patch adds strict-fp intrinsics support for fma, fsqrt, fmaxnum and fminnum on PowerPC. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D72749	2020-05-12 13:44:09 +08:00
jasonliu	51e6fc44d0	[XCOFF][AIX] Emit correct alignment for csect Summary: This patch tries to emit the correct alignment result for both object file generation path and assembly path. Reviewed by: hubert.reinterpretcast, DiggerLin, daltenty Differential Revision: https://reviews.llvm.org/D79127	2020-05-11 19:43:10 +00:00
Sean Fertile	1ea8d58f21	[PowerPC][NFC] Convert an if/else to a conditional. Change an if else to use a conditional which is shorter. Also name the conditonal value to make the code clearer.	2020-05-11 13:05:19 -04:00
Kang Zhang	dcc5ff3bc2	[PowerPC] Use PredictableSelectIsExpensive to enable select to branch in CGP Summary: This patch will set the variable PredictableSelectIsExpensive to do the select to if based on BranchProbability in CodeGenPrepare. When the BranchProbability more than MinPercentageForPredictableBranch, PPC will convert SELECT to branch. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D71883	2020-05-11 15:02:09 +00:00
Craig Topper	d1119980e5	[SelectionDAG] Use Align/MaybeAlign for ConstantPoolSDNode. This patch stores the alignment for ConstantPoolSDNode as an Align and updates the getConstantPool interface to take a MaybeAlign. Removing getAlignment() will be done as a follow up. Differential Revision: https://reviews.llvm.org/D79436	2020-05-08 16:04:11 -07:00
Hubert Tong	601d5bd516	[Target][XCOFF] Correctly halt when mixing AIX or XCOFF with ppc64le The code to prevent using `PPCXCOFFMCAsmInfo` with little-endian targets used an incorrect check. Also, there does not appear to be sufficient earlier checking to prevent failing this check, so the check here is upgraded to be a `report_fatal_error`. `PPCAIXAsmPrinter` was also missing a check against use with little-endian targets. This patch adds such a check in.	2020-05-08 16:51:34 -04:00
Hubert Tong	b116ded57d	[AIX] Avoid structor alias; die before bad alias codegen Summary: `AsmPrinter::emitGlobalIndirectSymbol` is dependent on `MCStreamer::emitAssignment` to produce `.set` directives for alias symbols; however, the `.set` pseudo-op on AIX is documented as not usable with external relocatable terms or expressions, which limits its applicability in generating alias symbols. Disable generating aliases on AIX until a different implementation strategy is available. Reviewers: cebowleratibm, jasonliu, sfertile, daltenty, DiggerLin Reviewed By: jasonliu Differential Revision: https://reviews.llvm.org/D79044	2020-05-08 16:51:34 -04:00
Sam Parker	40574fefe9	[NFC][CostModel] Add TargetCostKind to relevant APIs Make the kind of cost explicit throughout the cost model which, apart from making the cost clear, will allow the generic parts to calculate better costs. It will also allow some backends to approximate and correlate the different costs if they wish. Another benefit is that it will also help simplify the cost model around immediate and intrinsic costs, where we currently have multiple APIs. RFC thread: http://lists.llvm.org/pipermail/llvm-dev/2020-April/141263.html Differential Revision: https://reviews.llvm.org/D79002	2020-05-05 10:35:54 +01:00
Nemanja Ivanovic	8ca2fc9993	[PowerPC] Refactor PPCInstrVSX.td Over time, we have made many additions to this file and it has frankly become a bit of a mess. This has led to at least one issue - we have a number of instructions where the side effects flag should be set to false and we neglected to do this. This patch suggests a refactoring that should make the file much more maintainable. The file is split up into major sections and the nesting level is reduced, predicate blocks merged, etc. Sections: - Custom PPCISD node definitions - Predicate definitions - Instruction formats - Instruction definitions - Helper DAG definitions - Anonymous patterns - Instruction aliases Differential revision: https://reviews.llvm.org/D78132	2020-05-01 19:17:39 -05:00
Hubert Tong	a3515ab8af	[MC][Target][XCOFF] Consolidate MCAsmInfo XCOFF defaults; NFC The setting of `MCAsmInfo` properties for XCOFF got split between `MCAsmInfoXCOFF` and `PPCXCOFFMCAsmInfo`. Except for the properties that are dependent on the target information being passed via the constructor, the properties being set in `PPCXCOFFMCAsmInfo` had no fundamental reason for being treated as specific for XCOFF on PowerPC. Indeed, the property that might be considered more specific to PowerPC, `NeedsFunctionDescriptors`, was set in `MCAsmInfoXCOFF`. XCOFF being specific to PowerPC anyway, this patch consolidates the setting of the properties into `MCAsmInfoXCOFF` except for the cases that are dependent on the information provided via the `PPCXCOFFMCAsmInfo` constructor. This patch also reorders the assignments to the fields to match the declaration order in `MCAsmInfo`.	2020-04-30 20:48:30 -04:00
diggerlin	a2c8cd1812	[AIX] emit .extern and .weak directive linkage SUMMARY: emit .extern and .weak directive linkage Reviewers: hubert.reinterpretcast, Jason Liu Subscribers: wuzish, nemanjai, hiraditya Differential Revision: https://reviews.llvm.org/D76932	2020-04-30 09:54:10 -04:00
Sean Fertile	2a3cf5e583	[PowerPC][AIX] Pass ByVal formal args that span registers and stack. Implement passing of ByVal formal arguments when the argument is passed partly in the argument registers, with the remainder of the argument passed on the stack. Differential Revision: https://reviews.llvm.org/D78515	2020-04-28 14:57:14 -04:00
Nick Desaulniers	1b9fdec1f6	[TII] remove overrides of isUnpredicatedTerminator Summary: They all match the base implementation in TargetInstrInfo::isUnpredicatedTerminator. Follow up to D62749. Reviewers: echristo, MaskRay, hfinkel Reviewed By: echristo Subscribers: wuzish, nemanjai, hiraditya, kbarton, llvm-commits, srhines Tags: #llvm Differential Revision: https://reviews.llvm.org/D78976	2020-04-28 08:47:28 -07:00
Ng Zhi An	500b4ad5f4	[PowerPC] Fix downcast from nullptr for target streamer getTargetStreamer() might return null (e.g. when running inlined-strings.ll test), downcasting to a reference will be wrong. This is detectable with -fsanitize=null. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D78686	2020-04-28 09:20:10 +00:00
Sam Parker	e9c9329aa4	[TTI] Add TargetCostKind argument to getUserCost There are several different types of cost that TTI tries to provide explicit information for: throughput, latency, code size along with a vague 'intersection of code-size cost and execution cost'. The vectorizer is a keen user of RecipThroughput and there's at least 'getInstructionThroughput' and 'getArithmeticInstrCost' designed to help with this cost. The latency cost has a single use and a single implementation. The intersection cost appears to cover most of the rest of the API. getUserCost is explicitly called from within TTI when the user has been explicit in wanting the code size (also only one use) as well as a few passes which are concerned with a mixture of size and/or a relative cost. In many cases these costs are closely related, such as when multiple instructions are required, but one evident diverging cost in this function is for div/rem. This patch adds an argument so that the cost required is explicit, so that we can make the important distinction when necessary. Differential Revision: https://reviews.llvm.org/D78635	2020-04-28 08:57:45 +01:00
Chen Zheng	45d92806ea	[PowerPC] use inst-level fast-math-flags to drive MachineCombiner Currently, on PowerPC target, it uses function scope UnsafeFPMath option to drive Machine Combiner pass. This is not accurate in two ways: 1: the scope is not accurate. Machine Combiner pass only requires instruction-level flags instead of the function scope. 2: the float point flag is not accurate. Machine Combiner pass only requires float point flags reassoc and nsz. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D78183	2020-04-28 03:31:12 -04:00
Haojian Wu	b73290be9f	Fix the -Wunused-variable warning.	2020-04-28 08:44:15 +02:00
Craig Topper	a58b62b4a2	[IR] Replace all uses of CallBase::getCalledValue() with getCalledOperand(). This method has been commented as deprecated for a while. Remove it and replace all uses with the equivalent getCalledOperand(). I also made a few cleanups in here. For example, to removes use of getElementType on a pointer when we could just use getFunctionType from the call. Differential Revision: https://reviews.llvm.org/D78882	2020-04-27 22:17:03 -07:00
Kang Zhang	4bb0a1cb70	[PowerPC] Fix the liveins for ppc-expand-isel pass Summary: In the ppc-expand-isel pass, we use stepForward() to update the liveins, this function is not recommended, because it needs the accurate kill info. This patch uses the function computeAndAddLiveIns() to update the liveins, it's the recommended method and can fix the liveins bug for ppc-expand-isel pass.. Reviewed By: efriedma, lkail Differential Revision: https://reviews.llvm.org/D78657	2020-04-28 03:22:48 +00:00
Victor Huang	64d44ae7c2	[PowerPC][Future] Remove "unskipableSimplifyCode()" in PPCMIPeephole.cpp "unskipableSimplifyCode()" was added to handle unsafe BL8_NOTOC instruction when TOC was not completely removed. The function is not needed after confirming TOC pointer is not used in a function that uses PC-Relative addressing. Differential Revision: https://reviews.llvm.org/D78517	2020-04-27 14:57:02 -05:00
Stefan Pintilie	1354a03e74	[PowerPC][Future] Implement PC Relative Tail Calls Tail Calls were initially disabled for PC Relative code because it was not safe to make certain assumptions about the tail calls (namely that all compiled functions no longer used the TOC pointer in R2). However, once all of the TOC pointer references have been removed it is safe to tail call everything that was tail called prior to the PC relative additions as well as a number of new cases. For example, it is now possible to tail call indirect functions as there is no need to save and restore the TOC pointer for indirect functions if the caller is marked as may clobber R2 (st_other=1). For the same reason it is now also possible to tail call functions that are external. Differential Revision: https://reviews.llvm.org/D77788	2020-04-27 12:55:08 -05:00
Simon Pilgrim	a3982491db	[Pass] Ensure we don't include PassSupport.h or PassAnalysisSupport.h directly Both PassSupport.h and PassAnalysisSupport.h are only supposed to be included via Pass.h. Differential Revision: https://reviews.llvm.org/D78815	2020-04-26 12:58:20 +01:00
Fangrui Song	25e22613df	[XRay] Change ARM/AArch64/powerpc64le to use version 2 sled (PC-relative address) Follow-up of D78082 (x86-64). This change avoids dynamic relocations in `xray_instr_map` for ARM/AArch64/powerpc64le. MIPS64 cannot use 64-bit PC-relative addresses because R_MIPS_PC64 is not defined. Because MIPS32 shares the same code, for simplicity, we don't use PC-relative addresses for MIPS32 as well. Tested on AArch64 Linux and ppc64le Linux. Reviewed By: ianlevesque Differential Revision: https://reviews.llvm.org/D78590	2020-04-24 08:35:43 -07:00
Victor Huang	e20b07b021	[PowerPC][Future] Add missing changes for PC Realtive addressing 1. Use Subtarget.isUsingPCRelativeCalls() in LowerConstantPool to check if using PCRelative addressing. 2. Change MO_GOT_FLAG = 32 to MO_GOT_FLAG = 8 in PPC.h to use consecutive bits. Differential Revision: https://reviews.llvm.org/D78406	2020-04-23 10:26:43 -05:00
Simon Pilgrim	d8a4a99161	[PowerPC] Remove unused forward declarations. NFC.	2020-04-23 15:02:18 +01:00
Kazuaki Ishizaki	0312b9f550	[llvm] NFC: Fix trivial typo in rst and td files Differential Revision: https://reviews.llvm.org/D77469	2020-04-23 14:26:32 +09:00
Victor Huang	a60ca4b4e9	[PowerPC][Future] Initial support for PCRel addressing to get block address Add initial support for PCRelative addressing to get block address instead of using TOC. Differential Revision: https://reviews.llvm.org/D76294	2020-04-22 15:01:29 -05:00
Victor Huang	02141a17ae	[PowerPC][Future] Remove redundant r2 save and restore for indirect call Currently an indirect call produces the following sequence on PCRelative mode: extern void function( ); extern void (ptrfunc) ( ); void g() { ptrfunc=function; } void f() { (ptrfunc) ( ); } Producing paddi 3, 0, .LC0@PCREL, 1 ld 3, 0(3) std 2, 24(1) ld 12, 0(3) mtctr 12 bctrl ld 2, 24(1) Though the caller does not use or preserve r2, it is still saved and restored across a function call. This patch is added to remove these redundant save and restores for indirect calls. Differential Revision: https://reviews.llvm.org/D77749	2020-04-22 12:05:51 -05:00
Victor Huang	43abef06f4	[PowerPC][Future] Initial support for PCRel addressing for jump tables. Add initial support for PC Relative addressing to get jump table base address instead of using TOC. Differential Revision: https://reviews.llvm.org/D75931	2020-04-22 10:45:01 -05:00
Craig Topper	d22989c34e	[CallSite removal][Target] Replace CallSite with CallBase. NFC In some cases just delete an unneeded include.	2020-04-21 23:29:36 -07:00
Qiu Chaofan	c12722cde8	[PowerPC] Exploit RLDIMI for OR with large immediates This patch exploits rldimi instruction for patterns like `or %a, 0b000011110000`, which saves number of instructions when the operand has only one use, compared with `li-ori-sldi-or`. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D77850	2020-04-22 14:16:52 +08:00
Benjamin Kramer	d50bfd9764	Fix an unused-variable warning in Release mode.	2020-04-21 18:59:27 +02:00
Stefan Pintilie	a92ee77d85	[PowerPC][Future] Add offsets to PC Relative relocations. This is an optimization that applies to global addresses and allows for the following transformation: Convert this: paddi r3, 0, symbol@PCREL, 1 ld r4, 8(r3) To this: pld r4, symbol@PCREL+8(0), 1 An instruction is saved and the linker can do the addition when the symbol is resolved. Differential Revision: https://reviews.llvm.org/D76160	2020-04-21 11:08:19 -05:00
Shengchen Kan	8bb059ab63	[MC][Bugfix] Remove redundant parameter for relaxInstruction Summary: Before this patch, `relaxInstruction` takes three arguments, the first argument refers to the instruction before relaxation and the third argument is the output instruction after relaxation. There are two quite strange things: 1) The first argument's type is `const MCInst &`, the third argument's type is `MCInst &`, but they may be aliased to the same variable 2) The backends of ARM, AMDGPU, RISC-V, Hexagon assume that the third argument is a fresh uninitialized `MCInst` even if `relaxInstruction` may be called like `relaxInstruction(Relaxed, STI, Relaxed)` in a loop. In this patch, we drop the thrid argument, and let `relaxInstruction` directly modify the given instruction. Also, this patch fixes the bug https://bugs.llvm.org/show_bug.cgi?id=45580, which is introduced by D77851, and breaks the assumption of ARM, AMDGPU, RISC-V, Hexagon. Reviewers: Razer6, MaskRay, jyknight, asb, luismarques, enderby, rtaylor, colinl, bcain Reviewed By: Razer6, MaskRay, bcain Subscribers: bcain, nickdesaulniers, nathanchance, wuzish, annita.zhang, arsenm, dschuff, jyknight, dylanmckay, sdardis, nemanjai, jvesely, nhaehnle, tpr, sbc100, jgravelle-google, kristof.beyls, hiraditya, aheejin, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78364	2020-04-21 11:06:55 +08:00
Christopher Tetreault	a9b137f9ff	[SVE] Remove calls to getBitWidth from PowerPC Reviewers: efriedma, sdesmalen, hfinkel, david-arm, fpetrogalli Reviewed By: efriedma, fpetrogalli Subscribers: wuzish, nemanjai, tschuett, hiraditya, kbarton, rkruppe, psnobl, shchenz, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77900	2020-04-20 14:18:37 -07:00
David Tenty	0098324947	[AIX] Return the correct set of callee saved regs Summary: r13 isn't reserved on 32-bit AIX, which is reflected in our calling convention but not callee saved regs. Reviewers: sfertile, ZarkoCA, cebowleratibm, jasonliu Reviewed By: sfertile Subscribers: thakis, lei, wuzish, nemanjai, hiraditya, kbarton, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77101	2020-04-20 14:31:08 -04:00
Nemanja Ivanovic	64b31d96df	[PowerPC] Do not attempt to reuse load for 64-bit FP_TO_UINT without FPCVT We call the function that attempts to reuse the conversion without checking whether the target matches the constraints that the callee expects. This patch adds the check prior to the call. Fixes: https://bugs.llvm.org/show_bug.cgi?id=43976 Differential revision: https://reviews.llvm.org/D77564	2020-04-20 13:00:06 -05:00
David Tenty	28ae1969dc	Revert "[AIX] Return the correct set of callee saved regs" This reverts commit `6c881bf1fe`.	2020-04-20 13:06:37 -04:00
Sean Fertile	d52bb6d099	[PowerPC][AIX] ByVal formal argument support: passing on the stack. Adds support for passing a ByVal formal argument completely on the stack (ie after all argument registers are exhausted). Differential Revision: https://reviews.llvm.org/D78263	2020-04-20 12:04:59 -04:00
David Tenty	6c881bf1fe	[AIX] Return the correct set of callee saved regs Summary: r13 isn't reserved on 32-bit AIX, which is reflected in our calling convention but not callee saved regs. Reviewers: sfertile, ZarkoCA, cebowleratibm, jasonliu Reviewed By: sfertile Subscribers: lei, wuzish, nemanjai, hiraditya, kbarton, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77101	2020-04-20 11:22:17 -04:00
LemonBoy	a5d161c119	[PowerPC] Don't use rldicl for PPC32 According to https://www.ibm.com/support/knowledgecenter/ssw_aix_72/assembler/idalangref_rldicl_rletdw_instrs.html rldicl should not be used when targeting 32bit CPUs. Reviewed By: #powerpc, nemanjai, MaskRay Differential Revision: https://reviews.llvm.org/D77946	2020-04-18 17:24:25 -07:00
Stefan Pintilie	b771c4a842	[PowerPC][Future] More support for PCRel addressing for global values Add initial support for PC Relative addressing for global values that require GOT indirect addressing. This patch adds PCRelative support for global addresses that may not be known at link time and may require access through the GOT. Differential Revision: https://reviews.llvm.org/D76064	2020-04-17 11:06:13 -05:00
jasonliu	77618cc237	[XCOFF][AIX] Fix getSymbol to return the correct qualname when necessary Summary: AIX symbol have qualname and unqualified name. The stock getSymbol could only return unqualified name, which leads us to patch many caller side(lowerConstant, getMCSymbolForTOCPseudoMO). So we should try to address this problem in the callee side(getSymbol) and clean up the caller side instead. Note: this is a "mostly" NFC patch, with a fix for the original lowerConstant behavior. Differential Revision: https://reviews.llvm.org/D78045	2020-04-17 13:45:14 +00:00
QingShan Zhang	4bd186c0ff	[PowerPC] Exploit the rldicl + rldicl when and with mask If we are and the constant like 0xFFFFFFC00000, for now, we are using several instructions to generate this 48bit constant and final an "and". However, we could exploit it with two rotate instructions. MB ME MB+63-ME +----------------------+ +----------------------+ \|0000001111111111111000\| -> \|0000000001111111111111\| +----------------------+ +----------------------+ 0 63 0 63 Rotate left ME + 1 bit first, and then, mask it with (MB + 63 - ME, 63), finally, rotate back. Notice that, we need to round it with 64 bit for the wrapping case. Reviewed by: ChenZheng, Nemanjai Differential Revision: https://reviews.llvm.org/D71831	2020-04-17 05:24:00 +00:00
Stefan Pintilie	18b6050324	[PowerPC][Future] Initial support for PC Relative addressing for global values This patch adds PC Relative support for global values that are known at link time. If a global value requires access through the global offset table (GOT) it is not covered in this patch. Differential Revision: https://reviews.llvm.org/D75280	2020-04-16 12:45:22 -05:00
Kang Zhang	513976df2e	[PowerPC] Ignore implicit register operands for MCInst Summary: When doing the conversion: MachineInst -> MCInst, we should ignore the implicit operands, it will expose more opportunity for InstiAlias. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D77118	2020-04-16 16:22:43 +00:00
Chris Bowler	bee6c234ed	[AIX][PowerPC] Implement caller byval arguments in stack memory Differential Revision: https://reviews.llvm.org/D77578	2020-04-15 17:57:31 -04:00
Nemanja Ivanovic	c196e2ca48	[PowerPC] Clear the set of symbols that need to be updated in MCTargetStreamer We have added code to correct the .localentry values on assignments. However, we never clear the set so presumably it will still contain the (now dangling) MCSymbol pointers across a call to finish() and reset() in the streamer. This is based on my speculation that it is the reason we are getting segmentation faults mentioned in https://bugs.llvm.org/show_bug.cgi?id=45366 Fixes: https://bugs.llvm.org/show_bug.cgi?id=45366 Differential revision: https://reviews.llvm.org/D78196	2020-04-15 15:42:02 -05:00
Mircea Trofin	447e2c3067	[llvm][NFC][CallSite] Remove Implementation uses of CallSite Reviewers: dblaikie, davidxl, craig.topper Subscribers: arsenm, dschuff, nemanjai, jvesely, nhaehnle, sbc100, jgravelle-google, hiraditya, aheejin, kbarton, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78142	2020-04-14 14:49:47 -07:00
Craig Topper	113f37a1f9	[CallSite removal][TargetLowering] Replace ImmutableCallSite with CallBase Differential Revision: https://reviews.llvm.org/D77995	2020-04-13 13:50:15 -07:00
Fangrui Song	0a55d3f557	[MC] Default MCAsmInfo::UseIntegratedAssembler to true	2020-04-11 10:13:52 -07:00
Fangrui Song	d2e5157c1f	[MC] Add UseIntegratedAssembler = false. NFC	2020-04-11 10:13:49 -07:00
Nemanja Ivanovic	512600e3c0	[PowerPC] Handle f16 as a storage type only The PPC back end currently crashes (fails to select) with f16 input. This patch expands it on subtargets prior to ISA 3.0 (Power9) and uses the HW conversions on Power9. Fixes https://bugs.llvm.org/show_bug.cgi?id=39865 Differential revision: https://reviews.llvm.org/D68237	2020-04-11 07:34:47 -05:00
Nemanja Ivanovic	04eae39617	[PowerPC] Another folow-up fix for `6c4b40def7` There was another issue introduced by this commit that the OP initially missed. Namely, for functions that are free to use R2 as a callee-saved register, we emit a TOC expression based on the address of the GEP label without emitting the GEP label. Since we only emit such expressions for the large code model, this issue only surfaced there. I have confirmed that with this fix, the kernel build is successful with target "all".	2020-04-10 21:09:59 -05:00
Kai Luo	b7d5229d78	[PowerPC] Update alignment for ReuseLoadInfo in LowerFP_TO_INTForReuse In LowerFP_TO_INTForReuse, when emitting `stfiwx`, alignment of 4 is set for the `MachineMemOperand`, but RLI(ReuseLoadInfo)'s alignment is not updated for following loads. It's related to failed alignment check reported in https://bugs.llvm.org/show_bug.cgi?id=45297 Differential Revision: https://reviews.llvm.org/D77624	2020-04-10 05:49:19 +00:00
Nemanja Ivanovic	7f3787c0f2	[PowerPC] Bail out of redundant LI elimination on an implicit kill The transformation currently does not differentiate between explicit and implicit kills. However, it is not valid to later simply clear an implicit kill flag since the kill could be due to a call or return. Fixes: https://bugs.llvm.org/show_bug.cgi?id=45374	2020-04-09 22:17:29 -05:00
Stefan Pintilie	5b18b6e9a8	[PowerPC][Future] Fix for `6c4b40def7` This is a fix for the previous patch `6c4b40def7`. In some cases it may be possible to have the compiler produce st_other=1 without the compiler using mcpu=future which should not be the case. This patch adds a guard to make sure that if we are using st_other=1 then we are also compiling for future CPU.	2020-04-10 01:12:11 +00:00
Nemanja Ivanovic	5fe2809447	[PowerPC] Don't assert on SELECT_CC with i1 type When we try to select a SELECT_CC on Power9, we check if it can be matched to a SETB instruction. In that function, we assert that the output type is i32/i64. This is unnecessary as it is perfectly reasonable to have an i1 SELECT_CC. Change that from an assert to an early exit condition. Fixes: https://bugs.llvm.org/show_bug.cgi?id=45448	2020-04-09 19:27:32 -05:00
James Y Knight	5e7b98fe75	Fix an unused-variable warning in Release mode.	2020-04-09 16:34:55 -04:00
Stefan Pintilie	64868cbfcf	[PowerPC][Future] Fix for `75828ef615` Used unsigned long where uint64_t should have been used by mistake. Fixed in this patch.	2020-04-09 19:33:12 +00:00
jasonliu	085689d44c	[PPC][AIX] Implement variadic function handling in LowerFormalArguments_AIX Summary: This patch adds support for handling of variadic functions for AIX. This includes ensuring that use and consume correct type of va_list (char *va_list) for AIX. Authored by: ZarkoCA Reviewers: cebowleratibm, sfertile, jasonliu Reviewed by: jasonliu Differential Revision: https://reviews.llvm.org/D76130	2020-04-09 16:49:44 +00:00
Stefan Pintilie	75828ef615	[PowerPC][Future] Initial support for PCRel addressing for constant pool loads Add initial support for PC Relative addressing for constant pool loads. This includes adding a new relocation for @pcrel and adding a new PowerPC flag to identify PC relative addressing. Differential Revision: https://reviews.llvm.org/D74486	2020-04-09 11:17:23 -05:00
Christopher Tetreault	49fd24fe9e	Clean up usages of asserting vector getters in Type Summary: Remove usages of asserting vector getters in Type in preparation for the VectorType refactor. The existence of these functions complicates the refactor while adding little value. Reviewers: hfinkel, efriedma, sdesmalen Reviewed By: efriedma Subscribers: wuzish, nemanjai, hiraditya, kbarton, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77266	2020-04-08 16:10:55 -07:00
Sean Fertile	d0b57b41f4	[PowerPC][AIX][NFC] Replace deprecated getByValAlign call. Replace call to deprecated 'getByValAlign()' with 'getNonZeroByValAlign()'.	2020-04-08 13:27:39 -04:00
Matt Arsenault	ca0ace7298	CodeGen: Use Register in MachineBasicBlock	2020-04-08 12:10:58 -04:00
Matt Arsenault	84aa58cbe2	CodeGen: Use Register in TargetLowering	2020-04-08 12:10:58 -04:00
Sean Fertile	8abfd2c3bb	[PowerPC][AIX] Enable passing byval formal arguments in multiple registers. Any or all the argument registers can be used to pass a byval formal argument, with the limitation that the argument must fit in the available registers (ie: is not split between registers and stack). Differential Revision: https://reviews.llvm.org/D76902	2020-04-08 11:16:33 -04:00
Stefan Pintilie	6c4b40def7	[PowerPC][Future] Add Support For Functions That Do Not Use A TOC. On PowerPC most functions require a valid TOC pointer. This is the case because either the function itself needs to use this pointer to access the TOC or because other functions that are called from that function expect a valid TOC pointer in the register R2. The main exception to this is leaf functions that do not access the TOC since they are guaranteed not to need a valid TOC pointer. This patch introduces a feature that will allow more functions to not require a valid TOC pointer in R2. Differential Revision: https://reviews.llvm.org/D73664	2020-04-08 08:07:35 -05:00
Matt Arsenault	6011627f51	CodeGen: More conversions to use Register	2020-04-07 18:54:36 -04:00
Matt Arsenault	aa26dd9858	CodeGen: Use Register in more places	2020-04-07 15:59:40 -04:00
Nemanja Ivanovic	ecd8435483	[NFC][PowerPC] Fix register class for patterns using XXPERMDIs There are a few patterns where we use a superclass for inputs to this instruction rather than the correct class. This can sometimes lead to unncessary copies.	2020-04-07 14:06:08 -05:00
David Tenty	b9245f14b7	[NFC][PowerPC] Cleanup 64-bit and Darwin CalleeSavedRegs Summary: - Remove the no longer used Darwin CalleeSavedRegs - Combine the SVR464 callee saved regs and AIX64 since the two are (and should be) identical into PPC64 - Update tests for 64-bit CSR change Reviewers: sfertile, ZarkoCA, cebowleratibm, jasonliu, #powerpc Reviewed By: sfertile Subscribers: wuzish, nemanjai, hiraditya, kbarton, shchenz, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77235	2020-04-07 11:49:10 -04:00
Chris Bowler	d6ea82d11c	[AIX][PPC] Implement by-val caller arguments in multiple registers Differential Revision: https://reviews.llvm.org/D76380	2020-04-06 11:06:51 -04:00
Matt Arsenault	30ebafaa56	CodeGen: Convert some TII hooks to use Register	2020-04-03 14:52:54 -04:00
jasonliu	d65557d15d	[NFC][XCOFF][AIX] Refactor get/setContainingCsect Summary: For current architect, we always require setContainingCsect to be called on every MCSymbol got used in XCOFF context. This is very hard to achieve because symbols gets created everywhere and other MCSymbol types(ELF, COFF) do not have similar rules. It's very easy to miss setting the containing csect, and we would need to add a lot of XCOFF specialized code around some common code area. This patch intendeds to do 1. Rely on getFragment().getParent() to get csect from labels. 2. Only use get/setRepresentedCsect (was get/setContainingCsect) if symbol itself represents a csect. Reviewers: DiggerLin, hubert.reinterpretcast, daltenty Differential Revision: https://reviews.llvm.org/D77080	2020-04-03 13:33:12 +00:00
Qiu Chaofan	71f1ab5354	[PowerPC] Remove unnecessary XSRSP instruction MI peephole will remove unnecessary FRSP instructions. This patch removes such unnecessary XSRSP. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D77208	2020-04-03 11:05:14 +08:00
Fangrui Song	cbd3969e8c	[PPCInstPrinter] Delete an unneeded overload of printBranchOperand. NFC It was added by D76591 for migration purposes (not all printBranchOperand users have migrated to the overload with `uint64_t Address`). Now that all have been migrated, the parameter can go away.	2020-04-01 22:45:25 -07:00
Fangrui Song	85adce3d73	[PPCInstPrinter] Change B to print the target address in hexadecimal form Follow-up of D76591 and D76907	2020-04-01 22:38:24 -07:00
Guillaume Chatelet	1dffa2550b	[Alignment][NFC] Transition to MachineFrameInfo::getObjectAlign() Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: arsenm, sdardis, nemanjai, jvesely, nhaehnle, hiraditya, kbarton, jrtc27, atanasyan, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77215	2020-04-01 14:08:28 +00:00
Guillaume Chatelet	c7468c1696	[Alignment][NFC] Use Align in SelectionDAG::getMemIntrinsicNode Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: jholewinski, nemanjai, hiraditya, kbarton, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77149	2020-04-01 09:32:05 +00:00
Kai Luo	8eb40e41f6	[PowerPC] Don't generate ST_VSR_SCAL_INT if power8-vector is disabled Summary: In https://bugs.llvm.org/show_bug.cgi?id=45297, it fails selecting instructions for `PPCISD::ST_VSR_SCAL_INT`. The reason it generate the `PPCISD::ST_VSR_SCAL_INT` with `-power8-vector` in IR is PPC's combiner checks `hasP8Altivec` rather than `hasP8Vector`. This patch should resolve PR45297. Differential Revision: https://reviews.llvm.org/D76773	2020-04-01 02:15:25 +00:00
Fangrui Song	4af7560b37	[PPCInstPrinter] Print conditional branches as `bt 2, $target` instead of `bt 2, .+$imm` Follow-up of D76591. Reviewed By: #powerpc, sfertile Differential Revision: https://reviews.llvm.org/D76907	2020-03-31 15:05:38 -07:00
Guillaume Chatelet	c9d5c19597	[Alignment][NFC] Transitionning more getMachineMemOperand call sites Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: arsenm, dylanmckay, sdardis, nemanjai, jvesely, nhaehnle, hiraditya, kbarton, jrtc27, atanasyan, Jim, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77121	2020-03-31 08:36:18 +00:00
QingShan Zhang	4eeb56d088	[PowerPC] Don't do the folding if the operand is R0/X0 We have this transformation in PowerPC peephole: Replace instruction: renamable $x28 = ADDI8 renamable $x7, -8 renamable $x28 = ADD8 killed renamable $x28, renamable $x0 STFD killed renamable $f0, -8, killed renamable $x28 :: (store 8 into %ir._ind_cast99.epil) with: renamable $x28 = ADDI8 renamable $x7, -16 STFDX killed renamable $f0, $x0, killed $x28 :: (store 8 into %ir._ind_cast99.epil) It is invalid as the '$x0' in STFDX is constant 0, not register r0. Reviewed By: Nemanjai Differential Revision: https://reviews.llvm.org/D77034	2020-03-31 02:50:19 +00:00
Guillaume Chatelet	bdf77209b9	[Alignment][NFC] Use Align version of getMachineMemOperand Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: jyknight, sdardis, nemanjai, hiraditya, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, jfb, PkmX, jocewei, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77059	2020-03-30 15:46:27 +00:00
Fangrui Song	fc93787d7e	[MC][PowerPC] Make .reloc support arbitrary relocation types Generalizes `ad7199f3e6` (R_PPC_NONE/R_PPC64_NONE).	2020-03-28 17:04:31 -07:00
diggerlin	9c20f09985	[AIX] Address comment https://reviews.llvm.org/D76162#inline-701237 SUMMARY: Address clang format issue: "clang format this block, I don't think the spaces are aligned correctly." Subscribers: wuzish, nemanjai, hiraditya Differential Revision: https://reviews.llvm.org/D76162	2020-03-27 16:21:53 -04:00
Guillaume Chatelet	74eac9031a	[Alignment][NFC] MachineMemOperand::getAlign/getBaseAlign Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: arsenm, dschuff, sdardis, nemanjai, jvesely, nhaehnle, sbc100, jgravelle-google, hiraditya, aheejin, kbarton, jrtc27, atanasyan, jfb, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76925	2020-03-27 15:49:13 +00:00
Fangrui Song	6728a9ae19	[MCInstPrinter] Add parameter `Address` to printCustomAliasOperand. NFC Follow-up of D72172 and llvmorg-11-init-6896-gb3cc5dcef0f.	2020-03-27 00:38:20 -07:00
Fangrui Song	b3cc5dcef0	[MCInstPrinter] Add parameter `Address` to MCInstPrinter::printAliasInstr. NFC Follow-up of D72172.	2020-03-27 00:03:32 -07:00
diggerlin	fdfe411e7c	[AIX] discard the label in the csect of function description and use qualname for linkage SUMMARY: SUMMARY for a source file "test.c" void foo() {}; llc will generate assembly code as (assembly patch) .globl foo .globl .foo .csect foo[DS] foo: .long .foo .long TOC[TC0] .long 0 and symbol table as (xcoff object file) [4] m 0x00000004 .data 1 unamex foo [5] a4 0x0000000c 0 0 SD DS 0 0 [6] m 0x00000004 .data 1 extern foo [7] a4 0x00000004 0 0 LD DS 0 0 After first patch, the assembly will be as .globl foo[DS] # -- Begin function foo .globl .foo .align 2 .csect foo[DS] .long .foo .long TOC[TC0] .long 0 and symbol table will as [6] m 0x00000004 .data 1 extern foo [7] a4 0x00000004 0 0 DS DS 0 0 Change the code for the assembly path and xcoff objectfile patch for llc. Reviewers: Jason Liu Subscribers: wuzish, nemanjai, hiraditya Differential Revision: https://reviews.llvm.org/D76162	2020-03-26 15:46:52 -04:00
Guillaume Chatelet	b727aabcb8	[Alignment][NFC] Use llvmTargetFrameLowering::getStackAlign Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Reviewed By: courbet Subscribers: wuzish, arsenm, jyknight, nemanjai, jvesely, nhaehnle, hiraditya, kbarton, fedor.sergeev, jrtc27, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76613	2020-03-26 18:15:53 +00:00
Justin Hibbits	459e8e9488	[PowerPC]: Don't allow r0 as a target for LD_GOT_TPREL_L/32 Summary: The linker is free to relax this (relocation R_PPC_GOT_TPREL16) against R_PPC_TLS, if it sees fit (initial exec to local exec). If r0 is used, this can generate execution-invalid code (converts to 'addi %rX, %r0, FOO, which translates in PPC-lingo to li %rX, FOO). Forbid this instead. This fixes static binaries using locales on FreeBSD/powerpc (tested on FreeBSD/powerpcspe). Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D76662	2020-03-26 10:59:28 -05:00
Fangrui Song	3eef47407b	[PPCInstPrinter] Change printBranchOperand(calltarget) to print the target address in hexadecimal form ``` // llvm-objdump -d output (before) 0: bl .-4 4: bl .+0 8: bl .+4 // llvm-objdump -d output (after) ; GNU objdump -d 0: bl 0xfffffffc / bl 0xfffffffffffffffc 4: bl 0x4 8: bl 0xc ``` Many Operand's are not annotated as OPERAND_PCREL. They are not affected (e.g. `b .+67108860`). I plan to fix them in future patches. Modified test/tools/llvm-objdump/ELF/PowerPC/branch-offset.s to test address space wraparound for powerpc32 and powerpc64. Reviewed By: sfertile, jhenderson Differential Revision: https://reviews.llvm.org/D76591	2020-03-26 08:32:29 -07:00
Fangrui Song	5fad05e80d	[MCInstPrinter] Pass `Address` parameter to MCOI::OPERAND_PCREL typed operands. NFC Follow-up of D72172 and D72180 This patch passes `uint64_t Address` to print methods of PC-relative operands so that subsequent target specific patches can change `*InstPrinter::print{Operand,PCRelImm,...}` to customize the output. Add MCInstPrinter::PrintBranchImmAsAddress which is set to true by llvm-objdump. ``` // Current llvm-objdump -d output aarch64: 20000: bl #0 ppc: 20000: bl .+4 x86: 20000: callq 0 // Ideal output aarch64: 20000: bl 0x20000 ppc: 20000: bl 0x20004 x86: 20000: callq 0x20005 // GNU objdump -d. The lack of 0x is not ideal because the result cannot be re-assembled aarch64: 20000: bl 20000 ppc: 20000: bl 0x20004 x86: 20000: callq 20005 ``` In `lib/Target/X86/X86GenAsmWriter1.inc` (generated by `llvm-tblgen -gen-asm-writer`): ``` case 12: // CALL64pcrel32, CALLpcrel16, CALLpcrel32, EH_SjLj_Setup, JCXZ, JECXZ, J... - printPCRelImm(MI, 0, O); + printPCRelImm(MI, Address, 0, O); return; ``` Some targets have 2 `printOperand` overloads, one without `Address` and one with `Address`. They should annotate derived `Operand` properly with `let OperandType = "OPERAND_PCREL"`. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D76574	2020-03-26 08:21:15 -07:00
Kang Zhang	4673699a47	[PowerPC] Remove the repeated definition for some InstAlias for mtspr/mfspr Summary: Below InstAlias have been redefined, this patch is to remove the repeated definition. mtdec/mfdec mtsdr1/mfsdr1 mtsrr0/mfsrr0 mtsrr1/mfsrr1 mtasr Reviewed By: nemanjai, steven.zhang Differential Revision: https://reviews.llvm.org/D75821	2020-03-26 09:58:30 +00:00
QingShan Zhang	1ef7bf4121	[PowerPC] Improve the way legalize mul for v8i16 and add pattern to match mul + add We can legalize the operation MUL for v8i16 with instruction (vmladduhm A, B, 0) if altivec enabled. Now, it is set as custom and expand it later, which is not the right way. And then, we can add the pattern to match the mul + add with (vmladduhm A, B, C) Reviewed By: Nemanjai Differential Revision: https://reviews.llvm.org/D76751	2020-03-26 04:46:49 +00:00
Sean Fertile	3282d875d6	[PowerPC][AIX] ByVal formal arguments in a single register. Adds support for passing ByVal formal arguments as long as they fit in a single register. Differential Revision: https://reviews.llvm.org/D76401	2020-03-25 11:09:40 -04:00
Chen Zheng	9d07d91fb6	[PowerPC] fix a typo in commit `3f85134d71` Implement target hook isProfitableToHoist - typo fix.	2020-03-24 01:56:15 -04:00
Nemanja Ivanovic	bfa9ce1cb2	[PowerPC] Improve handling of some BUILD_VECTOR nodes An analysis of real world code turned up a number of patterns with BUILD_VECTOR of nodes resulting from operations on extracted vector elements for which we produce poor code. This addresses those cases. No attempt is made for completeness as that would entail a large amount of work for something that there is no evidence of in real code. Differential revision: https://reviews.llvm.org/D72660	2020-03-23 17:34:29 -05:00
Justin Hibbits	f0990e104b	[PowerPC]: e500 target can't use lwsync, use msync instead The e500 core has a silicon bug that triggers an illegal instruction program trap on any sync other than msync. Other cores will typically ignore illegal sync types, and the documentation even implies that the 'illegal' bits are ignored. Address this hardware deficiency by only using msync, like the PPC440. Differential Revision: https://reviews.llvm.org/D76614	2020-03-23 17:15:27 -05:00
Guillaume Chatelet	3ba550a05a	[Alignment][NFC] Use TFL::getStackAlign() Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: dylanmckay, sdardis, nemanjai, hiraditya, kbarton, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76551	2020-03-23 13:48:29 +01:00
Fangrui Song	85c30f3374	[X86] Reland D71360 Clean up UseInitArray initialization for X86ELFTargetObjectFile -fuse-init-array is now the CC1 default but TargetLoweringObjectFileELF::UseInitArray still defaults to false. The following two unknown OS target triples continue using .ctors/.dtors because InitializeELF is not called. clang -target i386 -c a.c clang -target x86_64 -c a.c This cleanup fixes this as a bonus. X86SpeculativeLoadHardeningPass::tracePredStateThroughCall can call MCContext::createTempSymbol before TargetLoweringObjectFileELF::Initialize(). We need to call TargetLoweringObjectFileELF::Initialize() ealier. test/CodeGen/X86/speculative-load-hardening-indirect.ll Differential Revision: https://reviews.llvm.org/D71360	2020-03-20 21:57:34 -07:00
Eric Christopher	fc7233d774	Temporarily Revert "[X86] Reland D71360 Clean up UseInitArray initialization for X86ELFTargetObjectFile" as it's causing msan failures. This reverts commit `7899fe9da8`.	2020-03-20 17:36:12 -07:00
Fangrui Song	7899fe9da8	[X86] Reland D71360 Clean up UseInitArray initialization for X86ELFTargetObjectFile UseInitArray is now the CC1 default but TargetLoweringObjectFileELF::UseInitArray still defaults to false. The following two unknown OS target triples continue using .ctors/.dtors because InitializeELF is not called. clang -target i386 -c a.c clang -target x86_64 -c a.c This cleanup fixes this as a bonus. Differential Revision: https://reviews.llvm.org/D71360	2020-03-20 11:18:36 -07:00
Chen Zheng	3f85134d71	[PowerPC] implement target hook isProfitableToHoist On Powerpc fma is faster than fadd + fmul for some types, (PPCTargetLowering::isFMAFasterThanFMulAndFAdd). we should implement target hook isProfitableToHoist to prevent simplifyCFGpass from breaking fma pattern by hoisting fmul to predecessor block. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D76207	2020-03-19 00:17:25 -04:00
Chen Zheng	aacf022cd5	[PowerPC] add IR level isFMAFasterThanFMulAndFAdd - NFC And also refactor legacy MIR level isFMAFasterThanFMulAndFAdd. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D76265	2020-03-18 23:24:40 -04:00
Nemanja Ivanovic	e009fad342	[PowerPC] Remove UB from PPCInstrInfo when handling rotates fed by constants As pointed out in https://bugs.llvm.org/show_bug.cgi?id=45232 this code can end up shifting a 64-bit unsigned value left by 64 bits. Althought this works as expected on some platforms it is definitely UB. This patch removes the UB and adds the associated test case. Fixes: https://bugs.llvm.org/show_bug.cgi?id=45232	2020-03-18 13:40:39 -05:00
Chris Bowler	c21866476e	[PowerPC][AIX] Implement by-val caller arguments in a single register. This is the first of a series of patches that adds caller support for by-value arguments. This patch add support for arguments that are passed in a single GPR. There are 3 limitation cases: -The by-value argument is larger than a single register. -There are no remaining GPRs even though the by-value argument would otherwise fit in a single GPR. -The by-value argument requires alignment greater than register width. Future patches will be required to add support for these cases as well as for the callee handling (in LowerFormalArguments_AIX) that corresponds to this work. Differential Revision: https://reviews.llvm.org/D75863	2020-03-18 10:57:28 -04:00
Guillaume Chatelet	d000655a8c	[Alignment][NFC] Deprecate getMaxAlignment Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: jholewinski, arsenm, dschuff, jyknight, sdardis, nemanjai, jvesely, nhaehnle, sbc100, jgravelle-google, hiraditya, aheejin, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76348	2020-03-18 14:48:45 +01:00
QingShan Zhang	b83490bdb7	[PowerPC] Fix a typo of the condition of checking the fusion candidate	2020-03-17 10:04:18 +00:00
QingShan Zhang	0b126eec6d	[NFC][PowerPC] Simplify the logic in lower select_cc The logic in select_cc is messy and hard to follow. This is a NFC patch to simplify the logic. Differential Revision: https://reviews.llvm.org/D75834	2020-03-17 03:47:39 +00:00
QingShan Zhang	d0fb34dc09	[PowerPC] Replace the PPCISD:: SExtVElems with ISD::SIGN_EXTEND_INREG to leverage the combine rules The PPCISD::SExtVElems was added by commit https://reviews.llvm.org/D34009. However, we have another ISD node ISD::SIGN_EXTEND_INREG that perfectly match the semantics of SExtVElems. And the DAGCombiner has some combine rules for SIGN_EXTEND_INREG that produce better code. Differential Revision: https://reviews.llvm.org/D70771	2020-03-13 07:28:28 +00:00
Amy Kwan	1ba3d2639d	[PowerPC][NFC] Rename instruction formats in PPCInstrPrefix.td This patch renames some of the instruction formats within PPCInstrPrefix.td to adopt a more uniform naming convention. It also adds the naming convention extension, `_MEM` to indicate instruction formats for memory ops. Differential Revision: https://reviews.llvm.org/D75819	2020-03-13 00:50:08 -05:00
Zarko Todorovski	d688312660	[PowerPC][AIX] Implement formal arguments passed in stack memory. This patch is the callee side counterpart for https://reviews.llvm.org/D73209. It removes the fatal error when we pass more formal arguments than available registers. Differential Revision: https://reviews.llvm.org/D74225	2020-03-12 11:48:00 -04:00
Xiangling Liao	3e53bf5781	[PowerPC32] Fix the `setcc` inconsistent result type problem Summary: On 32-bit PPC target[AIX and BE], when we convert an `i64` to `f32`, a `setcc` operand expansion is needed. The expansion will set the result type of expanded `setcc` operation based on if the subtarget use CRBits or not. If the subtarget does use the CRBits, like AIX and BE, then it will set the result type to `i1`, leading to an inconsistency with original `setcc` result type[i32]. And the reason why it crashed underneath is because we don't set result type of setcc consistent in those two places. This patch fixes this problem by setting original setcc opnode result type also with `getSetCCResultType` interface. Reviewers: sfertile, cebowleratibm, hubert.reinterpretcast, Xiangling_L Reviewed By: sfertile Subscribers: wuzish, nemanjai, hiraditya, kbarton, jsji, shchenz, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75702	2020-03-12 10:50:37 -04:00
Sean Fertile	8b39341fb0	[PowerPC][AIX] Fix printing of program counter for AIX assembly. Program counter on AIX is the dollar-sign. Differential Revision:https://reviews.llvm.org/D75627	2020-03-12 10:37:18 -04:00
Qiu Chaofan	096d545376	[PowerPC] Add strict-fp intrinsic to FP arithmetic This patch adds basic strict-fp intrinsics support to PowerPC backend, including basic arithmetic operations (add/sub/mul/div). Reviewed By: steven.zhang, andrew.w.kaylor Differential Revision: https://reviews.llvm.org/D63916	2020-03-12 17:02:54 +08:00
QingShan Zhang	518292dbdf	[PowerPC] Add the MacroFusion support for Power8 This patch is intend to implement the missing P8 MacroFusion for LLVM according to Power8 User's Manual Section 10.1.12 Instruction Fusion Differential Revision: https://reviews.llvm.org/D70651	2020-03-12 05:15:41 +00:00
Teresa Johnson	8f5e3c74b6	[PowerPC] Fix compile time issue in recursive CTR analysis code Summary: Avoid re-examining operands on recursive walk looking for CTR. This was causing huge compile time after some earlier optimization created a large expression. The start of the expression (created by IndVarSimplify) looked like: %469 = lshr i64 trunc (i128 xor (i128 udiv (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 ptrtoint (i8 @_ZN4absl13hash_internal13CityHashState5kSeedE to i64), i64 120) to i128), i128 8192506886679785011), i128 64), i128 mul (i128 zext (i64 add (i64 ptrtoint (i8 @_ZN4absl13hash_internal13CityHashState5kSeedE to i64), i64 120) to i128), i128 8192506886679785011)) to i64), i64 45) to i128), i128 8192506886679785011), i128 64), i128 mul (i128 zext (i64 add (i64 trunc (i128 xor (i128 lshr (i128 mul (i128 zext (i64 add (i64 ptrtoint (i8 @_ZN4absl13hash_internal13CityHashState5kSeedE to i64), i64 120) to i128), i128 8192506886679785011), i128 64), i128 mul (i128 zext (i64 add (i64 ptrtoint (i8 @_ZN4absl13hash_internal13CityHashState5kSeedE to i64), i64 120) to i128), i128 8192506886679785011)) to i64), i64 45) to i128), ... with the _ZN4absl13hash_internal13CityHashState5kSeedE referenced many times. Reviewers: hfinkel Subscribers: nemanjai, hiraditya, kbarton, jsji, shchenz, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75790	2020-03-11 16:11:14 -07:00
Anna Welker	a6d3bec83f	[TTI][ARM][MVE] Refine gather/scatter cost model Refines the gather/scatter cost model, but also changes the TTI function getIntrinsicInstrCost to accept an additional parameter which is needed for the gather/scatter cost evaluation. This did require trivial changes in some non-ARM backends to adopt the new parameter. Extending gathers and truncating scatters are now priced cheaper. Differential Revision: https://reviews.llvm.org/D75525	2020-03-11 10:23:41 +00:00
Kang Zhang	b0f3d49a05	[NFC][PowerPC] Order the MTSTR/MFSPR InstAlias definetion by SPR Summary: This NFC patch is only modify the position of MTSTR/MFSPR InstAlias definition. So it will be easy to read.	2020-03-08 11:58:53 +00:00
Kang Zhang	0bec7e47d0	Revert "[NFC][PowerPC] Remove the repeated definition for some InstAlias of mtspr/mfspr" This reverts commit `46126a30f2`. Some test cases failed.	2020-03-08 06:32:12 +00:00
Kang Zhang	46126a30f2	[NFC][PowerPC] Remove the repeated definition for some InstAlias of mtspr/mfspr Summary: Below InstAlias have been redeclaration, this patch is to remove them. mtdec/mfdec mtsdr1/mfsdr1 mtsrr0/mfsrr0 mtsrr1/mfsrr1	2020-03-08 06:02:55 +00:00
Xiangling Liao	362456bc53	[AIX] Handle LinkOnceODRLinkage and AppendingLinkage for static init gloabl arrays Handle LinkOnceODRLinkage; Handle AppendingLinkage type for llvm.global_ctors/dtors static init global arrays; Differential Revision: https://reviews.llvm.org/D75305	2020-03-06 09:26:55 -05:00
Fangrui Song	3e851f4a68	[PowerPC] Delete PPCMachObjectWriter and powerpc{,64}-apple-darwin Reviewed By: #powerpc, sfertile Differential Revision: https://reviews.llvm.org/D75494	2020-03-05 11:05:26 -08:00
Chris Bowler	c7b6fa8f4b	[AIX] Extend int arguments to register width when passed in stack memory. This is a follow up to the previous patch: [AIX] Implement caller arguments passed in stack memory. This corrects a defect in AIX 64-bit where an i32 is written to the stack with stw (4 bytes) rather than the expected std (8 bytes.) Integer arguments pass on the stack as images of their register representation. I also took the opportunity to tidy up some of the calling convention AIX tests I added in my last commit. This patch adds the missed assembly expected output for the stack arg int case, which would have caught this problem. Differential Revision: https://reviews.llvm.org/D75126	2020-03-05 11:49:16 -05:00
Joerg Sonnenberger	eb812efa12	Explicitly include <cassert> when using assert Depending on the OS used, a module-enabled build can fail due to the special handling <cassert> gets as textual header.	2020-03-02 22:45:28 +01:00
Kang Zhang	4962a0b26a	[NFC][PowerPC] Move some alias definition from PPCInstrInfo.td to PPCInstr64Bit.td Summary: Some 64-bit instructions alias definition is in PPCInstrInfo.td, it should be moved to PPCInstr64Bit.td.	2020-03-02 09:54:15 +00:00
Fangrui Song	9569a1472e	[PowerPC] Move .got2/.toc logic from PPCLinuxAsmPrinter::doFinalization() to emitEndOfAsmFile() Delete redundant .p2align 2 and improve tests.	2020-02-29 17:12:36 -08:00
Benjamin Kramer	186dd63182	ArrayRef'ize restoreCalleeSavedRegisters. NFCI. restoreCalleeSavedRegisters can mutate the contents of the CalleeSavedInfos, so use a MutableArrayRef.	2020-02-29 09:50:23 +01:00
Xiangling Liao	e7375e9932	[AIX] Remove whitelist checking for ExternalSymbolSDNodes Allow all ExternalSymbolSDNode on AIX, and rely on the linker error to find symbols which we don't have definitions from any library/compiler-rt. Differential Revision: https://reviews.llvm.org/D75075	2020-02-26 10:09:25 -05:00
Kang Zhang	b083d7a346	[PowerPC] Fix the unexpected modification caused by D62993 in LowerSELECT_CC for power9 Summary: The patch D62993 : `[PowerPC] Emit scalar min/max instructions with unsafe fp math` has modified the functionality when `Subtarget.hasP9Vector() && (!HasNoInfs \|\| !HasNoNaNs)`, this modification is not expected. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D74701	2020-02-26 02:59:03 +00:00
Craig Topper	735d27dc40	[SelectionDAG][PowerPC][AArch64][X86][ARM] Add chain input and output the ISD::FLT_ROUNDS_ This node reads the rounding control which means it needs to be ordered properly with operations that change the rounding control. So it needs to be chained to maintain order. This patch adds a chain input and output to the node and connects it to the chain in SelectionDAGBuilder. I've update all in-tree targets to connect their chain through their lowering code. Differential Revision: https://reviews.llvm.org/D75132	2020-02-25 16:58:23 -08:00
jasonliu	bee70bfff0	[XCOFF][AIX] Fix incorrect alignment for function descriptor csect Summary: Function descriptor csect on AIX should be 4 byte align instead of 1 byte align. Reviewer: daltenty Differential Revision: https://reviews.llvm.org/D74974	2020-02-24 18:15:17 +00:00
Sean Fertile	8efc2f5723	[PowerPC][AIX] Spill/restore the callee-saved condition register bits. Extends the existing support for spilling and restoring the condition register to the linkage area for 32-bit targets, and enables for AIX. Differential Revision: https://reviews.llvm.org/D74349	2020-02-24 11:24:46 -05:00
QingShan Zhang	8b3a62dc98	[NFC][PowerPC] Refactor the tryAndWithMask() Split the tryAndWithMask into several small calls. Differential Revision: https://reviews.llvm.org/D72250	2020-02-24 04:02:24 +00:00
Francis Visoiu Mistrih	a32d539798	[Target] Remove libObject dependency in lib/Target This removes a couple useless includes and the dependency of X86Desc on Object, which was useless as well.	2020-02-21 14:52:31 -08:00
Fangrui Song	fad1c750f1	[AArch64][SVE] Fix -DBUILD_SHARED_LIBS=on builds after -D74808/1874dee5662603c9251228c71b66de72cec0c979	2020-02-21 13:59:47 -08:00
Francis Visoiu Mistrih	1874dee566	[macho][NFC] Extract all CPU_(SUB_)TYPE logic to BinaryFormat This moves all the logic of converting LLVM Triples to MachO::CPU_(SUB_)TYPE from the specific target (Target)AsmBackend to more convenient functions in lib/BinaryFormat. This also gets rid of the separate two X86AsmBackend classes. The previous attempt was to add it to libObject, but that adds an unnecessary dependency to libObject from all the targets. Differential Revision: https://reviews.llvm.org/D74808	2020-02-21 12:43:29 -08:00
Sean Fertile	4fdaac0e1e	[PowerPC][NFC] Remove Darwin specific logic in frame finalization. Remove some cumbersome Darwin specific logic for updating the frame offsets of the condition-register spill slots. The containing function has an early return if the subtarget is not ELF based which makes the Darwin logic dead.	2020-02-21 09:32:24 -05:00
Francesco Petrogalli	0c8fa6db90	[llvm][build] Fix shared lib builds. [NFC] The code at https://reviews.llvm.org/D74808 has broken builds that are configured with -DBUILD_SHARED_LIBS=On. This patch adds the correct library dependencies.	2020-02-20 19:42:53 +00:00
Sean Fertile	da181d4ba0	[PowerPC][NFC] Cleanup some of the Darwin mentions in the README.txt.	2020-02-20 13:57:13 -05:00
Francis Visoiu Mistrih	3f785212e9	Revert "[macho][NFC] Extract all CPU_(SUB_)TYPE logic to libObject" This reverts commit `726c342ce2`. This breaks the windows bots with linker errors.	2020-02-20 10:51:25 -08:00
Francis Visoiu Mistrih	726c342ce2	[macho][NFC] Extract all CPU_(SUB_)TYPE logic to libObject This moves all the logic of converting LLVM Triples to MachO::CPU_(SUB_)TYPE from the specific target (Target)AsmBackend to more convenient functions in libObject. This also gets rid of the separate two X86AsmBackend classes. Differential Revision: https://reviews.llvm.org/D74808	2020-02-20 10:28:07 -08:00
Sean Fertile	3126b556d1	[PowerPC][NFC] Add defines to help creating the SpillSlot arrays. Create preprocessor defines for callee saved floating-point register spill slots, vector register spill slots, and both 32-bit and 64-bit general purpose register spill slots. This is an NFC refactor to prepare for adding ABI compliant callee saves and restores for AIX.	2020-02-18 11:52:04 -05:00
QingShan Zhang	113df90388	[PowerPC] Add the missing InstrAliasing for 64-bit rotate instructions We have the InstAlias rules for 32-bit rotate but missing the 64-bit one. Rotate left immediate rotlwi ra,rs,n rlwinm ra,rs,n,0,31 Rotate left rotlw ra,rs,rb rlwnm ra,rs,rb,0,31 Differential Revision: https://reviews.llvm.org/D72676	2020-02-17 05:42:49 +00:00
Zheng Chen	04377a81ae	[Powerpc] set instruction count as lsr first priority of lsr. On Powerpc, set instruction count as lsr first priority of lsr by default. Add an option ppc-lsr-no-insns-cost to return back to default lsr cost model. Reviewed By: steven.zhang, jsji Differential Revision: https://reviews.llvm.org/D72683	2020-02-16 21:04:55 -05:00
Fangrui Song	549b436beb	[MC] De-capitalize MCStreamer::Emit{Bundle,Addrsig}* etc So far, all non-COFF-related Emit* functions have been de-capitalized.	2020-02-15 09:11:48 -08:00
Fangrui Song	774971030d	[MCStreamer] De-capitalize EmitValue EmitIntValue{,InHex}	2020-02-14 23:08:40 -08:00
Fangrui Song	6d2d589b06	[MC] De-capitalize another set of MCStreamer::Emit* functions Emit{ValueTo,Code}Alignment Emit{DTP,TP,GP}* EmitSymbolValue etc	2020-02-14 19:26:52 -08:00
Fangrui Song	a55daa1461	[MC] De-capitalize some MCStreamer::Emit* functions	2020-02-14 19:11:53 -08:00
Fangrui Song	bcd24b2d43	[AsmPrinter][MCStreamer] De-capitalize EmitInstruction and EmitCFI*	2020-02-13 22:08:55 -08:00
Fangrui Song	1d49eb00d9	[AsmPrinter] De-capitalize all AsmPrinter::Emit* but EmitInstruction Similar to rL328848.	2020-02-13 17:06:24 -08:00
Fangrui Song	0bc77a0f0d	[AsmPrinter] De-capitalize some AsmPrinter::Emit* functions Similar to rL328848.	2020-02-13 13:38:33 -08:00
Fangrui Song	0dce409cee	[AsmPrinter] De-capitalize Emit{Function,BasicBlock]* and Emit{Start,End}OfAsmFile	2020-02-13 13:22:49 -08:00
Sean Fertile	b2d1e002ca	[PowerPC][NFC] Small cleanup to restore CR field code in PPCFrameLowering. Skip the loop over the CalleSavedInfos in 'restoreCalleeSavedRegisters' when the register is a CR field and we are not targeting 32-bit ELF. This is safe because: 1) The helper function 'restoreCRs' returns if the target is not 32-bit ELF, making all the code in the loop related to CR fields dead for every other subtarget. This code is only called on ELF right now, but the patch to extend it for AIX also needs to skip 'restoreCRs'. 2) The loop will not otherwise modify the iterator, so the iterator manipulations at the bottom of the loop end up setting 'I' to its current value. This simplifciation allows us to remove one argument from 'restoreCRs'. Also add a helper function to determine if a register is one of the callee saved condition register fields.	2020-02-13 09:50:28 -05:00
Qiu Chaofan	87c773082a	[PowerPC] Exploit VSX rounding instrs for rint Exploit native VSX rounding instruction, x(v\|s)r(d\|s)pic, which does rounding using current rounding mode. According to C standard library, rint may raise INEXACT exception while nearbyint won't. Reviewed By: lkail Differential Revision: https://reviews.llvm.org/D72685	2020-02-13 20:59:50 +08:00
Victor Huang	caa10988be	[PowerPC] Add options for PPC to enable/disable using non-volatile CR An option is added for PowerPC to disable use of non-volatile CR register fields and avoid CR spilling in the prologue. Differential Revision: https://reviews.llvm.org/D69835	2020-02-12 09:23:11 -06:00
Anil Mahmud	ab4d606421	[PowerPC] Add support for intrinsic llvm.ppc.eieio Add support for the intrinsic llvm.ppc.eieio to emit the instruction eieio. Differential Revision: https://reviews.llvm.org/D69066	2020-02-12 09:02:17 -06:00
Anil Mahmud	b413e5c309	[PowerPC] Add support for intrinsics llvm.ppc.dcbfl and llvm.ppc.dcbflp Added support for the intrinsic llvm.ppc.dcbfl and llvm.ppc.dcbflp. These will be used for emitting cache control instructions dcbfl and dcbflp which are actually mnemonics for using dcbf instruction with different immediate arguments. dcbfl ra, rb -> dcbf ra, rb, 1 dcbflp, ra, rb -> dcbf ra, rb, 3 Differential Revision: https://reviews.llvm.org/D68411	2020-02-12 09:02:17 -06:00
Eric Astor	8d5bf0422b	[ms] [llvm-ml] Add support for attempted register parsing Summary: Add a new method (tryParseRegister) that attempts to parse a register specification. MASM allows the use of IFDEF <register>, as well as IFDEF <symbol>. To accommodate this, we make it possible to check whether a register specification can be parsed at the current location, without failing the entire parse if it can't. Reviewers: thakis Reviewed By: thakis Tags: #llvm Differential Revision: https://reviews.llvm.org/D73486	2020-02-11 10:45:33 -05:00
Xiangling Liao	660b0d7f7b	[AIX] Enable frame pointer for AIX and add related test suite This patch: - enable frame pointer for AIX; - update some of red zone comments; - add/update testcases; Differential Revision: https://reviews.llvm.org/D72454	2020-02-10 15:43:41 -05:00
diggerlin	aa86311e62	[AIX][XCOFF] Support Mergeable2ByteCString and Mergeable4ByteCString SUMMARY: The patch is enable to support Mergeable2ByteCString and Mergeable4ByteCString Reviewers: daltenty Subscribers: wuzish, nemanjai, hiraditya Differential Revision: https://reviews.llvm.org/D74164	2020-02-10 14:45:54 -05:00
Craig Topper	eeb63944e4	[LegalizeTypes][ARM][AArch64][PowerPC][RISCV][X86] Use BUILD_PAIR to return expanded integer results from ReplaceNodeResults instead of just returning two results. Remove code from LegalizeTypes that allowed this to work. We were already using BUILD_PAIR for this in some places so this standardizes on a single way to do this.	2020-02-08 09:52:31 -08:00
Benjamin Kramer	e4230a9f6c	ArrayRef'ize spillCalleeSavedRegisters. NFCI.	2020-02-08 12:19:23 +01:00
Nemanja Ivanovic	26bf877ec5	[PowerPC] Fix spilling of vector registers in PEI of EH aware functions On little endian targets prior to Power9, we spill vector registers using a swapping store (i.e. stdxvd2x saves the vector with the two doublewords in big endian order regardless of endianness). This is generally not a problem since we restore them using the corresponding swapping load (lxvd2x). However if the restore is done by the unwinder, the vector register contains data in the incorrect order. This patch fixes that by using Altivec loads/stores for vector saves and restores in PEI (which keep the order correct) under those specific conditions: - EH aware function - Subtarget requires swaps for VSX memops (Little Endian prior to Power9) Differential revision: https://reviews.llvm.org/D73692	2020-02-07 14:41:52 -06:00
Sean Fertile	88073d40c7	[PowerPC] Create a FixedStack object for CR save in linkage area. hasReservedSpillSlot returns a dummy frame index of '0' on PPC64 for the non-volatile condition registers, which leads to the CalleSavedInfo either referencing an unrelated stack object, or an invalid object if there are no stack objects. The latter case causes the mir-printer to crash due to assertions that checks if the frame index referenced by a CalleeSavedInfo is valid. To fix the problem create an immutable FixedStack object at the correct offset in the linkage area of the previous stack frame (ie SP + positive offset). Differential Revision: https://reviews.llvm.org/D73709	2020-02-07 13:33:44 -05:00
Guillaume Chatelet	f85d3408e6	[NFC] Introduce an API for MemOp Summary: This patch introduces an API for MemOp in order to simplify and tighten the client code. Reviewers: courbet Subscribers: arsenm, nemanjai, jvesely, nhaehnle, hiraditya, kbarton, jsji, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73964	2020-02-07 11:32:27 +01:00
Chris Bowler	b373ec8ce7	[AIX] Implement caller arguments passed in stack memory. This patch implements the caller side of placing function call arguments in stack memory. This removes the current limitation where LLVM on AIX will report fatal error when arguments can't be contained in registers. There is a particular oddity that a float argument that passes in a register and also in stack memory requires that the caller initialize both. From what AIX "ABI" documentation I have it's not clear that this needs to be done, however, it is necessary for compatibility with the AIX XL compiler so I think it's best to implement it the same way. Note a later patch will follow to address the callee side. Differential Revision: https://reviews.llvm.org/D73209	2020-02-06 12:07:34 -05:00
Victor Huang	043e478721	[PowerPC][NFC] Clang-format on commit 4b414d	2020-02-05 13:47:54 -06:00
Justin Hibbits	b8dc54cf39	PowerPC: Remove redundancy in ternary for predicate selection rG2c4620ad57b8 inadvertently added redundancies in selection of GT and LE predicates for SPE. Correct this. Partially addresses PR 44768.	2020-02-04 10:38:21 -06:00
Guillaume Chatelet	b8144c0536	[NFC] Encapsulate MemOp logic Summary: This patch simply introduces functions instead of directly accessing the fields. This helps introducing additional check logic. A second patch will add simplifying functions. Reviewers: courbet Subscribers: arsenm, nemanjai, jvesely, nhaehnle, hiraditya, kbarton, jsji, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73945	2020-02-04 10:36:26 +01:00
David Tenty	77e71c5217	[AIX] Don't use a zero fill with a second parameter Summary: The AIX assembler .space directive can't take a second non-zero argument to fill with. But LLVM emitFill currently assumes it can. We add a flag to the AsmInfo to check if non-zero fill is supported, and if we can't zerofill non-zero values we just splat the .byte directives. Reviewers: stevewan, sfertile, DiggerLin, jasonliu, Xiangling_L Reviewed By: jasonliu Subscribers: Xiangling_L, wuzish, nemanjai, hiraditya, kbarton, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73554	2020-02-03 15:16:08 -05:00
Guillaume Chatelet	333f2ad8b8	[Alignment][NFC] Use Align for getMemcpy/Memmove/Memset Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: arsenm, dschuff, jyknight, sdardis, nemanjai, jvesely, nhaehnle, sbc100, jgravelle-google, hiraditya, aheejin, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, jsji, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73885	2020-02-03 17:13:19 +01:00
Simon Moll	5c8ba508b2	[NFC] unsigned->Register in storeRegTo/loadRegFromStack Summary: This patch makes progress on the 'unsigned -> Register' rewrite for `TargetInstrInfo::loadRegFromStack` and `TII::storeRegToStack`. Reviewers: arsenm, craig.topper, uweigand, jpienaar, atanasyan, venkatra, robertlytton, dylanmckay, t.p.northover, kparzysz, tstellar, k-ishizaka Reviewed By: arsenm Subscribers: wuzish, merge_guards_bot, jyknight, sdardis, nemanjai, jvesely, wdng, nhaehnle, hiraditya, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, jsji, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73870	2020-02-03 14:22:16 +01:00
Guillaume Chatelet	3c89b75f23	[NFC] Introduce a type to model memory operation Summary: This is a first step before changing the types to llvm::Align and introduce functions to ease client code. Reviewers: courbet Subscribers: arsenm, sdardis, nemanjai, jvesely, nhaehnle, hiraditya, kbarton, jrtc27, atanasyan, jsji, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73785	2020-01-31 17:29:01 +01:00
jasonliu	3bbe7a681e	[XCOFF][AIX] Support basic relocation type on AIX Summary: This patch intends to support three most common relocation type on AIX: R_POS, R_TOC, R_RBR. These three relocation type will be needed for object file generation on AIX for small code model. We will have follow up patches to bring relocation support for large code model on AIX. Reviewers: hubert.reinterpretcast, daltenty, DiggerLin Differential Revision: https://reviews.llvm.org/D72027	2020-01-30 15:59:09 +00:00
Stefan Pintilie	9de1241bb2	[PowerPC][Future] Branch Distance Estimation For Prefixed Instructions By adding the prefixed instructions the branch distances are no longer computed correctly. Since prefixed instructions cannot cross a 64 byte boundary we have to assume that a prefixed instruction may have a nop prepended to it. This patch tries to take that nop into consideration when computing the size of basic blocks. Differential Revision: https://reviews.llvm.org/D72572	2020-01-30 08:54:33 -06:00
Stefan Pintilie	f00be8da62	[PowerPC][Future] Prefixed Instructions 64 Byte Boundary Support A known limitation for Future CPU is that the new prefixed instructions may not cross 64 Byte boundaries. All instructions are already 4 byte aligned so the only situation where this can occur is when the prefix is in one 64 byte block and the instruction that is prefixed is at the top of the next 64 byte block. To fix this case PPCELFStreamer was added to intercept EmitInstruction. When a prefixed instruction is emitted we try to align it to 64 Bytes by adding a maximum of 4 bytes. If the prefixed instruction crosses the 64 Byte boundary then the alignment would trigger and a 4 byte nop would be added to push the instruction into the next 64 byte block. Differential Revision: https://reviews.llvm.org/D72570	2020-01-30 06:52:30 -06:00
Victor Huang	1492b70a03	[PowerPC][Future] Add prefixed loads and stores for future CPU A previous patch should have added pld and pstd and any support code in the backend that is required for prefixed load and store type operations. This patch adds a number of additional prefixed load and store type instructions for the future CPU. Differential Revision: https://reviews.llvm.org/D72577	2020-01-29 14:45:56 -06:00
Benjamin Kramer	adcd026838	Make llvm::StringRef to std::string conversions explicit. This is how it should've been and brings it more in line with std::string_view. There should be no functional change here. This is mostly mechanical from a custom clang-tidy check, with a lot of manual fixups. It uncovers a lot of minor inefficiencies. This doesn't actually modify StringRef yet, I'll do that in a follow-up.	2020-01-28 23:25:25 +01:00
Victor Huang	4b414d9ade	[PowerPC][Future] Add pld and pstd to future CPU Add the prefixed instructions pld and pstd to future CPU. These are load and store instructions that require new operand types that are 34 bits. This patch adds the two instructions as well as the operand types required. Note that this patch also makes a minor change to tablegen to account for the fact that some instructions are going to require shifts greater than 31 bits for the new 34 bit instructions. Differential Revision: https://reviews.llvm.org/D72574	2020-01-28 08:23:29 -06:00
@justice_adams (Justice Adams)	daee63f974	[SelectionDag] Updated FoldConstantArithmetic method signature in preparation for merge with FoldConstantVectorArithmetic Updated FoldConstantArithmetic method signature to match that of FoldConstantVectorArithmetic in preparation for merging the two functions together https://bugs.llvm.org/show_bug.cgi?id=36544 This is the first step in combining the various FoldConstantVectorArithmetic and FoldConstantVectorArithmetic functions into one FoldConstantArithmetic function. Differential Revision: https://reviews.llvm.org/D72870	2020-01-24 18:00:58 -05:00
Victor Huang	5cee34013c	[PowerPC][Future] Add prefixed instruction paddi to future CPU Future CPU will include support for prefixed instructions. These prefixed instructions are formed by a 4 byte prefix immediately followed by a 4 byte instruction effectively making an 8 byte instruction. The new instruction paddi is a prefixed form of addi. This patch adds paddi and all of the support required for that instruction. The majority of the patch deals with supporting the new prefixed instructions. The addition of paddi is mainly to allow for testing. Differential Revision: https://reviews.llvm.org/D72569	2020-01-24 07:27:25 -06:00
Guillaume Chatelet	805c157e8a	[Alignment][NFC] Deprecate Align::None() Summary: This is a follow up on https://reviews.llvm.org/D71473#inline-647262. There's a caveat here that `Align(1)` relies on the compiler understanding of `Log2_64` implementation to produce good code. One could use `Align()` as a replacement but I believe it is less clear that the alignment is one in that case. Reviewers: xbolva00, courbet, bollu Subscribers: arsenm, dylanmckay, sdardis, nemanjai, jvesely, nhaehnle, hiraditya, kbarton, jrtc27, atanasyan, jsji, Jim, kerbowa, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D73099	2020-01-24 12:53:58 +01:00
Fangrui Song	253379a56f	[PowerPC] Delete IsDarwin from AsmPrinter functions	2020-01-24 00:22:24 -08:00
Fangrui Song	a50567a31c	[PowerPC][MC] Delete PPCMCExpr::IsDarwin	2020-01-23 22:30:08 -08:00
Sean Fertile	9aa816a816	[PowerPC] Collect some CallLowering arguments into a struct. [NFC] Collect the calling convention and a number of boolean arguments into a structure to slightly reduces the number of arguments passed around between LowerCall_<Subtarget>, FinishCall and a few of the helpers. Also calulates if a call is indirect once using the exisitng helper and caches the result replacing several instances where we duplicated the logic determining if a call is indirect.	2020-01-22 16:55:27 -05:00

... 3 4 5 6 7 ...

6272 Commits