llvm-project

History

Peter Smith e63455d5e0 [MC] Use local MCSubtargetInfo in writeNops On some architectures such as Arm and X86 the encoding for a nop may change depending on the subtarget in operation at the time of encoding. This change replaces the per module MCSubtargetInfo retained by the targets AsmBackend in favour of passing through the local MCSubtargetInfo in operation at the time. On Arm using the architectural NOP instruction can have a performance benefit on some implementations. For Arm I've deleted the copy of the AsmBackend's MCSubtargetInfo to limit the chances of this causing problems in the future. I've not done this for other targets such as X86 as there is more frequent use of the MCSubtargetInfo and it looks to be for stable properties that we would not expect to vary per function. This change required threading STI through MCNopsFragment and MCBoundaryAlignFragment. I've attempted to take into account the in tree experimental backends. Differential Revision: https://reviews.llvm.org/D45962		2021-09-07 15:46:19 +01:00
..
AsmParser	[llvm] Rename StringRef _lower() method calls to _insensitive()	2021-06-25 00:22:01 +03:00
Disassembler	[SPARC] recognize the "rd %pc, reg" special form	2021-05-23 22:52:59 +02:00
MCTargetDesc	[MC] Use local MCSubtargetInfo in writeNops	2021-09-07 15:46:19 +01:00
TargetInfo	…
CMakeLists.txt	…
DelaySlotFiller.cpp	…
LeonFeatures.td	…
LeonPasses.cpp	[llvm] Rename StringRef _lower() method calls to _insensitive()	2021-06-25 00:22:01 +03:00
LeonPasses.h	…
README.txt	…
Sparc.h	…
Sparc.td	…
SparcAsmPrinter.cpp	[SPARC] Recognize and handle the %lm(sym) operator	2021-02-08 19:25:33 -05:00
SparcCallingConv.td	…
SparcFrameLowering.cpp	[NFC][CodeGen] Tidy up TargetRegisterInfo stack realignment functions	2021-03-30 17:31:39 +01:00
SparcFrameLowering.h	…
SparcISelDAGToDAG.cpp	…
SparcISelLowering.cpp	Rename MachineMemOperand::getOrdering -> getSuccessOrdering.	2021-06-21 16:49:27 -07:00
SparcISelLowering.h	[TargetLowering] move "o" and "X" constraint handling to base class	2021-04-19 10:53:31 -07:00
SparcInstr64Bit.td	…
SparcInstrAliases.td	[SPARC] recognize the "rd %pc, reg" special form	2021-05-23 22:52:59 +02:00
SparcInstrFormats.td	[Sparc] Fix multiclass template parameter types. NFC.	2021-02-06 15:33:09 +00:00
SparcInstrInfo.cpp	…
SparcInstrInfo.h	…
SparcInstrInfo.td	[SPARC] recognize the "rd %pc, reg" special form	2021-05-23 22:52:59 +02:00
SparcInstrVIS.td	…
SparcMCInstLower.cpp	…
SparcMachineFunctionInfo.cpp	…
SparcMachineFunctionInfo.h	…
SparcRegisterInfo.cpp	…
SparcRegisterInfo.h	…
SparcRegisterInfo.td	[SPARC] recognize the "rd %pc, reg" special form	2021-05-23 22:52:59 +02:00
SparcSchedule.td	…
SparcSubtarget.cpp	…
SparcSubtarget.h	…
SparcTargetMachine.cpp	Normalize interaction with boolean attributes	2021-04-17 08:17:33 +02:00
SparcTargetMachine.h	…
SparcTargetObjectFile.cpp	…
SparcTargetObjectFile.h	…

README.txt

To-do
-----

* Keep the address of the constant pool in a register instead of forming its
  address all of the time.
* We can fold small constant offsets into the %hi/%lo references to constant
  pool addresses as well.
* When in V9 mode, register allocate %icc[0-3].
* Add support for isel'ing UMUL_LOHI instead of marking it as Expand.
* Emit the 'Branch on Integer Register with Prediction' instructions.  It's
  not clear how to write a pattern for this though:

float %t1(int %a, int* %p) {
        %C = seteq int %a, 0
        br bool %C, label %T, label %F
T:
        store int 123, int* %p
        br label %F
F:
        ret float undef
}

codegens to this:

t1:
        save -96, %o6, %o6
1)      subcc %i0, 0, %l0
1)      bne .LBBt1_2    ! F
        nop
.LBBt1_1:       ! T
        or %g0, 123, %l0
        st %l0, [%i1]
.LBBt1_2:       ! F
        restore %g0, %g0, %g0
        retl
        nop

1) should be replaced with a brz in V9 mode.

* Same as above, but emit conditional move on register zero (p192) in V9
  mode.  Testcase:

int %t1(int %a, int %b) {
        %C = seteq int %a, 0
        %D = select bool %C, int %a, int %b
        ret int %D
}

* Emit MULX/[SU]DIVX instructions in V9 mode instead of fiddling
  with the Y register, if they are faster.

* Codegen bswap(load)/store(bswap) -> load/store ASI

* Implement frame pointer elimination, e.g. eliminate save/restore for
  leaf fns.
* Fill delay slots

* Use %g0 directly to materialize 0. No instruction is required.