llvm-project/clang/lib/CodeGen
Yaxun (Sam) Liu 1d97cb1f6e [HIP] Emit amdgpu_code_object_version module flag
code object version determines ABI, therefore should not be mixed.

This patch emits amdgpu_code_object_version module flag in LLVM IR
based on code object version (default 4).

The amdgpu_code_object_version value is code object version times 100.

LLVM IR with different amdgpu_code_object_version module flag cannot
be linked.

The -cc1 option -mcode-object-version=none is for ROCm device library use
only, which supports multiple ABI.

Reviewed by: Artem Belevich

Differential Revision: https://reviews.llvm.org/D119026
2022-02-08 21:58:40 -05:00
..
ABIInfo.h Introduce _BitInt, deprecate _ExtInt 2021-12-06 12:52:01 -05:00
Address.h [clang] Use proper type to left shift after D117262 2022-01-26 17:54:37 -08:00
BackendUtil.cpp [BitcodeWriter] Fix cases of some functions 2022-01-31 16:46:11 -08:00
CGAtomic.cpp [clang] Remove some getPointerElementType() uses 2022-01-25 12:09:06 +01:00
CGBlocks.cpp [CodeGen] Treat ObjC `__unsafe_unretained` and class types as trivial 2022-01-11 11:18:24 -08:00
CGBlocks.h [CodeGen] Treat ObjC `__unsafe_unretained` and class types as trivial 2022-01-11 11:18:24 -08:00
CGBuilder.h [clang] Fix some clang->llvm type cache invalidation issues 2022-02-07 18:59:09 -08:00
CGBuiltin.cpp [X86] Remove __builtin_ia32_padd/psub saturated intrinsics and use generic __builtin_elementwise_add/sub_sat 2022-02-08 15:00:10 +00:00
CGCUDANV.cpp [HIP] Fix -fgpu-rdc for Windows 2021-12-06 16:42:23 -05:00
CGCUDARuntime.cpp
CGCUDARuntime.h [HIP] Emit kernel symbol 2021-03-01 16:31:40 -05:00
CGCXX.cpp [OpaquePtr] Remove uses of CGF.Builder.CreateConstInBoundsGEP1_64() without type 2021-07-17 17:07:46 +02:00
CGCXXABI.cpp [MS] Implement on-demand TLS initialization for Microsoft CXX ABI 2022-01-13 21:23:23 -08:00
CGCXXABI.h [MS] Implement on-demand TLS initialization for Microsoft CXX ABI 2022-01-13 21:23:23 -08:00
CGCall.cpp [X86] Implement -fzero-call-used-regs option 2022-02-08 17:42:54 -08:00
CGCall.h [clang] Remove some getPointerElementType() uses 2022-01-25 12:09:06 +01:00
CGClass.cpp [NFC][Clang][OpaquePtr] Move away from deprecated Address constructor in EmitCXXMemberDataPointerAddress() 2022-01-27 14:44:53 -08:00
CGCleanup.cpp Skip exception cleanups when the innermost scope is EHTerminateScope. 2022-02-02 17:50:18 -05:00
CGCleanup.h [clang] Remove unused forward declarations (NFC) 2022-01-08 11:56:40 -08:00
CGCoroutine.cpp [Coroutines] Set presplit attribute in Clang and mlir 2022-01-05 10:25:02 +08:00
CGDebugInfo.cpp [ADT] Add an in-place version of toHex() 2022-01-11 11:51:04 +01:00
CGDebugInfo.h [clang] Remove unused forward declarations (NFC) 2022-01-08 11:56:40 -08:00
CGDecl.cpp [CodeGen] Make element type in emitArrayDestroy() predictable 2022-01-11 09:25:29 +01:00
CGDeclCXX.cpp [CodeGen] Make element type in emitArrayDestroy() predictable 2022-01-11 09:25:29 +01:00
CGException.cpp [CodeGen] Avoid some pointer element type accesses 2021-12-21 14:16:28 +01:00
CGExpr.cpp [MatrixBuilder] Remove unnecessary IRBuilder template (NFC) 2022-02-07 16:42:38 +01:00
CGExprAgg.cpp [NFC][Clang][OpaquePtr] Move away from deprecated Address constructor in VisitArrayInitLoopExpr 2022-01-27 14:44:53 -08:00
CGExprCXX.cpp [NFC][Clang][OpaquePtr] Move away from deprecated Address constructor in EmitNewArrayInitializer() 2022-01-27 14:00:16 -08:00
CGExprComplex.cpp [Matrix] Implement C-style explicit type conversions for matrix types. 2021-04-10 11:48:41 +01:00
CGExprConstant.cpp Remove reference type when checking const structs 2022-01-28 13:08:58 -08:00
CGExprScalar.cpp [MatrixBuilder] Remove unnecessary IRBuilder template (NFC) 2022-02-07 16:42:38 +01:00
CGGPUBuiltin.cpp [OpenMP] Lower printf to __llvm_omp_vprintf 2021-11-10 15:30:56 +00:00
CGLoopInfo.cpp [Clang] Ensure vector predication loop metadata is always emitted when pragma is specified. 2021-02-13 17:35:54 -06:00
CGLoopInfo.h [SVE] Add support to vectorize_width loop pragma for scalable vectors 2021-01-08 11:37:27 +00:00
CGNonTrivialStruct.cpp [CodeGen] Prefer CreateElementBitCast() where possible 2021-12-15 11:48:39 +01:00
CGObjC.cpp Revert "Rename llvm::array_lengthof into llvm::size to match std::size from C++17" 2022-01-26 16:55:53 +01:00
CGObjCGNU.cpp [NFC] Remove uses of PointerType::getElementType() 2022-01-25 09:44:52 +01:00
CGObjCMac.cpp [ObjC] Emit selector load right before msgSend call. 2022-01-26 20:52:54 -08:00
CGObjCRuntime.cpp [clang] Add range accessor for ObjCAtTryStmt catch_stmts and use it 2021-10-27 08:57:05 -04:00
CGObjCRuntime.h Fix a variety of bugs with nil-receiver checks when targeting 2021-10-08 05:44:06 -04:00
CGOpenCLRuntime.cpp [clang] Cache OpenCL types 2022-02-07 09:23:04 -08:00
CGOpenCLRuntime.h [clang] Cache OpenCL types 2022-02-07 09:23:04 -08:00
CGOpenMPRuntime.cpp Enable inoutset dependency-type in depend clause. 2022-02-08 08:35:36 -05:00
CGOpenMPRuntime.h [clang] Remove unused forward declarations (NFC) 2022-01-08 11:56:40 -08:00
CGOpenMPRuntimeGPU.cpp [OpenMP] Completely remove old device runtime 2022-02-04 15:31:33 -05:00
CGOpenMPRuntimeGPU.h [OpenMP][FIX] Pass the num_threads value directly to parallel_51 2021-12-09 16:30:29 -05:00
CGRecordLayout.h [clang] Remove redundant member initialization (NFC) 2022-01-02 10:20:23 -08:00
CGRecordLayoutBuilder.cpp [clang] Fix bugprone argument comments (NFC) 2022-01-09 00:19:49 -08:00
CGStmt.cpp OMPIRBuilder for Interop directive 2022-01-27 14:53:18 -05:00
CGStmtOpenMP.cpp [Clang][Sema][OpenMP] Sema support for `atomic compare` 2022-02-04 12:30:56 -05:00
CGVTT.cpp [AMDGPU] Set the default globals address space to 1 2020-11-20 15:46:53 +00:00
CGVTables.cpp [clang] Use true/false instead of 1/0 (NFC) 2022-01-09 00:19:47 -08:00
CGVTables.h
CGValue.h [CodeGen] Store element type in RValue 2021-12-17 09:05:59 +01:00
CMakeLists.txt [clang][openmp][NFC] Remove arch-specific CGOpenMPRuntimeGPU files 2021-11-09 15:11:05 -05:00
CodeGenABITypes.cpp
CodeGenAction.cpp [OpenMP] Add a flag for embedding a file into the module 2022-01-31 15:56:00 -05:00
CodeGenFunction.cpp [X86] Implement -fzero-call-used-regs option 2022-02-08 17:42:54 -08:00
CodeGenFunction.h OMPIRBuilder for Interop directive 2022-01-27 14:53:18 -05:00
CodeGenModule.cpp [HIP] Emit amdgpu_code_object_version module flag 2022-02-08 21:58:40 -05:00
CodeGenModule.h [clang][CodeGen][NFC] Remove unused CodeGenModule fields 2022-01-31 23:45:53 +02:00
CodeGenPGO.cpp [clang] Remove redundant member initialization (NFC) 2022-01-02 10:20:23 -08:00
CodeGenPGO.h [PGO] Don't reference functions unless value profiling is enabled 2021-05-20 11:09:24 -07:00
CodeGenTBAA.cpp Introduce _BitInt, deprecate _ExtInt 2021-12-06 12:52:01 -05:00
CodeGenTBAA.h [clang] Remove unused forward declarations (NFC) 2022-01-08 11:56:40 -08:00
CodeGenTypeCache.h Fix __attribute__((annotate("")) with non-zero globals AS 2021-08-26 10:09:40 +01:00
CodeGenTypes.cpp [clang] Properly cache member pointer LLVM types 2022-02-08 13:22:24 -08:00
CodeGenTypes.h [clang] Properly cache member pointer LLVM types 2022-02-08 13:22:24 -08:00
ConstantEmitter.h
ConstantInitBuilder.cpp
CoverageMappingGen.cpp [clang] Use isa instead of dyn_cast (NFC) 2021-11-14 09:32:40 -08:00
CoverageMappingGen.h [Driver] Rename -fprofile-{prefix-map,compilation-dir} to -fcoverage-{prefix-map,compilation-dir} 2021-02-25 21:40:12 -08:00
EHScopeStack.h [Windows SEH]: HARDWARE EXCEPTION HANDLING (MSVC -EHa) - Part 1 2021-05-17 22:42:17 -07:00
ItaniumCXXABI.cpp [NFC] Remove uses of PointerType::getElementType() 2022-01-25 09:44:52 +01:00
MacroPPCallbacks.cpp
MacroPPCallbacks.h [clang] Remove unused forward declarations (NFC) 2022-01-08 11:56:40 -08:00
MicrosoftCXXABI.cpp [MS] Implement on-demand TLS initialization for Microsoft CXX ABI 2022-01-13 21:23:23 -08:00
ModuleBuilder.cpp [clang][macho] add clang frontend support for emitting macho files with two build version load commands 2022-02-02 08:30:39 -08:00
ObjectFilePCHContainerOperations.cpp Emit the C++ dialect in -gmodules .pcm files. 2022-01-10 16:13:40 -08:00
PatternInit.cpp
PatternInit.h
README.txt Revert "This is a test commit" 2020-12-23 13:04:37 -06:00
SanitizerMetadata.cpp Revert "[asan] Add support for disable_sanitizer_instrumentation attribute" 2021-12-10 14:33:38 -08:00
SanitizerMetadata.h
SwiftCallingConv.cpp Teach the swift calling convention about _Atomic types 2020-08-31 07:07:25 -07:00
TargetInfo.cpp [clang][CodeGen] Use memory type representation in `va_arg` 2022-02-04 12:10:57 +01:00
TargetInfo.h [clang] Remove unused forward declarations (NFC) 2022-01-08 11:56:40 -08:00
VarBypassDetector.cpp [clang,NFC] Fix typos in file headers 2021-02-25 12:47:02 -08:00
VarBypassDetector.h Use {DenseSet,SmallPtrSet}::contains (NFC) 2021-10-29 20:26:07 -07:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//