llvm-project

Commit Graph

Author	SHA1	Message	Date
zhoujing	198eea9938	[VENTUS][feat] Support variadic function && enable address space in vastart/vaend	2023-08-08 15:45:41 +08:00
zhoujingya	8ba248d102	[VENTUS][RISCV] Add vararg support Because Ventus RISC-V is designed specifically for the OpenCL language, we originally added or removed language features mainly to serve OpenCL. We now need to add a customized `printf` function, which is expected to be written in C, so we also need to support C language features in Ventus. Signed-off-by: zhoujingya <jing.zhou@terapines.com>	2023-04-13 15:00:35 +08:00
Aries	9c54c010b2	[clang] Add initial support to Ventus GPGPU calling convention for llvm IR codegen.	2022-12-14 11:31:30 +08:00
Weining Lu	47edc70866	[LoongArch] Specify registers used for exception handling See definition in backend D134709 and the doc [1] for more detail. With the benefit of this change, most libcxx and libcxxabi tests pass. [1]: https://llvm.org/docs/ExceptionHandling.html Reviewed By: xen0n, wangleiat Differential Revision: https://reviews.llvm.org/D139177	2022-12-05 11:42:41 +08:00
Vitaly Buka	9e8787821f	[test][CodeGen] Check noundef for omitted return	2022-12-04 19:10:17 -08:00
Vitaly Buka	262d6d495c	[test][CodeGen] Check noundef for return value	2022-12-04 19:10:17 -08:00
Fangrui Song	eecb22d8e1	[SanitizerBinaryMetadata] Use weak __start_/__stop_ instead of dummy empty section D130887 uses a dummy empty section `sanmd_covered` (with the SHF_GNU_RETAIN flag on ELF) to prevent `undefined symbol: __start_sanmd_covered` if all `sanmd_covered` are discarded by `ld --gc-sections` (in `-z start-stop-gc` mode). The dummy `sanmd_covered` does not have the SHF_LINK_ORDER flag, so mixing it with SHF_LINK_ORDER `sanmd_covered` causes an issue to GNU ld<2.36 (https://sourceware.org/bugzilla/show_bug.cgi?id=26256). Similar to D98903 for SanitizerCoverage, let's make encapsulation symbols undefined weak[1]. This additionally avoids size cost due to the dummy section and symbol. [1]: https://maskray.me/blog/2021-01-31-metadata-sections-comdat-and-shf-link-order Reviewed By: melver Differential Revision: https://reviews.llvm.org/D139276	2022-12-04 15:06:34 -08:00
John McIver	ee13633c46	[NFC][clang] Strengthen checks in avx512fp16-builtins.c * Add end-of-line check to load instructions	2022-12-04 14:57:43 +00:00
John McIver	2389488437	[NFC][clang] Strengthen checks in avx512f-builtins.c * Add check to unnamed portion of nontemporal attribute * Add end-of-line check to load instructions	2022-12-04 14:55:41 +00:00
Paul Robinson	64e4d03c68	[lit][AIX] Convert clang tests to use 'target={{.}}-aix{{.}}' Part of the project to eliminate special handling for triples in lit expressions. Differential Revision: https://reviews.llvm.org/D137437	2022-12-02 09:44:15 -08:00
Xiang1 Zhang	94c5df8a76	[AMX] Support AMX-FP16 new intrinsic interface We now support the AMX-FP16 ISA (https://reviews.llvm.org/D135941). The old intrinsic interface requires tile registers to be written manually, so this patch adds a new intrinsic interface that lets the compiler perform register allocation. Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D138987	2022-12-01 09:47:53 +08:00
gonglingqin	624401612c	[LoongArch] Add remaining intrinsics for CRC check instructions After D137316 implements the intrinsics of the first crc check instruction and related diagnosis, this patch implements the intrinsics of all remaining crc check instructions. Differential Revision: https://reviews.llvm.org/D138418	2022-12-01 09:40:50 +08:00
Paul Robinson	2fbcf8b9b3	[Hexagon] Convert tests to check 'target=hexagon-.*' Part of the project to eliminate special handling for triples in lit expressions.	2022-11-30 13:36:10 -08:00
Henrik G. Olsson	8fa2e93538	[clang] Do not merge traps in functions annotated optnone This aligns the behaviour with that of disabling optimisations for the translation unit entirely. Not merging the traps allows us to keep separate debug information for each, improving the debugging experience when finding the cause for a ubsan trap. Differential Revision: https://reviews.llvm.org/D137714	2022-11-30 15:06:32 +01:00
Bjorn Pettersson	076cda0aaa	[clang][CodeGen] Switch tests to use opt -passes	2022-11-28 12:12:49 +01:00
Ayke van Laethem	131cddcba2	[AVR] Fix broken bitcast for aliases in non-zero address space This was triggered by some code in picolibc. The minimal version looks like this: double infinity(void) { return 5; } extern long double infinityl() __attribute__((__alias__("infinity"))); These two declarations have a different type (not because of the 'long double', which is also 'double' in IR, but because infinityl has variadic parameters). This led to a crash in the bitcast which assumed address space 0. Differential Revision: https://reviews.llvm.org/D138681	2022-11-27 15:27:42 +01:00
Alex Richardson	54ad4d2dd1	Drop redundant pipe to opt -instnamer in clang tests This used to be required, but the difference between asserts/!asserts builds no longer exists for %clang_cc1 (only for %clang), so they pass just fine without this flag.	2022-11-25 11:34:55 +00:00
Sami Tolvanen	5a3d6ce956	[Clang][Driver] Add KCFI to SupportsCoverage Allow `-fsanitize=kcfi` to be enabled with `-fsanitize-coverage=` modes such as `trace-{pc,cmp}`. Link: https://github.com/ClangBuiltLinux/linux/issues/1743 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D138458	2022-11-22 18:20:04 +00:00
KAWASHIMA Takahiro	3a95d7d098	[clang] Fix -fp-model={strict|precise} to disable -fapprox-func `-fapprox-func` should be disabled by `-fp-model={strict|precise}`, just like the other fast-math flags. See the last changes in `clang/test/Driver/fp-model.c`. Probably this route (`case options::OPT_ffp_model_EQ`) was not updated in D106191 and D114564. There is no good reason not to disable the flag. This commit also updates other regression tests, which are not directly related to this bug, for consistency with other fast-math flags. Differential Revision: https://reviews.llvm.org/D138109	2022-11-22 13:04:26 +09:00
Thomas Lively	ae96b5bd2d	[WebAssembly] Update relaxed-simd instruction names Including builtin and intrinsic names. These should be the final names for the proposal. https://github.com/WebAssembly/relaxed-simd/blob/main/proposals/relaxed-simd/Overview.md Reviewed By: aheejin, maratyszcza Differential Revision: https://reviews.llvm.org/D138249	2022-11-21 12:40:15 -08:00
Nathan Sidwell	eff9d72b9b	[clang] NFC: Robustify sret test regex Replace old-style, brittle, grep with new-fangled FileCheck technology. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D137941	2022-11-21 14:20:47 -05:00
John Brawn	9e3264ab20	[FPEnv] Enable strict fp for AArch64 in clang The AArch64 target now has the necessary support for strict fp, so enable it in clang. Differential Revision: https://reviews.llvm.org/D138143	2022-11-21 16:02:54 +00:00
gonglingqin	c2ec455f18	[LoongArch] Add intrinsics for ibar, break and syscall Diagnostics for intrinsic input parameters have also been added. Differential Revision: https://reviews.llvm.org/D138094	2022-11-21 09:31:26 +08:00
yronglin	80f444646c	[CodeGen][ARM] Fix ARMABIInfo::EmitVAAarg crash with empty record type variadic arg Fix ARMABIInfo::EmitVAAarg crash with empty record type variadic arg Open issue: https://github.com/llvm/llvm-project/issues/58794 Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D138137	2022-11-19 15:14:10 +08:00
Xing Xue	fa7477eb87	[Clang][CodeGen][AIX] Map __builtin_frexpl, __builtin_ldexpl, and __builtin_modfl to 'double' version lib calls in 64-bit 'long double' mode Summary: AIX library functions frexpl(), ldexpl(), and modfl() are for 128-bit IBM long double, i.e. __ibm128. Other *l() functions, e.g., acosl(), are for 64-bit long double. The AIX Clang compiler currently maps builtin functions __builtin_frexpl(), __builtin_ldexpl(), and __builtin_modfl() to frexpl(), ldexpl(), and modfl() in 64-bit long double mode which results in seg-faults or incorrect return values. This patch changes to map __builtin_frexpl(), __builtin_ldexpl(), and __builtin_modfl() to double version lib functions frexp(), ldexp() and modf() in 64-bit long double mode. Reviewed by: hubert.reinterpretcast, daltenty Differential Revision: https://reviews.llvm.org/D137986	2022-11-18 11:36:56 -05:00
Alexander Shaposhnikov	f102fe7304	Revert "Revert "[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm"" This reverts commit `7f608a2497` and removes the dependency of Object on IRPrinter.	2022-11-18 08:58:31 +00:00
Mikhail Goncharov	7f608a2497	Revert "[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm" This reverts commit `34ab474348`. as it has introduced circular dependency lib - analysis	2022-11-18 09:25:45 +01:00
Alexander Shaposhnikov	34ab474348	[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm Enable using -module-summary with -S (similarly to what currently can be achieved with opt <input> -o - | llvm-dis). This is a recommit of `ef9e62469`. Test plan: ninja check-all Differential revision: https://reviews.llvm.org/D137768	2022-11-18 05:04:07 +00:00
Qiu Chaofan	cab9c02bd9	[Clang] Fix behavior of -ffp-model option when overridden `-ffp-model=strict -ffp-model=fast` will still enable strict exception handling behavior, therefore clang still emits constrained FP operations in IR. `-ffp-model=fast -ffp-model=strict` emits two warnings: one for strict overriding fast, the other for strict overriding strict, which is confusing. Reviewed By: zahiraam Differential Revision: https://reviews.llvm.org/D137618	2022-11-18 10:34:41 +08:00
Craig Topper	c9320bc871	[X86] Use correctly sized floating point literals in *zero_ps/pd. This avoids depending on int->float or double->float conversion. Improving codegen with #pragma STDC FENV_ACCESS ON. Really we should improve constant folding somewhere, but this was a cheap and easy improvement. Fixes PR59052.	2022-11-17 14:28:52 -08:00
Roman Lebedev	8adfa29706	[Pipelines] Introduce SROA after (final, run-time) loop unrolling Now that we are done with loop unrolling, be it by the LoopVectorizer or the LoopUnroll pass, some variable-offset GEPs into allocas could have become constant-offset, thus enabling SROA and alloca promotion, yet we don't capitalize on that, which is surprising. While it would be good not to introduce one more SROA invocation, but instead move the one from `PassBuilder::buildFunctionSimplificationPipeline()`, the existing test coverage says that is a bad idea, though it would be fine compile-time wise: https://llvm-compile-time-tracker.com/compare.php?from=b150d34c47efbd8fa09604bce805c0920360f8d7&to=5a9a5c855158b482552be8c7af3e73d67fa44805&stat=instructions So instead, I add yet another SROA run. I have checked, and it needs to be at least after said final loop unrolling. This is still fine compile-time wise: https://llvm-compile-time-tracker.com/compare.php?from=70324cd88328c0924e605fa81b696572560aa5c9&to=fb489bbef687ad821c3173a931709f9cad9aee8a&stat=instructions I've encountered this in real code; `SROA-after-final-loop-unrolling.ll` has been reduced from https://godbolt.org/z/fsdMhETh3 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D136806	2022-11-17 21:31:30 +03:00
Alex Brachet	0dff945bbc	Fix debug-info test	2022-11-17 16:02:54 +00:00
Ben Shi	84ef723573	[clang] Fix wrong ABI of AVRTiny. A scalar which exceeds 4 bytes should be returned via a stack slot, on an AVRTiny device. Reviewed By: aykevl Differential Revision: https://reviews.llvm.org/D138125	2022-11-17 08:38:44 +08:00
gonglingqin	ddbb21bdb5	[LoongArch] Add immediate operand validity check for __builtin_loongarch_dbar Differential Revision: https://reviews.llvm.org/D137809	2022-11-16 14:47:45 +08:00
Michele Scandale	b7d7c448df	Fix `unsafe-fp-math` attribute emission. The conditions under which Clang emits the `unsafe-fp-math` function attribute were modified as part of `84a9ec2ff1ee97fd7e8ed988f5e7b197aab84a7`. In the backend code generators `"unsafe-fp-math"="true"` enables floating point contraction for the whole function. The intent of the change in `84a9ec2ff1ee97fd7e8ed988f5e7b197aab84a7` was to prevent backend code generators performing contractions when that is not expected. However the change is inaccurate and incomplete because it allows `unsafe-fp-math` to be set also when only in-statement contraction is allowed. Consider the following example ``` float foo(float a, float b, float c) { float tmp = a * b; return tmp + c; } ``` and compile it with the command line ``` clang -fno-math-errno -funsafe-math-optimizations -ffp-contract=on \ -O2 -mavx512f -S -o - ``` The resulting assembly has a `vfmadd213ss` instruction which corresponds to a fused multiply-add. From the user perspective there shouldn't be any contraction because the multiplication and the addition are not in the same statement. The optimized IR is: ``` define float @test(float noundef %a, float noundef %b, float noundef %c) #0 { %mul = fmul reassoc nsz arcp afn float %b, %a %add = fadd reassoc nsz arcp afn float %mul, %c ret float %add } attributes #0 = { [...] "no-signed-zeros-fp-math"="true" "no-trapping-math"="true" [...] "unsafe-fp-math"="true" } ``` The `"unsafe-fp-math"="true"` function attribute allows the backend code generator to perform `(fadd (fmul a, b), c) -> (fmadd a, b, c)`. In the current IR representation there is no way to determine the statement boundaries from the original source code. Because of this, for in-statement-only contraction the generated IR doesn't have instructions with the `contract` fast-math flag, and `llvm.fmuladd` is used to represent contraction opportunities that occur within a single statement. Therefore `"unsafe-fp-math"="true"` can only be emitted when contraction across statements is allowed. Moreover the change in `84a9ec2ff1ee97fd7e8ed988f5e7b197aab84a7` doesn't take into account that the floating point math function attributes can be refined during IR code generation of a function to handle the cases where the floating point math options are modified within a compound statement via pragmas (see `CGFPOptionsRAII`). For consistency, `unsafe-fp-math` needs to be disabled if the contraction mode for any scope/operation is not `fast`. Similarly, for consistency, the initialization of `UnsafeFPMath` in `TargetOptions` for the backend code generation should take into account the contraction mode as well. Reviewed By: zahiraam Differential Revision: https://reviews.llvm.org/D136786	2022-11-14 20:40:57 -08:00
Roman Lebedev	b2fbafc911	[NFC][Clang] Autogenerate checklines in a test being affected by a patch	2022-11-15 03:51:24 +03:00
Fangrui Song	77bf0df376	Revert "[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm" This reverts commit `bf8381a8bc`. There is a layering violation: LLVMAnalysis depends on LLVMCore, so LLVMCore should not include LLVMAnalysis header llvm/Analysis/ModuleSummaryAnalysis.h	2022-11-14 15:51:03 -08:00
Alexander Shaposhnikov	bf8381a8bc	[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm Enable using -module-summary with -S (similarly to what currently can be achieved with opt <input> -o - | llvm-dis). This is a recommit of `ef9e62469`. Test plan: ninja check-all Differential revision: https://reviews.llvm.org/D137768	2022-11-14 23:24:08 +00:00
Alexander Shaposhnikov	8c15c17e3b	Revert "[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm" This reverts commit `ef9e624694` for further investigation offline. It appears to break the buildbot llvm-clang-x86_64-sie-ubuntu-fast.	2022-11-14 21:31:30 +00:00
Alexander Shaposhnikov	ef9e624694	[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm Enable using -module-summary with -S (similarly to what currently can be achieved with opt <input> -o - | llvm-dis). Test plan: ninja check-all Differential revision: https://reviews.llvm.org/D137768	2022-11-14 21:11:07 +00:00
Joshua Batista	a5d14f757b	Add builtin_elementwise_sin and builtin_elementwise_cos Add codegen for llvm cos and sin elementwise builtins The sin and cos elementwise builtins are necessary for HLSL codegen. Tests were added to make sure that the expected errors are encountered when these functions are given inputs of incompatible types. The new builtins are restricted to floating point types only. Reviewed By: craig.topper, fhahn Differential Revision: https://reviews.llvm.org/D135011	2022-11-10 23:30:27 -08:00
gonglingqin	da34aff90d	[Clang][LoongArch] Implement __builtin_loongarch_crc_w_d_w builtin and add diagnostics This patch adds support to prevent __builtin_loongarch_crc_w_d_w from compiling on loongarch32 in the front end and adds diagnostics accordingly. Reference: https://github.com/gcc-mirror/gcc/blob/master/gcc/config/loongarch/larchintrin.h#L175-L184 Depends on D136906 Differential Revision: https://reviews.llvm.org/D137316	2022-11-11 09:16:57 +08:00
gonglingqin	85f08c4197	[Clang][LoongArch] Implement __builtin_loongarch_dbar builtin Differential Revision: https://reviews.llvm.org/D136906	2022-11-10 17:27:44 +08:00
Matt Jacobson	dd9f7963e4	[ObjC] avoid crashing when emitting synthesized getter/setter and ptrdiff_t is smaller than long On targets where ptrdiff_t is smaller than long, clang crashes when emitting synthesized getters/setters that call objc_[gs]etProperty. Explicitly emit a zext/trunc of the ivar offset value (which is defined to long) to ptrdiff_t, which objc_[gs]etProperty takes. Add a test using the AVR target, where ptrdiff_t is smaller than long. Test failed previously and passes now. Differential Revision: https://reviews.llvm.org/D112049	2022-11-10 02:10:30 -05:00
OCHyams	4b6b2b1a42	Reapply: [Assignment Tracking][7/*] Add assignment tracking functionality to clang Reverted in `98fa95492f`. The Assignment Tracking debug-info feature is outlined in this RFC: https://discourse.llvm.org/t/rfc-assignment-tracking-a-better-way-of-specifying-variable-locations-in-ir This patch plumbs the AssignmentTrackingPass (AKA declare-to-assign), added in the previous patch in this set, into the optimisation pipeline from clang. clang/test/CodeGen/assignment-tracking/assignment-tracking.cpp is the main test for this patch. Note: while clang (with the help of the declare-to-assign pass) can now emit Assignment Tracking metadata, the llvm middle and back ends don't yet understand it. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D132226	2022-11-09 09:28:41 +00:00
Freddy Ye	84a18a260e	[X86] Support -march=sierraforest, grandridge, graniterapids. Reviewed By: skan, pengfei, MaskRay Differential Revision: https://reviews.llvm.org/D137153	2022-11-09 16:56:03 +08:00
David Green	f0e6c403c2	[AArch64] Allow users-facing feature names in clang target attributes D133848 added support for the GCC format of target("..") attributes. The supported formats to match gcc are: // "arch=<arch>" - parsed to features as per -march=.. // "cpu=<cpu>" - parsed to features as per -mcpu=.., with CPU set to <cpu> // "tune=<cpu>" - TuneCPU set to <cpu> // "+feature", "+nofeature" - Add (or remove) feature. We also support the existing formats, previously accepted by clang, for compatibility with the existing code and intrinsics code: // "feature", "no-feature" - Add (or remove) feature. The clang formats would accept and use internal feature names ("fullfp16"/"neon"/"sve") as opposed to the user-facing names ("fp16"/"simd"/"sve"). Usually they use the same names, but they can differ for cases like fp, fullfp16 and mte (among others). This patch makes the clang format also accept the user-facing names, by parsing the features through getArchExtFeature. There is a fallback if the name is not recognized (like "fullfp16"), where we add the existing string, which should then be checked later for consistency. This allows the internal names to be used as before, so long as they are recognized as internal names. (Note that we currently don't have an implementation of isValidFeatureName. The backend will currently give an error like "'-sid' is not a recognized feature for this target (ignoring feature)." This should be improved in a later patch once an implementation of isValidFeatureName in clang is present). Differential Revision: https://reviews.llvm.org/D137617	2022-11-08 19:30:26 +00:00
OCHyams	98fa95492f	Revert "[Assignment Tracking][7/*] Add assignment tracking functionality to clang" This reverts commit `28f9636edd`. Bot failure: https://lab.llvm.org/buildbot/#/builders/109/builds/50251	2022-11-08 18:43:05 +00:00
OCHyams	28f9636edd	[Assignment Tracking][7/*] Add assignment tracking functionality to clang The Assignment Tracking debug-info feature is outlined in this RFC: https://discourse.llvm.org/t/rfc-assignment-tracking-a-better-way-of-specifying-variable-locations-in-ir This patch plumbs the AssignmentTrackingPass (AKA declare-to-assign), added in the previous patch in this set, into the optimisation pipeline from clang. clang/test/CodeGen/assignment-tracking/assignment-tracking.cpp is the main test for this patch. Note: while clang (with the help of the declare-to-assign pass) can now emit Assignment Tracking metadata, the llvm middle and back ends don't yet understand it. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D132226	2022-11-08 17:49:08 +00:00
Bjorn Pettersson	5f9a82683d	[clang][test] Use opt -passes=<name> instead of opt -name Updated the RUN line in several test cases to use the new PM syntax opt -passes=<pipeline> instead of the deprecated syntax opt -pass1 -pass2 This was not a complete cleanup in clang/test. But just a swipe using some simple search-and-replace. Mainly for RUN lines involving -mem2reg, -instnamer and -early-cse.	2022-11-08 12:15:42 +01:00


7878 Commits