llvm-project

Commit Graph

Author	SHA1	Message	Date
Thomas Lively	fb67f3d969	[WebAssembly] Add prototype relaxed float to int trunc instructions Add i32x4.relaxed_trunc_f32x4_s, i32x4.relaxed_trunc_f32x4_u, i32x4.relaxed_trunc_f64x2_s_zero, i32x4.relaxed_trunc_f64x2_u_zero. These are only exposed as builtins, and require user opt-in. Differential Revision: https://reviews.llvm.org/D112186	2021-10-28 14:01:53 -07:00
Mike Rice	6f9c25167d	[OpenMP] Initial parsing/sema for the 'omp loop' construct Adds basic parsing/sema/serialization support for the #pragma omp loop directive. Differential Revision: https://reviews.llvm.org/D112499	2021-10-28 08:26:43 -07:00
Kai Luo	6ea2431d3f	[clang][compiler-rt][atomics] Add `__c11_atomic_fetch_nand` builtin and support `__atomic_fetch_nand` libcall Add `__c11_atomic_fetch_nand` builtin to language extensions and support `__atomic_fetch_nand` libcall in compiler-rt. Reviewed By: theraven Differential Revision: https://reviews.llvm.org/D112400	2021-10-28 02:18:43 +00:00
Florian Hahn	01870d51b8	[Clang] Add elementwise abs builtin. This patch implements __builtin_elementwise_abs as specified in D111529. Reviewed By: aaron.ballman, scanon Differential Revision: https://reviews.llvm.org/D111986	2021-10-27 21:01:44 +01:00
Nico Weber	c7aaa2efef	[clang] Add range accessor for ObjCAtTryStmt catch_stmts and use it No behavior change. Differential Revision: https://reviews.llvm.org/D112543	2021-10-27 08:57:05 -04:00
Shraiysh Vaishay	9fb52cb3f1	[MLIR][OpenMP] Added omp.atomic.read and omp.atomic.write This patch supports the atomic construct (read and write) following section 2.17.7 of OpenMP 5.0 standard. Also added tests and verifier for the same. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D111992	2021-10-27 14:05:44 +05:30
Uday Bondhugula	9fb9c6b91e	[Clang][NFC] Clang CUDA codegen clean-up Update an instance of dyn_cast -> cast and other NFC clang-tidy fixes for Clang CUDA codegen. Differential Revision: https://reviews.llvm.org/D112284	2021-10-27 11:07:20 +05:30
Florian Hahn	1ef25d28c1	[Clang] Add elementwise min/max builtins. This patch implements __builtin_elementwise_max and __builtin_elementwise_min, as specified in D111529. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D111985	2021-10-26 16:53:40 +01:00
Mike Rice	d8699391a4	[OPENMP51]Initial parsing/sema for append_args clause for 'declare variant' Adds initial parsing and sema for the 'append_args' clause. Note that an AST clause is not created as it instead adds its values to the OMPDeclareVariantAttr. Differential Revision: https://reviews.llvm.org/D111854	2021-10-25 09:38:50 -07:00
Kazu Hirata	16ceb44e62	[clang] Use llvm::{count,count_if,find_if,all_of,none_of} (NFC)	2021-10-25 09:14:45 -07:00
Kazu Hirata	4bd46501c3	Use llvm::any_of and llvm::none_of (NFC)	2021-10-24 17:35:33 -07:00
Duncan P. N. Exon Smith	2410fb4616	Support: Use Expected<T>::moveInto() in a few places These are some usage examples for `Expected<T>::moveInto()`. Differential Revision: https://reviews.llvm.org/D112280	2021-10-22 12:40:10 -07:00
Kazu Hirata	dccfaddc6b	[clang] Use StringRef::contains (NFC)	2021-10-21 08:58:19 -07:00
Yonghong Song	f6811cec84	[DebugInfo] Support typedef with btf_decl_tag attributes Clang patch ([1]) added support for btf_decl_tag attributes with typedef types. This patch added llvm support including dwarf generation. For example, for typedef typedef unsigned * __u __attribute__((btf_decl_tag("tag1"))); __u u; the following shows llvm-dwarfdump result: 0x00000033: DW_TAG_typedef DW_AT_type (0x00000048 "unsigned int *") DW_AT_name ("__u") DW_AT_decl_file ("/home/yhs/work/tests/llvm/btf_tag/t.c") DW_AT_decl_line (1) 0x0000003e: DW_TAG_LLVM_annotation DW_AT_name ("btf_decl_tag") DW_AT_const_value ("tag1") 0x00000047: NULL [1] https://reviews.llvm.org/D110127 Differential Revision: https://reviews.llvm.org/D110129	2021-10-21 08:42:58 -07:00
Aaron Ballman	aad244dfc5	Revert "AddGlobalAnnotations for function with or without function body." This reverts commit `121b2252de`. The following code causes a crash in some circumstances: struct k { ~k() __attribute__((annotate(""))) {} }; void m() { k(); }	2021-10-21 07:08:18 -04:00
Itay Bookstein	08ed216000	[IR] Refactor GlobalIFunc to inherit from GlobalObject, Remove GlobalIndirectSymbol As discussed in: * https://reviews.llvm.org/D94166 * https://lists.llvm.org/pipermail/llvm-dev/2020-September/145031.html The GlobalIndirectSymbol class lost most of its meaning in https://reviews.llvm.org/D109792, which disambiguated getBaseObject (now getAliaseeObject) between GlobalIFunc and everything else. In addition, as long as GlobalIFunc is not a GlobalObject and getAliaseeObject returns GlobalObjects, a GlobalAlias whose aliasee is a GlobalIFunc cannot currently be modeled properly. Creating aliases for GlobalIFuncs does happen in the wild (e.g. glibc). In addition, calling getAliaseeObject on a GlobalIFunc will currently return nullptr, which is undesirable because it should return the object itself for non-aliases. This patch refactors the GlobalIFunc class to inherit directly from GlobalObject, and removes GlobalIndirectSymbol (while inlining the relevant parts into GlobalAlias and GlobalIFunc). This allows for calling getAliaseeObject() on a GlobalIFunc to return the GlobalIFunc itself, making getAliaseeObject() more consistent and enabling alias-to-ifunc to be properly modeled in the IR. I exercised some judgement in the API clients of GlobalIndirectSymbol: some were 'monomorphized' for GlobalAlias and GlobalIFunc, and some remained shared (with the type adapted to become GlobalValue). Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D108872	2021-10-20 10:29:47 -07:00
Zhi An Ng	e1fb13401e	[WebAssembly] Add prototype relaxed float min max instructions Add relaxed. f32x4.min, f32x4.max, f64x2.min, f64x2.max. These are only exposed as builtins, and require user opt-in. Differential Revision: https://reviews.llvm.org/D112146	2021-10-20 09:41:51 -07:00
Arthur Eubanks	063c2f89aa	[clang] Add option to disable -clear-ast-before-backend Some downstream users have plugins that -clear-ast-before-backend may affect. Add an option to opt out. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D112100	2021-10-19 20:51:48 -07:00
Zhi An Ng	2542bfa43a	[WebAssembly] Add prototype relaxed swizzle instructions Add i8x16 relaxed_swizzle instructions. These are only exposed as builtins, and require user opt-in. Differential Revision: https://reviews.llvm.org/D112022	2021-10-19 17:53:04 -07:00
Yuta Saito	1813fde9cc	[WebAssembly] Emit clangast in custom section aligned by 4 bytes Emit __clangast in custom section instead of named data segment to find it while iterating sections. This could be avoided if all data segements (the wasm sense) were represented as their own sections (in the llvm sense). This can be resolved by https://github.com/WebAssembly/tool-conventions/issues/138 And the on-disk hashtable in clangast needs to be aligned by 4 bytes, so add paddings in name length field in custom section header. The length of clangast section name can be represented in 1 byte by leb128, and possible maximum pads are 3 bytes, so the section name length won't be invalid in theory. Fixes https://bugs.llvm.org/show_bug.cgi?id=35928 Differential Revision: https://reviews.llvm.org/D74531	2021-10-19 15:50:08 -07:00
Noah Shutty	e678c51177	[Support][ThinLTO] Move ThinLTO caching to LLVM Support library We would like to move ThinLTO’s battle-tested file caching mechanism to the LLVM Support library so that we can use it elsewhere in LLVM. Patch By: noajshu Differential Revision: https://reviews.llvm.org/D111371	2021-10-18 18:57:25 -07:00
Petr Hosek	8e46e34d24	Revert "[Support][ThinLTO] Move ThinLTO caching to LLVM Support library" This reverts commit `92b8cc52bb` since it broke the gold plugin.	2021-10-18 12:24:05 -07:00
Noah Shutty	92b8cc52bb	[Support][ThinLTO] Move ThinLTO caching to LLVM Support library We would like to move ThinLTO’s battle-tested file caching mechanism to the LLVM Support library so that we can use it elsewhere in LLVM. Patch By: noajshu Differential Revision: https://reviews.llvm.org/D111371	2021-10-18 12:08:49 -07:00
Juneyoung Lee	f193bcc701	Revert D105169 due to the two-stage failure in ASAN This reverts the following commits: `37ca7a795b` `9aa6c72b92` `705387c507` `8ca4b3ef19` `80dba72a66`	2021-10-18 23:52:46 +09:00
Kazu Hirata	d245f2e859	[clang] Use llvm::erase_if (NFC)	2021-10-17 13:50:29 -07:00
Juneyoung Lee	80dba72a66	[Clang/Test]: Rename enable_noundef_analysis to disable-noundef-analysis and turn it off by default Turning on `enable_noundef_analysis` flag allows better codegen by removing freeze instructions. I modified clang by renaming `enable_noundef_analysis` flag to `disable-noundef-analysis` and turning it off by default. Test updates are made as a separate patch: D108453 Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D105169	2021-10-16 12:01:37 +09:00
Zhi An Ng	da07942834	[WebAssembly] Add prototype relaxed laneselect instructions Add i8x16, i16x8, i32x4, i64x2 laneselect instructions. These are only exposed as builtins, and require user opt-in.	2021-10-15 17:45:09 -07:00
Kazu Hirata	6a154e606e	[clang] Use llvm::is_contained (NFC)	2021-10-15 10:07:08 -07:00
Richard Smith	effbf0bdd0	PR52183: Don't emit code for a void-typed constant expression. This is unnecessary in general, and wrong when the expression invokes a consteval function.	2021-10-14 20:55:51 -07:00
Arthur Eubanks	d0a5f61c4f	[clang] Support -clear-ast-before-backend without -disable-free Previously without -disable-free, -clear-ast-before-backend would crash in ~ASTContext() due to various reasons. This works around that by doing a lot of the cleanup ahead of the destructor so that the destructor doesn't actually do any manual cleanup if we've already cleaned up beforehand. This actually does save a measurable amount of memory with -clear-ast-before-backend, although at an almost unnoticeable runtime cost: https://llvm-compile-time-tracker.com/compare.php?from=5d755b32f2775b9219f6d6e2feda5e1417dc993b&to=58ef1c7ad7e2ad45f9c97597905a8cf05a26258c&stat=max-rss Previously we weren't doing any cleanup with -disable-free, so I tried measuring the impact of always doing the cleanup and didn't measure anything noticeable on llvm-compile-time-tracker. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D111767	2021-10-14 13:43:53 -07:00
Aaron Ballman	68157fe15b	Fix a crash on valid consteval code. Not all constants are emitted within the context of a function, so use the module's ASTContext instead because 1) that's the same as the current function ASTContext, and 2) the module can never be null. Fixes PR50787.	2021-10-14 15:48:10 -04:00
Mike Rice	fb4c451001	[OPENMP51]Initial parsing/sema for adjust_args clause for 'declare variant' Adds initial parsing and sema for the 'adjust_args' clause. Note that an AST clause is not created as it instead adds its expressions to the OMPDeclareVariantAttr. Differential Revision: https://reviews.llvm.org/D99905	2021-10-13 09:34:09 -07:00
Hsiangkai Wang	5158cfef8b	[RISCV] After reverting _mt builtins, add `ta` argument for LLVM IR. Previous patch only reverts C builtins for tail policy. In order to keep LLVM IR intact, add the `ta` argument in vector builtins.	2021-10-13 19:41:49 +08:00
Hsiangkai Wang	ff3ed78304	Revert "[RISCV] Define _m intrinsics as builtins, instead of macros." This reverts commit `97f0c63783`. As discussed in https://reviews.llvm.org/D110684, it increased the compile time and the binary size of clang more than 1%. I reverted this patch first to think about a better way to do it.	2021-10-13 12:21:51 +08:00
Arthur Eubanks	b6a8c69554	[NFC] Rename EmitAssemblyHelper new/legacy PM methods To reflect the fact that the new PM is the default now. Differential Revision: https://reviews.llvm.org/D111680	2021-10-12 15:41:44 -07:00
Arthur Eubanks	2cadef6537	[clang] Teardown new PM data structures before running codegen pipeline Do this by refactoring the optimization and codegen pipelines into separate functions. This saves a tiny bit of memory in non-LTO builds [1]. [1] https://llvm-compile-time-tracker.com/compare.php?from=fbddf22ef72d3c2e9b14e1501841b03380eef12b&to=cd276df52eb6f2b84a8e1efe5318460c6debf82d&stat=max-rss Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D111582	2021-10-12 14:17:11 -07:00
Kazu Hirata	57b40b5f34	[AST, CodeGen, Driver] Use llvm::is_contained (NFC)	2021-10-12 09:19:49 -07:00
Nathan Sidwell	dcd74716f9	[clang] p0388 conversion to incomplete array This implements the new implicit conversion sequence to an incomplete (unbounded) array type. It is mostly Richard Smith's work, updated to trunk, testcases added and a few bugs fixed found in such testing. It is not a complete implementation of p0388. Differential Revision: https://reviews.llvm.org/D102645	2021-10-12 07:35:20 -07:00
Yonghong Song	a162b67c98	[Clang][Attr] rename btf_tag to btf_decl_tag Current btf_tag is applied to declaration only. Per discussion in https://reviews.llvm.org/D111199, we plan to introduce btf_type_tag attribute for types. So rename btf_tag to btf_decl_tag to make it easily differentiable from btf_type_tag. Differential Revision: https://reviews.llvm.org/D111588	2021-10-11 22:17:17 -07:00
hsmahesha	db9c2d7751	[CFE][Codegen] Remove CodeGenFunction::InitTempAlloca() Sequel patch to https://reviews.llvm.org/D111316 Finally, remove the defintion of CodeGenFunction::InitTempAlloca(). Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D111324	2021-10-12 10:04:15 +05:30
hsmahesha	f7de6962c8	[CFE][Codegen][In-progress] Remove CodeGenFunction::InitTempAlloca() Sequel patch to https://reviews.llvm.org/D111293. Remove call to CodeGenFunction::InitTempAlloca() from OpenMP related codegen part. Also remove the metadata `!llvm.access.group` from the updated lit tests. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D111316	2021-10-12 10:01:46 +05:30
Hsiangkai Wang	97f0c63783	[RISCV] Define _m intrinsics as builtins, instead of macros. In the original design, we levarage _mt intrinsics to define macros for _m intrinsics. Such as, ``` __builtin_rvv_vadd_vv_i8m1_mt((vbool8_t)(op0), (vint8m1_t)(op1), (vint8m1_t)(op2), (vint8m1_t)(op3), (size_t)(op4), (size_t)VE_TAIL_AGNOSTIC) ``` However, we could not define generic interface for mask intrinsics any more due to clang_builtin_alias only accepts clang builtins as its argument. In the example, ``` __rvv_overloaded __attribute__((clang_builtin_alias(__builtin_rvv_vadd_vv_i8m1_mt))) vint8m1_t vadd(vbool8_t op0, vint8m1_t op1, vint8m1_t op2, vint8m1_t op3, size_t op4, size_t op5); ``` op5 is the tail policy argument. When users want to use vadd generic interface for masked vector add, they need to specify tail policy in the previous design. In this patch, we define _m intrinsics as clang builtins to solve the problem. Differential Revision: https://reviews.llvm.org/D110684	2021-10-12 10:47:55 +08:00
Chris Bieneman	121b2252de	AddGlobalAnnotations for function with or without function body. When AnnotateAttr is on a function, AddGlobalAnnotations is only called in CodeGenModule::EmitGlobalFunctionDefinition which means AnnotateAttr on function declaration without function body will be ignored. The patch will move AddGlobalAnnotations to CodeGenModule::SetFunctionAttributes, so with or without function body, the AnnotateAttr will get code gen for a function. It'll help case when AnnotateAttr is on external function, and the AnnotateAttr will be consumed in IR level. For example, a pass to collect num of uses for functions with __attribute((annotate("count_use"))) after optimizations, As long as there's __attribute((annotate("count_use"))), function with or without function body should be counted. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D111109 Patch by: python3kgae (Xiang Li)	2021-10-11 14:50:34 -05:00
hsmahesha	0481682996	[CFE][Codegen][In-progress] Remove CodeGenFunction::InitTempAlloca() CodeGenFunction::InitTempAlloca() inits the static alloca within the entry block which may not necessarily be correct always. For example, the current instruction insertion point (pointed by the instruction builder) could be a program point which is hit multiple times during the program execution, and it is expected that the static alloca is initialized every time the program point is hit. Hence remove CodeGenFunction::InitTempAlloca(), and initialize the static alloca where the instruction insertion point is at the moment. This patch, as a starting attempt, removes the calls to CodeGenFunction::InitTempAlloca() which do not have any side effect on the lit tests. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D111293	2021-10-09 09:23:14 +05:30
Richard Smith	7eae8c6e62	Don't update the vptr at the start of the destructor of a final class. In this case, we know statically that we're destroying the most-derived class, so the vptr must already point to the current class and never needs to be updated.	2021-10-08 19:59:42 -07:00
Qiu Chaofan	8a714722e2	[NFC] [Clang] Use global enum for explicit float mode Currently, there're multiple float types that can be represented by __attribute__((mode(xx))). It's parsed, and then a corresponding type is created if available. This refactor moves the enum for mode into a global enum class visible to ASTContext. Reviewed By: aaron.ballman, erichkeane Differential Revision: https://reviews.llvm.org/D111391	2021-10-09 10:39:10 +08:00
Joseph Huber	bad44d5f39	[OpenMP] Add RTL function for getting number of threads in block. This patch adds support for the `__kmpc_get_hardware_num_threads_in_block` function that returns the number of threads. This was missing in the new runtime and was used by the AMDGPU plugin which prevented it from using the new runtime. This patchs also unified the interface for getting the thread numbers in the frontend. Originally authored by jdoerfert. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D111475	2021-10-08 22:21:59 -04:00
Richard Smith	222305d6ff	PR51079: Treat thread_local variables with an incomplete class type as being not trivially destructible when determining if we can skip calling their thread wrapper function.	2021-10-08 18:46:01 -07:00
Reid Kleckner	89b57061f7	Move TargetRegistry.(h\|cpp) from Support to MC This moves the registry higher in the LLVM library dependency stack. Every client of the target registry needs to link against MC anyway to actually use the target, so we might as well move this out of Support. This allows us to ensure that Support doesn't have includes from MC/*. Differential Revision: https://reviews.llvm.org/D111454	2021-10-08 14:51:48 -07:00
Arthur Eubanks	a6891d2104	[clang] Set max allowed alignment to 2^32 Followup to D110451 which set LLVM's max allowed alignment to 2^32. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D111250	2021-10-08 11:44:15 -07:00
Masoud Ataei	b0f68791f0	[clang] Option control afn flag Clang option to set/unset afn fast-math flag. Differential: https://reviews.llvm.org/D106191 Reviewd with: aaron.ballman, erichkeane, and others	2021-10-08 14:26:14 -04:00
Keith Smiley	68e49aea9a	Revert "[clang] Fix absolute file paths with -fdebug-prefix-map" This reverts commit `a23a596793`. This broke a windows test https://buildkite.com/llvm-project/premerge-checks/builds/59492#7dad207c-6cbe-40ad-95e4-c48b47fe2527 Differential Revision: https://reviews.llvm.org/D111444	2021-10-08 10:39:44 -07:00
Keith Smiley	a23a596793	[clang] Fix absolute file paths with -fdebug-prefix-map Previously if you passed an absolute path to clang, where only part of the path to the file was remapped, it would result in the file's DIFile being stored with a duplicate path, for example: ``` !DIFile(filename: "./ios/Sources/bar.c", directory: "./ios/Sources") ``` This change handles absolute paths, specifically in the case they are remapped to something relative, and uses the dirname for the directory, and basename for the filename. This also adds a test verifying this behavior for more standard uses as well. Differential Revision: https://reviews.llvm.org/D111352	2021-10-08 10:35:17 -07:00
John McCall	5ab6ee7599	Fix a variety of bugs with nil-receiver checks when targeting non-Darwin ObjC runtimes: - Use the same logic the Darwin runtime does for inferring that a receiver is non-null and therefore doesn't require null checks. Previously we weren't skipping these for non-super dispatch. - Emit a null check when there's a consumed parameter so that we can destroy the argument if the call doesn't happen. This mostly involves extracting some common logic from the Darwin-runtime code. - Generate a zero aggregate by zeroing the same memory that was used in the method call instead of zeroing separate memory and then merging them with a phi. This uses less memory and avoids unnecessary copies. - Emit zero initialization, and generate zero values in phis, using the proper zero-value routines instead of assuming that the zero value of the result type has a bitwise-zero representation.	2021-10-08 05:44:06 -04:00
Wang, Pengfei	c0f9c7c015	[X86] Check if struct is blank before getting the inner types This fixes pr52011. Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D111037	2021-10-08 17:09:34 +08:00
Joseph Huber	9efdca87c7	[OpenMP] Introduce new flags to assert thread and team usage in the runtime This patch adds two flags to be supported for the new runtime. The flags are `-fopenmp-assume-threads-oversubscription` and -fopenmp-assume-teams-oversubscription`. These add global values that can be checked by the work sharing runtime functions to make better judgements about how to distribute work between the threads. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D111348	2021-10-07 22:23:09 -04:00
Itay Bookstein	40ec1c0f16	[IR][NFC] Rename getBaseObject to getAliaseeObject To better reflect the meaning of the now-disambiguated {GlobalValue, GlobalAlias}::getBaseObject after breaking off GlobalIFunc::getResolverFunction (D109792), the function is renamed to getAliaseeObject.	2021-10-06 19:33:10 -07:00
David Blaikie	f6a561c4d6	DebugInfo: Use clang's preferred names for integer types This reverts `c7f16ab3e3` / r109694 - which suggested this was done to improve consistency with the gdb test suite. Possible that at the time GCC did not canonicalize integer types, and so matching types was important for cross-compiler validity, or that it was only a case of over-constrained test cases that printed out/tested the exact names of integer types. In any case neither issue seems to exist today based on my limited testing - both gdb and lldb canonicalize integer types (in a way that happens to match Clang's preferred naming, incidentally) and so never print the original text name produced in the DWARF by GCC or Clang. This canonicalization appears to be in `integer_types_same_name_p` for GDB and in `TypeSystemClang::GetBasicTypeEnumeration` for lldb. (I tested this with one translation unit defining 3 variables - `long`, `long ()()`, and `int ()()`, and another translation unit that had main, and a function that took `long ()()` as a parameter - then compiled them with mismatched compilers (either GCC+Clang, or Clang+(Clang with this patch applied)) and no matter the combination, despite the debug info for one CU naming the type "long int" and the other naming it "long", both debuggers printed out the name as "long" and were able to correctly perform overload resolution and pass the `long int ()()` variable to the `long (*)()` function parameter) Did find one hiccup, identified by the lldb test suite - that CodeView was relying on these names to map them to builtin types in that format. So added some handling for that in LLVM. (these could be split out into separate patches, but seems small enough to not warrant it - will do that if there ends up needing any reverti/revisiting) Differential Revision: https://reviews.llvm.org/D110455	2021-10-06 16:02:34 -07:00
Jennifer Yu	a4743eba3c	Fix assert of "Unable to find base lambda address" from adjustMemberOfForLambdaCaptures. The problem is happening when user passes lambda function with reference type in the map clause. The natural of the problem when processing generateInfoForCapture, the BasePointer is generated with new load for a lambda variable with reference type. It is not expected in adjustMemberOfForLambdaCaptures. One way to fix this is to skipping call to generateInfoForCapture for map(to:lambda). The map info will be generated later in the call to generateDefaultMapInfo samiler as firsprivate clase. This to fix https://bugs.llvm.org/show_bug.cgi?id=52071 Differential Revision:https://reviews.llvm.org/D111115	2021-10-06 14:14:28 -07:00
Arthur Eubanks	6522b7cc32	[clang] Add option to clear AST memory before running LLVM passes This is to save memory for Clang compiles. Measuring building PassBuilder.cpp under /usr/bin/time, max rss goes from 0.93GB to 0.7GB. This does not turn it by default yet. I've turned on the option locally and run it over a good amount of files without any issues. For more background, see https://lists.llvm.org/pipermail/cfe-dev/2021-September/068930.html. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D111105	2021-10-06 13:42:22 -07:00
Arthur Eubanks	05392466f0	Reland [IR] Increase max alignment to 4GB Currently the max alignment representable is 1GB, see D108661. Setting the align of an object to 4GB is desirable in some cases to make sure the lower 32 bits are clear which can be used for some optimizations, e.g. https://crbug.com/1016945. This uses an extra bit in instructions that carry an alignment. We can store 15 bits of "free" information, and with this change some instructions (e.g. AtomicCmpXchgInst) use 14 bits. We can increase the max alignment representable above 4GB (up to 2^62) since we're only using 33 of the 64 values, but I've just limited it to 4GB for now. The one place we have to update the bitcode format is for the alloca instruction. It stores its alignment into 5 bits of a 32 bit bitfield. I've added another field which is 8 bits and should be future proof for a while. For backward compatibility, we check if the old field has a value and use that, otherwise use the new field. Updating clang's max allowed alignment will come in a future patch. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D110451	2021-10-06 13:29:23 -07:00
Arthur Eubanks	569346f274	Revert "Reland [IR] Increase max alignment to 4GB" This reverts commit `8d64314ffe`.	2021-10-06 11:38:11 -07:00
Arthur Eubanks	8d64314ffe	Reland [IR] Increase max alignment to 4GB Currently the max alignment representable is 1GB, see D108661. Setting the align of an object to 4GB is desirable in some cases to make sure the lower 32 bits are clear which can be used for some optimizations, e.g. https://crbug.com/1016945. This uses an extra bit in instructions that carry an alignment. We can store 15 bits of "free" information, and with this change some instructions (e.g. AtomicCmpXchgInst) use 14 bits. We can increase the max alignment representable above 4GB (up to 2^62) since we're only using 33 of the 64 values, but I've just limited it to 4GB for now. The one place we have to update the bitcode format is for the alloca instruction. It stores its alignment into 5 bits of a 32 bit bitfield. I've added another field which is 8 bits and should be future proof for a while. For backward compatibility, we check if the old field has a value and use that, otherwise use the new field. Updating clang's max allowed alignment will come in a future patch. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D110451	2021-10-06 11:03:51 -07:00
Arthur Eubanks	72cf8b6044	Revert "[IR] Increase max alignment to 4GB" This reverts commit `df84c1fe78`. Breaks some bots	2021-10-06 10:21:35 -07:00
Arthur Eubanks	df84c1fe78	[IR] Increase max alignment to 4GB Currently the max alignment representable is 1GB, see D108661. Setting the align of an object to 4GB is desirable in some cases to make sure the lower 32 bits are clear which can be used for some optimizations, e.g. https://crbug.com/1016945. This uses an extra bit in instructions that carry an alignment. We can store 15 bits of "free" information, and with this change some instructions (e.g. AtomicCmpXchgInst) use 14 bits. We can increase the max alignment representable above 4GB (up to 2^62) since we're only using 33 of the 64 values, but I've just limited it to 4GB for now. The one place we have to update the bitcode format is for the alloca instruction. It stores its alignment into 5 bits of a 32 bit bitfield. I've added another field which is 8 bits and should be future proof for a while. For backward compatibility, we check if the old field has a value and use that, otherwise use the new field. Updating clang's max allowed alignment will come in a future patch. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D110451	2021-10-06 09:54:14 -07:00
Michael Kruse	f37e8b0b83	[Clang][OpenMP] Infix OMPLoopTransformationDirective abstract class. NFC. Insert OMPLoopTransformationDirective between OMPLoopBasedDirective and the loop transformations OMPTileDirective and OMPUnrollDirective. This simplifies handling of loop transformations not requiring distinguishing between OMPTileDirective and OMPUnrollDirective anymore. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D111119	2021-10-06 10:49:07 -05:00
Simon Pilgrim	b9b90bb542	[clang] Replace report_fatal_error(std::string) uses with report_fatal_error(Twine) As described on D111049, we're trying to remove the <string> dependency from error handling and replace uses of report_fatal_error(const std::string&) with the Twine() variant which can be forward declared.	2021-10-06 11:43:19 +01:00
Corentin Jabot	424733c12a	Implement if consteval (P1938) Modify the IfStmt node to suppoort constant evaluated expressions. Add a new ExpressionEvaluationContext::ImmediateFunctionContext to keep track of immediate function contexts. This proved easier/better/probably more efficient than walking the AST backward as it allows diagnosing nested if consteval statements.	2021-10-05 08:04:14 -04:00
Arthur Eubanks	2568286892	[clang] Don't use the AST to display backend diagnostics We keep a map from function name to source location so we don't have to do it via looking up a source location from the AST. However, since function names can be long, we actually use a hash of the function name as the key. Additionally, we can't rely on Clang's printing of function names via the AST, so we just demangle the name instead. This is necessary to implement https://lists.llvm.org/pipermail/cfe-dev/2021-September/068930.html. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D110665	2021-10-04 14:14:32 -07:00
serge-sans-paille	0f0e31cf51	Update inline builtin handling to honor gnu inline attribute Per the GCC info page: If the function is declared 'extern', then this definition of the function is used only for inlining. In no case is the function compiled as a standalone function, not even if you take its address explicitly. Such an address becomes an external reference, as if you had only declared the function, and had not defined it. Respect that behavior for inline builtins: keep the original definition, and generate a copy of the declaration suffixed by '.inline' that's only referenced in direct call. This fixes holes in `c3717b6858`. Differential Revision: https://reviews.llvm.org/D111009	2021-10-04 22:26:25 +02:00
Alexey Bataev	bfc8f9e9b0	[clang] Fix computation of number of dependencies using OpenMP iterator, by Raul Penacoba. The size of kmp_depend_info and the number of dependencies are computed multiplying the iterator sizes, which not right. Now size is computed as: itersize1numclausedeps1 + itersize2numclausedeps2 + ... + itersizeN*numclausedepsN where itersizeX is the size of the iterator and numclausedepsX the number of dependencies in that depend clause. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D111045	2021-10-04 07:06:51 -07:00
Stefan Pintilie	4fc2f4979c	[PowerPC] Fix __builtin_ppc_load2r to return short instead of int. This patch fixes the return value of the builtin __builtin_ppc_load2r to correctly return short instead of int. Reviewed By: nemanjai, #powerpc Differential Revision: https://reviews.llvm.org/D110771	2021-10-04 06:17:02 -05:00
Jay Foad	d933adeaca	[APInt] Stop using soft-deprecated constructors and methods in clang. NFC. Stop using APInt constructors and methods that were soft-deprecated in D109483. This fixes all the uses I found in clang. Differential Revision: https://reviews.llvm.org/D110808	2021-10-04 09:38:11 +01:00
Dávid Bolvanský	b1fcca3884	Fixed warnings in LLVM produced by -Wbitwise-instead-of-logical	2021-10-03 13:04:18 +02:00
Joseph Huber	d12502a3ab	[OpenMP] Apply OpenMP assumptions to applicable call sites This patch adds OpenMP assumption attributes to call sites in applicable regions. Currently this applies the caller's assumption attributes to any calls contained within it. So, if a call occurs inside an OpenMP assumes region to a function outside that region, we will assume that call respects the assumptions. This is primarily useful for inline assembly calls used heavily in the OpenMP GPU device runtime, which allows us to then make judgements about what the ASM will do. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D110655	2021-09-29 16:08:21 -04:00
Quinn Pham	67a3d1e275	[PowerPC] swdiv builtins for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch implements the software divide builtin as wrappers for a floating point divide. XL provided these builtins because it didn't produce software estimates by default at `-Ofast`. When compiled with `-Ofast` these builtins will produce the software estimate for divide. Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D106959	2021-09-29 11:31:07 -05:00
Sven van Haastregt	4da744a20f	[OpenCL] Fix as_type3 invalid store creation With -fpreserve-vec3-type enabled, a cast was not created when converting from a non-vec3 type to a vec3 type, even though a conversion to vec3 was performed. This resulted in creation of invalid store instructions. Differential Revision: https://reviews.llvm.org/D108470	2021-09-29 09:40:06 +01:00
Arthur Eubanks	aa53785f23	Reland [clang] Rework dontcall attributes To avoid using the AST when emitting diagnostics, split the "dontcall" attribute into "dontcall-warn" and "dontcall-error", and also add the frontend attribute value as the LLVM attribute value. This gives us all the information to report diagnostics we need from within the IR (aside from access to the original source). One downside is we directly use LLVM's demangler rather than using the existing Clang diagnostic pretty printing of symbols. Previous revisions didn't properly declare the new dependencies. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D110364	2021-09-28 15:31:30 -07:00
Arthur Eubanks	7833d20f1f	Revert "[clang] Rework dontcall attributes" This reverts commit `2943071e2e`. Breaks bots	2021-09-28 14:49:27 -07:00
Arthur Eubanks	2943071e2e	[clang] Rework dontcall attributes To avoid using the AST when emitting diagnostics, split the "dontcall" attribute into "dontcall-warn" and "dontcall-error", and also add the frontend attribute value as the LLVM attribute value. This gives us all the information to report diagnostics we need from within the IR (aside from access to the original source). One downside is we directly use LLVM's demangler rather than using the existing Clang diagnostic pretty printing of symbols. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D110364	2021-09-28 14:21:10 -07:00
serge-sans-paille	c3717b6858	Simplify handling of builtin with inline redefinition (This is a recommit of `3d6f49a569` that should no longer break validation since `bd379915de`). It is a common practice in glibc header to provide an inline redefinition of an existing function. It is especially the case for fortified function. Clang currently has an imperfect approach to the problem, using a combination of trivially recursive function detection and noinline attribute. Simplify the logic by suffixing these functions by `.inline` during codegen, so that they are not recognized as builtin by llvm. After that patch, clang passes all tests from https://github.com/serge-sans-paille/fortify-test-suite Differential Revision: https://reviews.llvm.org/D109967	2021-09-28 21:00:47 +02:00
Kevin Athey	0d76d4833d	Revert "Simplify handling of builtin with inline redefinition" This reverts commit `3d6f49a569`. Broke bot: https://lab.llvm.org/buildbot/#/builders/5/builds/12360	2021-09-28 11:30:37 -07:00
David Blaikie	85f612efeb	DebugInfo: Use sugared function type when emitting function declarations for call sites Otherwise we're losing type information for these functions.	2021-09-28 10:44:35 -07:00
Quinn Pham	70391b3468	[PowerPC] FP compare and test XL compat builtins. This patch is in a series of patches to provide builtins for compatability with the XL compiler. This patch adds builtins for compare exponent and test data class operations on floating point values. Reviewed By: #powerpc, lei Differential Revision: https://reviews.llvm.org/D109437	2021-09-28 11:01:51 -05:00
serge-sans-paille	3d6f49a569	Simplify handling of builtin with inline redefinition It is a common practice in glibc header to provide an inline redefinition of an existing function. It is especially the case for fortified function. Clang currently has an imperfect approach to the problem, using a combination of trivially recursive function detection and noinline attribute. Simplify the logic by suffixing these functions by `.inline` during codegen, so that they are not recognized as builtin by llvm. After that patch, clang passes all tests from https://github.com/serge-sans-paille/fortify-test-suite Differential Revision: https://reviews.llvm.org/D109967	2021-09-28 13:24:25 +02:00
Ahsan Saghir	593b074a09	[PowerPC] MMA - Add __builtin_vsx_build_pair and __builtin_mma_build_acc builtins This patch adds the following built-ins: __builtin_vsx_build_pair __builtin_mma_build_acc Reviewed By: #powerpc, nemanjai, lei Differential Revision: https://reviews.llvm.org/D107647	2021-09-27 19:51:28 -05:00
Joseph Huber	b4a5543624	[OpenMP] Introduce a new worksharing RTL function for distribute This patch adds a new RTL function for worksharing. Currently we use `__kmpc_for_static_init` for both the `distribute` and `parallel` portion of the loop clause. This patch replaces the `distribute` portion with a new runtime call `__kmpc_distribute_static_init`. Currently this will be used exactly the same way, but will make it easier in the future to fine-tune the distribute and parallel portion of the loop. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D110429	2021-09-27 11:36:37 -04:00
Wang, Pengfei	7d6889964a	[X86][FP16] Add more builtins to avoid multi evaluation problems & add 2 missed intrinsics Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D110336	2021-09-27 09:27:04 +08:00
David Blaikie	8d9ddd4f50	DebugInfo: STN: Handle unreconstitutable types in function types	2021-09-23 21:13:16 -07:00
David Blaikie	25ac0d3c73	DebugInfo: Implement the -gsimple-template-names functionality This excludes certain names that can't be rebuilt from the available DWARF: * Atomic types - no DWARF differentiating int from atomic int. * Vector types - enough DWARF (an attribute on the array type) to do this, but I haven't written the extra code to add the attributes required for this * Lambdas - ambiguous with any other unnamed class * Unnamed classes/enums - would need column info for the type in addition to file/line number * noexcept function types - not encoded in DWARF	2021-09-23 19:58:32 -07:00
Thomas Lively	2f519825ba	[WebAssembly] Add prototype relaxed SIMD fma/fms instructions Add experimental clang builtins, LLVM intrinsics, and backend definitions for the new {f32x4,f64x2}.{fma,fms} instructions in the relaxed SIMD proposal: https://github.com/WebAssembly/relaxed-simd/blob/main/proposals/relaxed-simd/Overview.md. Do not allow these instructions to be selected without explicit user opt-in. Differential Revision: https://reviews.llvm.org/D110295	2021-09-23 11:01:36 -07:00
Hongtao Yu	d9b511d8e8	[CSSPGO] Set PseudoProbeInserter as a default pass. Currenlty PseudoProbeInserter is a pass conditioned on a target switch. It works well with a single clang invocation. It doesn't work so well when the backend is called separately (i.e, through the linker or llc), where user has always to pass -pseudo-probe-for-profiling explictly. I'm making the pass a default pass that requires no command line arg to trigger, but will be actually run depending on whether the CU comes with `llvm.pseudo_probe_desc` metadata. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D110209	2021-09-22 09:09:48 -07:00
Shilei Tian	ca999f7191	[OpenMP][Offloading] Use bitset to indicate execution mode instead of value The execution mode of a kernel is stored in a global variable, whose value means: - 0 - SPMD mode - 1 - indicates generic mode - 2 - SPMD mode execution with generic mode semantics We are going to add support for SIMD execution mode. It will be come with another execution mode, such as SIMD-generic mode. As a result, this value-based indicator is not flexible. This patch changes to bitset based solution to encode execution mode. Each position is: [0] - generic mode [1] - SPMD mode [2] - SIMD mode (will be added later) In this way, `0x1` is generic mode, `0x2` is SPMD mode, and `0x3` is SPMD mode execution with generic mode semantics. In the future after we add the support for SIMD mode, `0b1xx` will be in SIMD mode. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D110029	2021-09-22 11:40:52 -04:00
Florian Hahn	ea21d688dc	[Matrix] Emit assumption that matrix indices are valid. The matrix extension requires the indices for matrix subscript expression to be valid and it is UB otherwise. extract/insertelement produce poison if the index is invalid, which limits the optimizer to not be bale to scalarize load/extract pairs for example, which causes very suboptimal code to be generated when using matrix subscript expressions with variable indices for large matrixes. This patch updates IRGen to emit assumes to for index expression to convey the information that the index must be valid. This also adjusts the order in which operations are emitted slightly, so indices & assumes are added before the load of the matrix value. Reviewed By: erichkeane Differential Revision: https://reviews.llvm.org/D102478	2021-09-22 12:27:37 +01:00
David Blaikie	2ff049b12e	DebugInfo: Don't use preferred template names in debug info Using the preferred name creates a mismatch between the textual name of a type and the DWARF tags describing the parameters as well as possible inconsistency between DWARF producers (like Clang and GCC, or older/newer Clang versions, etc).	2021-09-21 20:08:16 -07:00
David Blaikie	db6f1e8a88	DebugInfo: Don't suppress inline namespaces when printing template template parameter names	2021-09-21 19:30:13 -07:00
David Blaikie	d31dfc3011	DebugInfo: Unify some printing policy adjustments	2021-09-21 19:30:12 -07:00
Giorgis Georgakoudis	ac90dfc43a	Revert "[OpenMP] Codegen aggregate for outlined function captures" This reverts commit `1d66649adf`. Revert to fix AMG GPU issue.	2021-09-21 13:20:39 -07:00
Matheus Izvekov	d9308aa39b	[clang] don't mark as Elidable CXXConstruct expressions used in NRVO See PR51862. The consumers of the Elidable flag in CXXConstructExpr assume that an elidable construction just goes through a single copy/move construction, so that the source object is immediately passed as an argument and is the same type as the parameter itself. With the implementation of P2266 and after some adjustments to the implementation of P1825, we started (correctly, as per standard) allowing more cases where the copy initialization goes through user defined conversions. With this patch we stop using this flag in NRVO contexts, to preserve code that relies on that assumption. This causes no known functional changes, we just stop firing some asserts in a cople of included test cases. Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D109800	2021-09-21 21:41:20 +02:00
Giorgis Georgakoudis	1d66649adf	[OpenMP] Codegen aggregate for outlined function captures Parallel regions are outlined as functions with capture variables explicitly generated as distinct parameters in the function's argument list. That complicates the fork_call interface in the OpenMP runtime: (1) the fork_call is variadic since there is a variable number of arguments to forward to the outlined function, (2) wrapping/unwrapping arguments happens in the OpenMP runtime, which is sub-optimal, has been a source of ABI bugs, and has a hardcoded limit (16) in the number of arguments, (3) forwarded arguments must cast to pointer types, which complicates debugging. This patch avoids those issues by aggregating captured arguments in a struct to pass to the fork_call. Reviewed By: jdoerfert, jhuber6 Differential Revision: https://reviews.llvm.org/D102107	2021-09-21 10:50:04 -07:00

1 2 3 4 5 ...

14730 Commits