llvm-project

Commit Graph

Author	SHA1	Message	Date
Yaxun Liu	e1bfbc589f	[HIP] Support -fcuda-flush-denormals-to-zero for amdgcn Differential Revision: https://reviews.llvm.org/D48287 llvm-svn: 337639	2018-07-21 02:02:22 +00:00
Manoj Gupta	da08f6ac16	[clang]: Add support for "-fno-delete-null-pointer-checks" Summary: Support for this option is needed for building Linux kernel. This is a very frequently requested feature by kernel developers. More details : https://lkml.org/lkml/2018/4/4/601 GCC option description for -fdelete-null-pointer-checks: This Assume that programs cannot safely dereference null pointers, and that no code or data element resides at address zero. -fno-delete-null-pointer-checks is the inverse of this implying that null pointer dereferencing is not undefined. This feature is implemented in as the function attribute "null-pointer-is-valid"="true". This CL only adds the attribute on the function. It also strips "nonnull" attributes from function arguments but keeps the related warnings unchanged. Corresponding LLVM change rL336613 already updated the optimizations to not treat null pointer dereferencing as undefined if the attribute is present. Reviewers: t.p.northover, efriedma, jyknight, chandlerc, rnk, srhines, void, george.burgess.iv Reviewed By: jyknight Subscribers: drinkcat, xbolva00, cfe-commits Differential Revision: https://reviews.llvm.org/D47894 llvm-svn: 337433	2018-07-19 00:44:52 +00:00
Yaxun Liu	aefdb8ed34	[NFC] Add CreateMemTempWithoutCast and CreateTempAllocaWithoutCast This is partial re-commit of r332982 llvm-svn: 334837	2018-06-15 15:33:22 +00:00
Yaxun Liu	6c10a66ec7	[CUDA][HIP] Set kernel calling convention before arrange function Currently clang set kernel calling convention for CUDA/HIP after arranging function, which causes incorrect kernel function type since it depends on calling convention. This patch moves setting kernel convention before arranging function. Differential Revision: https://reviews.llvm.org/D47733 llvm-svn: 334457	2018-06-12 00:16:33 +00:00
David Blaikie	181a61307b	Update for an LLVM header file move llvm-svn: 333955	2018-06-04 21:23:29 +00:00
Yaxun Liu	00ddbed298	Revert r332982 Call CreateTempMemWithoutCast for ActiveFlag Due to regression on arm. llvm-svn: 332991	2018-05-22 16:13:07 +00:00
Yaxun Liu	8a60e5db70	Call CreateTempMemWithoutCast for ActiveFlag Introduced CreateMemTempWithoutCast and CreateTemporaryAllocaWithoutCast to emit alloca without casting to default addr space. ActiveFlag is a temporary variable emitted for clean up. It is defined as AllocaInst* type and there is a cast to AlllocaInst in SetActiveFlag. An alloca casted to generic pointer causes assertion in SetActiveFlag. Since there is only load/store of ActiveFlag, it is safe to use the original alloca, therefore use CreateMemTempWithoutCast is called. Differential Revision: https://reviews.llvm.org/D47099 llvm-svn: 332982	2018-05-22 14:36:26 +00:00
Yaxun Liu	a2a9cfab83	CodeGen: Fix invalid bitcast for lifetime.start/end lifetime.start/end expects pointer argument in alloca address space. However in C++ a temporary variable is in default address space. This patch changes API CreateMemTemp and CreateTempAlloca to get the original alloca instruction and pass it lifetime.start/end. It only affects targets with non-zero alloca address space. Differential Revision: https://reviews.llvm.org/D45900 llvm-svn: 332593	2018-05-17 11:16:35 +00:00
Akira Hatanaka	852829792b	Address post-commit review comments after r328731. NFC. - Define a function (canPassInRegisters) that determines whether a record can be passed in registers based on language rules and target-specific ABI rules. - Set flag RecordDecl::ParamDestroyedInCallee to true in MSVC mode and remove ASTContext::isParamDestroyedInCallee, which is no longer needed. - Use the same type (unsigned) for RecordDecl's bit-field members. For more background, see the following discussions that took place on cfe-commits. http://lists.llvm.org/pipermail/cfe-commits/Week-of-Mon-20180326/223498.html http://lists.llvm.org/pipermail/cfe-commits/Week-of-Mon-20180402/223688.html http://lists.llvm.org/pipermail/cfe-commits/Week-of-Mon-20180409/224754.html http://lists.llvm.org/pipermail/cfe-commits/Week-of-Mon-20180423/226494.html http://lists.llvm.org/pipermail/cfe-commits/Week-of-Mon-20180507/227647.html llvm-svn: 332397	2018-05-15 21:00:30 +00:00
Richard Smith	eaf11ad709	Track the result of evaluating a computed noexcept specification on the FunctionProtoType. We previously re-evaluated the expression each time we wanted to know whether the type is noexcept or not. We now evaluate the expression exactly once. This is not quite "no functional change": it fixes a crasher bug during AST deserialization where we would try to evaluate the noexcept specification in a situation where we have not deserialized sufficient portions of the AST to permit such evaluation. llvm-svn: 331428	2018-05-03 03:58:32 +00:00
Craig Topper	13d759f87e	[CodeGen] Fix typo in comment form->from. NFC llvm-svn: 331231	2018-04-30 22:02:48 +00:00
Sanjay Patel	c81450e29b	[Driver, CodeGen] rename options to disable an FP cast optimization As suggested in the post-commit thread for rL331056, we should match these clang options with the established vocabulary of the corresponding sanitizer option. Also, the use of 'strict' is well-known for these kinds of knobs, and we can improve the descriptive text in the docs. So this intends to match the logic of D46135 but only change the words. Matching LLVM commit to match this spelling of the attribute to follow shortly. Differential Revision: https://reviews.llvm.org/D46236 llvm-svn: 331209	2018-04-30 18:19:03 +00:00
Sanjay Patel	d175476566	[Driver, CodeGen] add options to enable/disable an FP cast optimization As discussed in the post-commit thread for: rL330437 ( http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20180423/545906.html ) We need a way to opt-out of a float-to-int-to-float cast optimization because too much existing code relies on the platform-specific undefined result of those casts when the float-to-int overflows. The LLVM changes associated with adding this function attribute are here: rL330947 rL330950 rL330951 Also as suggested, I changed the LLVM doc to mention the specific sanitizer flag that catches this problem: rL330958 Differential Revision: https://reviews.llvm.org/D46135 llvm-svn: 331041	2018-04-27 14:22:48 +00:00
Akira Hatanaka	ccda3d2970	[CodeGen] Avoid destructing a callee-destructued struct type in a function if a function delegates to another function. Fix a bug introduced in r328731, which caused a struct with ObjC __weak fields that was passed to a function to be destructed twice, once in the callee function and once in another function the callee function delegates to. To prevent this, keep track of the callee-destructed structs passed to a function and disable their cleanups at the point of the call to the delegated function. This reapplies r331016, which was reverted in r331019 because it caused an assertion to fail in EmitDelegateCallArg on a windows bot. I made changes to EmitDelegateCallArg so that it doesn't try to deactivate cleanups for structs that have trivial destructors (cleanups for those structs are never pushed to the cleanup stack in EmitParmDecl). rdar://problem/39194693 Differential Revision: https://reviews.llvm.org/D45382 llvm-svn: 331020	2018-04-27 06:57:00 +00:00
Akira Hatanaka	b4f3637cec	Revert "[CodeGen] Avoid destructing a callee-destructued struct type in a" This reverts commit r331016, which broke a windows bot. http://lab.llvm.org:8011/builders/clang-x86-windows-msvc2015/builds/11727 llvm-svn: 331019	2018-04-27 05:56:55 +00:00
Akira Hatanaka	e712374496	[CodeGen] Avoid destructing a callee-destructued struct type in a function if a function delegates to another function. Fix a bug introduced in r328731, which caused a struct with ObjC __weak fields that was passed to a function to be destructed twice, once in the callee function and once in another function the callee function delegates to. To prevent this, keep track of the callee-destructed structs passed to a function and disable their cleanups at the point of the call to the delegated function. rdar://problem/39194693 Differential Revision: https://reviews.llvm.org/D45382 llvm-svn: 331016	2018-04-27 04:21:51 +00:00
Alexey Sotkin	3858e26f22	[OpenCL] Add 'denorms-are-zero' function attribute Summary: Generate attribute 'denorms-are-zero'='true' if '-cl-denorms-are-zero' compile option was specified and 'denorms-are-zero'='false' otherwise. Patch by krisb Reviewers: Anastasia, yaxunl Reviewed By: yaxunl Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D45808 llvm-svn: 330404	2018-04-20 08:08:04 +00:00
Akira Hatanaka	4ce0e5a892	[CodeGen] Do not push a destructor cleanup for a struct that doesn't have a non-trivial destructor. This fixes a bug introduced in r328731 where CodeGen emits calls to synthesized destructors for non-trivial C structs in C++ mode when the struct passed to EmitCallArg doesn't have a non-trivial destructor. Under Microsoft's ABI, ASTContext::isParamDestroyedInCallee currently always returns true, so it's necessary to check whether the struct has a non-trivial destructor before pushing a cleanup in EmitCallArg. This fixes PR37146. llvm-svn: 330304	2018-04-18 23:33:15 +00:00
Richard Smith	e78fac5126	PR36992: do not store beyond the dsize of a class object unless we know the tail padding is not reused. We track on the AggValueSlot (and through a couple of other initialization actions) whether we're dealing with an object that might share its tail padding with some other object, so that we can avoid emitting stores into the tail padding if that's the case. We still widen stores into tail padding when we can do so. Differential Revision: https://reviews.llvm.org/D45306 llvm-svn: 329342	2018-04-05 20:52:58 +00:00
Artem Belevich	55ebd6cc26	Revert "Set calling convention for CUDA kernel" This reverts r328795 which introduced an issue with referencing __global__ function templates. More details in the original review D44747. llvm-svn: 329099	2018-04-03 18:29:31 +00:00
Reid Kleckner	399d96e39c	[MS] Emit vftable thunks for functions with incomplete prototypes Summary: The following class hierarchy requires that we be able to emit a this-adjusting thunk for B::foo in C's vftable: struct Incomplete; struct A { virtual A* foo(Incomplete p) = 0; }; struct B : virtual A { void foo(Incomplete p) override; }; struct C : B { int c; }; This TU is valid, but lacks a definition of 'Incomplete', which makes it hard to build a thunk for the final overrider, B::foo. Before this change, Clang gives up attempting to emit the thunk, because it assumes that if the parameter types are incomplete, it must be emitting the thunk for optimization purposes. This is untrue for the MS ABI, where the implementation of B::foo has no idea what thunks C's vftable may require. Clang needs to emit the thunk without necessarily having access to the complete prototype of foo. This change makes Clang emit a musttail variadic call when it needs such a thunk. I call these "unprototyped" thunks, because they only prototype the "this" parameter, which must always come first in the MS C++ ABI. These thunks work, but they create ugly LLVM IR. If the call to the thunk is devirtualized, it will be a call to a bitcast of a function pointer. Today, LLVM cannot inline through such a call, but I want to address that soon, because we also use this pattern for virtual member pointer thunks. This change also implements an old FIXME in the code about reusing the thunk's computed CGFunctionInfo as much as possible. Now we don't end up computing the thunk's mangled name and arranging it's prototype up to around three times. Fixes PR25641 Reviewers: rjmccall, rsmith, hans Subscribers: Prazek, cfe-commits Differential Revision: https://reviews.llvm.org/D45112 llvm-svn: 329009	2018-04-02 20:20:33 +00:00
Richard Smith	866dee4ea0	Add helper to determine if a field is a zero-length bitfield. llvm-svn: 328999	2018-04-02 18:29:43 +00:00
Yaxun Liu	b2f2bb26e4	Set calling convention for CUDA kernel This patch sets target specific calling convention for CUDA kernels in IR. Patch by Greg Rodgers. Revised and lit test added by Yaxun Liu. Differential Revision: https://reviews.llvm.org/D44747 llvm-svn: 328795	2018-03-29 15:02:08 +00:00
Akira Hatanaka	fcbe17c6be	[ObjC++] Make parameter passing and function return compatible with ObjC ObjC and ObjC++ pass non-trivial structs in a way that is incompatible with each other. For example: typedef struct { id f0; __weak id f1; } S; // this code is compiled in c++. extern "C" { void foo(S s); } void caller() { // the caller passes the parameter indirectly and destructs it. foo(S()); } // this function is compiled in c. // 'a' is passed directly and is destructed in the callee. void foo(S a) { } This patch fixes the incompatibility by passing and returning structs with __strong or weak fields using the C ABI in C++ mode. __strong and __weak fields in a struct do not cause the struct to be destructed in the caller and __strong fields do not cause the struct to be passed indirectly. Also, this patch fixes the microsoft ABI bug mentioned here: https://reviews.llvm.org/D41039?id=128767#inline-364710 rdar://problem/38887866 Differential Revision: https://reviews.llvm.org/D44908 llvm-svn: 328731	2018-03-28 21:13:14 +00:00
David Blaikie	f47ca25423	Fix for LLVM change (Transforms/Utils/Local.h -> Analysis/Utils/Local.h) llvm-svn: 328166	2018-03-21 22:34:27 +00:00
Oren Ben Simhon	220671a080	Adding nocf_check attribute for cf-protection fine tuning The patch adds nocf_check target independent attribute for disabling checks that were enabled by cf-protection flag. The attribute can be appertained to functions and function pointers. Attribute name follows GCC's similar attribute name. Differential Revision: https://reviews.llvm.org/D41880 llvm-svn: 327768	2018-03-17 13:31:35 +00:00
Yaxun Liu	5b330e8d61	Recommit r326946 after reducing CallArgList memory footprint llvm-svn: 327634	2018-03-15 15:25:19 +00:00
Joel E. Denny	8150810556	Reland "[Attr] Fix parameter indexing for several attributes" Relands r326602 (reverted in r326862) with new test and fix for PR36620. Differential Revision: https://reviews.llvm.org/D43248 llvm-svn: 327405	2018-03-13 14:51:22 +00:00
Richard Smith	007cb6df58	Revert r326946. It caused stack overflows by significantly increasing the size of a CallArgList. llvm-svn: 327195	2018-03-10 01:47:22 +00:00
George Burgess IV	003be7cbf4	[CodeGen] Emit lifetime.ends in both EH and non-EH blocks Before this, we'd only emit lifetime.ends for these temps in non-exceptional paths. This potentially made our stack larger than it needed to be for any code that follows an EH cleanup. e.g. in ``` struct Foo { char cs[32]; }; void escape(void ); struct Bar { ~Bar() { char cs[64]; escape(cs); } }; Foo getFoo(); void baz() { Bar b; getFoo(); } ``` baz() would require 96 bytes of stack, since the temporary from getFoo() only had a lifetime.end on the non-exceptional path. This also makes us keep hold of the Value returned by EmitLifetimeStart, so we don't have to remake it later. llvm-svn: 326988	2018-03-08 05:32:30 +00:00
Yaxun Liu	06dd81149f	CodeGen: Fix address space of indirect function argument The indirect function argument is in alloca address space in LLVM IR. However, during Clang codegen for C++, the address space of indirect function argument should match its address space in the source code, i.e., default addr space, even for indirect argument. This is because destructor of the indirect argument may be called in the caller function, and address of the indirect argument may be taken, in either case the indirect function argument is expected to be in default addr space, not the alloca address space. Therefore, the indirect function argument should be mapped to the temp var casted to default address space. The caller will cast it to alloca addr space when passing it to the callee. In the callee, the argument is also casted to the default address space and used. CallArg is refactored to facilitate this fix. Differential Revision: https://reviews.llvm.org/D34367 llvm-svn: 326946	2018-03-07 21:45:40 +00:00
Nico Weber	bbf648253d	Revert r326602, it caused PR36620. llvm-svn: 326862	2018-03-07 02:22:41 +00:00
George Burgess IV	7e03f350e8	[CodeGen] Don't emit lifetime.end without lifetime.start EmitLifetimeStart returns a non-null `size` pointer if it actually emits a lifetime.start. Later in this function, we use `tempSize`'s nullness to determine whether or not we should emit a lifetime.end. llvm-svn: 326844	2018-03-06 23:07:00 +00:00
Joel E. Denny	4925445958	[Attr] Fix parameter indexing for several attributes The patch fixes a number of bugs related to parameter indexing in attributes: * Parameter indices in some attributes (argument_with_type_tag, pointer_with_type_tag, nonnull, ownership_takes, ownership_holds, and ownership_returns) are specified in source as one-origin including any C++ implicit this parameter, were stored as zero-origin excluding any this parameter, and were erroneously printing (-ast-print) and confusingly dumping (-ast-dump) as the stored values. * For alloc_size, the C++ implicit this parameter was not subtracted correctly in Sema, leading to assert failures or to silent failures of __builtin_object_size to compute a value. * For argument_with_type_tag, pointer_with_type_tag, and ownership_returns, the C++ implicit this parameter was not added back to parameter indices in some diagnostics. This patch fixes the above bugs and aims to prevent similar bugs in the future by introducing careful mechanisms for handling parameter indices in attributes. ParamIdx stores a parameter index and is designed to hide the stored encoding while providing accessors that require each use (such as printing) to make explicit the encoding that is needed. Attribute declarations declare parameter index arguments as [Variadic]ParamIdxArgument, which are exposed as ParamIdx[*]. This patch rewrites all attribute arguments that are processed by checkFunctionOrMethodParameterIndex in SemaDeclAttr.cpp to be declared as [Variadic]ParamIdxArgument. The only exception is xray_log_args's argument, which is encoded as a count not an index. Differential Revision: https://reviews.llvm.org/D43248 llvm-svn: 326602	2018-03-02 19:03:22 +00:00
Akira Hatanaka	627586b850	Add an option to disable tail-call optimization for escaping blocks. This makes it easier to debug crashes and hangs in block functions since users can easily find out where the block is called from. The option doesn't disable tail-calls from non-escaping blocks since non-escaping blocks are not as hard to debug as escaping blocks. rdar://problem/35758207 Differential Revision: https://reviews.llvm.org/D43841 llvm-svn: 326530	2018-03-02 01:53:15 +00:00
Saleem Abdulrasool	f181f1a6a2	CodeGenObjCXX: handle inalloca appropriately for msgSend variant objc_msgSend_stret takes a hidden parameter for the returned structure's address for the construction. When the function signature is rewritten for the inalloca passing, the return type is no longer marked as indirect but rather inalloca stret. This enhances the test for the indirect return to check for that case as well. This fixes the incorrect return classification for Windows x86. llvm-svn: 326362	2018-02-28 20:16:12 +00:00
Akira Hatanaka	7275da0f2e	[ObjC] Allow declaring __strong pointer fields in structs in Objective-C ARC mode. Declaring __strong pointer fields in structs was not allowed in Objective-C ARC until now because that would make the struct non-trivial to default-initialize, copy/move, and destroy, which is not something C was designed to do. This patch lifts that restriction. Special functions for non-trivial C structs are synthesized that are needed to default-initialize, copy/move, and destroy the structs and manage the ownership of the objects the __strong pointer fields point to. Non-trivial structs passed to functions are destructed in the callee function. rdar://problem/33599681 Differential Revision: https://reviews.llvm.org/D41228 llvm-svn: 326307	2018-02-28 07:15:55 +00:00
Alexey Sotkin	20f65928e1	[OpenCL] Add '-cl-uniform-work-group-size' compile option Summary: OpenCL 2.0 specification defines '-cl-uniform-work-group-size' option, which requires that the global work-size be a multiple of the work-group size specified to clEnqueueNDRangeKernel and allows optimizations that are made possible by this restriction. The patch introduces the support of this option. To keep information about whether an OpenCL kernel has uniform work group size or not, clang generates 'uniform-work-group-size' function attribute for every kernel: - "uniform-work-group-size"="true" for OpenCL 1.2 and lower, - "uniform-work-group-size"="true" for OpenCL 2.0 and higher if '-cl-uniform-work-group-size' option was specified, - "uniform-work-group-size"="false" for OpenCL 2.0 and higher if no '-cl-uniform-work-group-size' options was specified. If the function is not an OpenCL kernel, 'uniform-work-group-size' attribute isn't generated. Patch by: krisb Reviewers: yaxunl, Anastasia, b-sumner Reviewed By: yaxunl, Anastasia Subscribers: nhaehnle, yaxunl, Anastasia, cfe-commits Differential Revision: https://reviews.llvm.org/D43570 llvm-svn: 325771	2018-02-22 11:54:14 +00:00
Erich Keane	93e58667ee	Make attribute-target on a Definition-after-use update the LLVM attributes As reported here: https://bugs.llvm.org/show_bug.cgi?id=36301 The issue is that the 'use' causes the plain declaration to emit the attributes to LLVM-IR. However, if the definition added it later, these would silently disappear. This commit extracts that logic to its own function in CodeGenModule, and has the attribute-applications done during 'definition' update the attributes properly. Differential Revision: https://reviews.llvm.org/D43095 llvm-svn: 324907	2018-02-12 17:01:41 +00:00
Reid Kleckner	b75a3f04ec	[WinEH] Put funclet bundles on inline asm calls Summary: Fixes PR36247, which is where WinEHPrepare replaces inline asm in funclets with unreachable. Make getBundlesForFunclet return by value to simplify some call sites. Reviewers: smeenai, majnemer Subscribers: eraman, cfe-commits Differential Revision: https://reviews.llvm.org/D43033 llvm-svn: 324689	2018-02-09 00:16:41 +00:00
John McCall	9831b843d2	Pass around function pointers as CGCallees, not bare llvm::Value*s. The intention here is to make it easy to write frontend-assisted CFI systems by propagating extra information in the CGCallee. llvm-svn: 324377	2018-02-06 18:52:44 +00:00
Peter Collingbourne	ea21100272	IRGen: Move vtable load after argument evaluation. This change reduces the live range of the loaded function pointer, resulting in a slight code size decrease (~10KB in clang), and also improves the security of CFI for virtual calls by making it less likely that the function pointer will be spilled, and ensuring that it is not spilled across a function call boundary. Fixes PR35353. Differential Revision: https://reviews.llvm.org/D42725 llvm-svn: 324286	2018-02-05 23:09:13 +00:00
Akira Hatanaka	02914dc127	Add support for attribute 'trivial_abi'. The 'trivial_abi' attribute can be applied to a C++ class, struct, or union. It makes special functions of the annotated class (the destructor and copy/move constructors) to be trivial for the purpose of calls and, as a result, enables the annotated class or containing classes to be passed or returned using the C ABI for the underlying type. When a type that is considered trivial for the purpose of calls despite having a non-trivial destructor (which happens only when the class type or one of its subobjects is a 'trivial_abi' class) is passed to a function, the callee is responsible for destroying the object. For more background, see the discussions that took place on the mailing list: http://lists.llvm.org/pipermail/cfe-dev/2017-November/055955.html http://lists.llvm.org/pipermail/cfe-commits/Week-of-Mon-20180101/thread.html#214043 rdar://problem/35204524 Differential Revision: https://reviews.llvm.org/D41039 llvm-svn: 324269	2018-02-05 20:23:22 +00:00
Ivan A. Kosarev	1860b520a2	[CodeGen] Decorate aggregate accesses with TBAA tags Differential Revision: https://reviews.llvm.org/D41539 llvm-svn: 323421	2018-01-25 14:21:55 +00:00
Volodymyr Sapsai	17ebdb239f	Reland "[CodeGen] Fix crash when a function taking transparent union is redeclared." When a function taking transparent union is declared as taking one of union members earlier in the translation unit, clang would hit an "Invalid cast" assertion during EmitFunctionProlog. This case corresponds to function f1 in test/CodeGen/transparent-union-redecl.c. We decided to cast i32 to union because after merging function declarations function parameter type becomes int, CGFunctionInfo::ArgInfo type matches with ABIArgInfo type, so we decide it is a trivial case. But these types should also be castable to parameter declaration type which is not the case here. Now the fix is in converting from ABIArgInfo type to VarDecl type and using argument demotion when necessary. Additional tests in Sema/transparent-union.c capture current behavior and make sure there are no regressions. rdar://problem/34949329 Reviewers: rjmccall, rafael Reviewed By: rjmccall Subscribers: aemerson, cfe-commits, kristof.beyls, ahatanak Differential Revision: https://reviews.llvm.org/D41311 llvm-svn: 323156	2018-01-22 22:29:24 +00:00
Alex Bradbury	e41a5e2490	Refactor handling of signext/zeroext in ABIArgInfo As @rjmccall suggested in D40023, we can get rid of ABIInfo::shouldSignExtUnsignedType (used to handle cases like the Mips calling convention where 32-bit integers are always sign extended regardless of the sign of the type) by adding a SignExt field to ABIArgInfo. In the common case, this new field is set automatically by ABIArgInfo::getExtend based on the sign of the type. For targets that want greater control, they can use ABIArgInfo::getSignExtend or ABIArgInfo::getZeroExtend when necessary. This change also cleans up logic in CGCall.cpp. There is no functional change intended in this patch, and all tests pass unchanged. As noted in D40023, Mips might want to sign-extend unsigned 32-bit integer return types. A future patch might modify MipsABIInfo::classifyReturnType to use MipsABIInfo::extendType. Differential Revision: https://reviews.llvm.org/D41999 llvm-svn: 322396	2018-01-12 20:08:16 +00:00
Volodymyr Sapsai	22b00ec42e	Revert "[CodeGen] Fix crash when a function taking transparent union is redeclared." This reverts commit r321296. It caused performance regressions FAIL: imp.execution_time FAIL: 2007-01-04-KNR-Args.execution_time FAIL: sse_expandfft.execution_time FAIL: sse_stepfft.execution_time llvm-svn: 321306	2017-12-21 20:52:59 +00:00
Volodymyr Sapsai	614f3702d9	[CodeGen] Fix crash when a function taking transparent union is redeclared. When a function taking transparent union is declared as taking one of union members earlier in the translation unit, clang would hit an "Invalid cast" assertion during EmitFunctionProlog. This case corresponds to function f1 in test/CodeGen/transparent-union-redecl.c. We decided to cast i32 to union because after merging function declarations function parameter type becomes int, CGFunctionInfo::ArgInfo type matches with ABIArgInfo type, so we decide it is a trivial case. But these types should also be castable to parameter declaration type which is not the case here. The fix is in checking for the trivial case if ABIArgInfo type matches with parameter declaration type. It exposed inconsistency that we check hasScalarEvaluationKind for different types in EmitParmDecl and EmitFunctionProlog, and comment says they should match. Additional tests in Sema/transparent-union.c capture current behavior and make sure there are no regressions. rdar://problem/34949329 Reviewers: rjmccall, rafael Reviewed By: rjmccall Subscribers: aemerson, cfe-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D41311 llvm-svn: 321296	2017-12-21 19:42:37 +00:00
Vedant Kumar	09b5bfdd85	[ubsan] Diagnose noreturn functions which return Diagnose 'unreachable' UB when a noreturn function returns. 1. Insert a check at the end of functions marked noreturn. 2. A decl may be marked noreturn in the caller TU, but not marked in the TU where it's defined. To diagnose this scenario, strip away the noreturn attribute on the callee and insert check after calls to it. Testing: check-clang, check-ubsan, check-ubsan-minimal, D40700 rdar://33660464 Differential Revision: https://reviews.llvm.org/D40698 llvm-svn: 321231	2017-12-21 00:10:25 +00:00
Adrian Prantl	f3b3ccda59	Silence a bunch of implicit fallthrough warnings llvm-svn: 321115	2017-12-19 22:06:11 +00:00
Craig Topper	9a724aa38f	[Driver][CodeGen] Add -mprefer-vector-width driver option and attribute during CodeGen. This adds a new command line option -mprefer-vector-width to specify a preferred vector width for the vectorizers. Valid values are 'none' and unsigned integers. The driver will check that it meets those constraints. Specific supported integers will be managed by the targets in the backend. Clang will take the value and add it as a new function attribute during CodeGen. This represents the alternate direction proposed by Sanjay in this RFC: http://lists.llvm.org/pipermail/llvm-dev/2017-November/118734.html The syntax here matches gcc, though gcc treats it as an x86 specific command line argument. gcc only allows values of 128, 256, and 512. I'm not having clang check any values. Differential Revision: https://reviews.llvm.org/D40230 llvm-svn: 320419	2017-12-11 21:09:19 +00:00
Yaxun Liu	c325d30d2c	CodeGen: Fix invalid bitcasts for memcpy CreateCoercedLoad/CreateCoercedStore assumes pointer argument of memcpy is in addr space 0, which is not correct and causes invalid bitcasts for triple amdgcn---amdgiz. It is fixed by using alloca addr space instead. Differential Revision: https://reviews.llvm.org/D40806 llvm-svn: 320000	2017-12-07 01:39:52 +00:00
Craig Topper	b338400188	[Target] Make a copy of TargetOptions feature list before sorting during CodeGen Currently CodeGen is calling std::sort on the features vector in TargetOptions for every function, but I don't think CodeGen should be modifying TargetOptions. Differential Revision: https://reviews.llvm.org/D40228 llvm-svn: 319195	2017-11-28 18:00:32 +00:00
Craig Topper	402b431051	[CodeGen] Move Reciprocals option from TargetOptions to CodeGenOptions Diffrential Revision: https://reviews.llvm.org/D40226 llvm-svn: 318662	2017-11-20 17:09:22 +00:00
Sriraman Tallam	5c65148565	New clang option -fno-plt which avoids the PLT and lazy binding while making external calls. Differential Revision: https://reviews.llvm.org/D39079 llvm-svn: 317605	2017-11-07 19:37:51 +00:00
Erich Keane	857ac594b7	Replace a few usages of llvm::join with range-version[NFC] I noticed a few usages of llvm::join that were using begin/end rather than just the range version. This patch just replaces those. llvm-svn: 316784	2017-10-27 18:45:06 +00:00
Erich Keane	cf8807c931	Filter out invalid 'target' items from being passed to LLVM Craig noticed that CodeGen wasn't properly ignoring the values sent to the target attribute. This patch ignores them. This patch also sets the 'default' for this checking to 'supported', since only X86 has implemented the support for checking valid CPU names and Feature Names. One test was changed to i686, since it uses a lakemont, which would otherwise be prohibited in x86_64. Differential Revision: https://reviews.llvm.org/D39357 llvm-svn: 316783	2017-10-27 18:32:23 +00:00
Matt Arsenault	a1cf61b6fc	OpenCL: Assume functions are convergent This was done for CUDA functions in r261779, and for the same reason this also needs to be done for OpenCL. An arbitrary function could have a barrier() call in it, which in turn requires the calling function to be convergent. llvm-svn: 315094	2017-10-06 19:34:40 +00:00
Akira Hatanaka	98a49337be	Add support for attribute 'noescape'. The attribute informs the compiler that the annotated pointer parameter of a function cannot escape and enables IRGen to attach attribute 'nocapture' to parameters that are annotated with the attribute. That is the only optimization that currently takes advantage of 'noescape', but there are other optimizations that will be added later that improves IRGen for ObjC blocks. This recommits r313722, which was reverted in r313725 because clang couldn't build compiler-rt. It failed to build because there were function declarations that were missing 'noescape'. That has been fixed in r313929. rdar://problem/19886775 Differential Revision: https://reviews.llvm.org/D32210 llvm-svn: 313945	2017-09-22 00:41:05 +00:00
Akira Hatanaka	30c93dba5b	Revert "Add support for attribute 'noescape'." This reverts commit r313722. It looks like compiler-rt/lib/tsan/rtl/tsan_libdispatch_mac.cc cannot be compiled because some of the functions declared in the file do not match the ones in the SDK headers (which are annotated with 'noescape'). llvm-svn: 313725	2017-09-20 06:55:43 +00:00
Akira Hatanaka	e974479fa5	Add support for attribute 'noescape'. The attribute informs the compiler that the annotated pointer parameter of a function cannot escape and enables IRGen to attach attribute 'nocapture' to parameters that are annotated with the attribute. That is the only optimization that currently takes advantage of 'noescape', but there are other optimizations that will be added later that improves IRGen for ObjC blocks. rdar://problem/19886775 Differential Revision: https://reviews.llvm.org/D32210 llvm-svn: 313722	2017-09-20 06:32:45 +00:00
Akira Hatanaka	1b9418e163	Revert "Add support for attribute 'noescape'." This reverts r313717. I closed the wrong phabricator review. llvm-svn: 313721	2017-09-20 06:27:39 +00:00
Akira Hatanaka	fc587e6a57	Add support for attribute 'noescape'. The attribute informs the compiler that the annotated pointer parameter of a function cannot escape and enables IRGen to attach attribute 'nocapture' to parameters that are annotated with the attribute. That is the only optimization that currently takes advantage of 'noescape', but there are other optimizations that will be added later that improves IRGen for ObjC blocks. rdar://problem/19886775 Differential Revision: https://reviews.llvm.org/D32520 llvm-svn: 313720	2017-09-20 06:22:51 +00:00
Nuno Lopes	9211ceef2d	clang fix for LLVM API change: isKnownNonNull -> isKnownNonZero Differential Revision: https://reviews.llvm.org/D37628 llvm-svn: 312870	2017-09-09 18:25:36 +00:00
Matt Arsenault	7a124f3ce5	Fix creating bitcasts with wrong address space In a future commit AMDGPU will start passing aggregates directly to more functions, triggering asserts in test/CodeGenOpenCL/addr-space-struct-arg.cl llvm-svn: 309741	2017-08-01 20:36:57 +00:00
Erich Keane	b0d4423bff	Convert attribute 'target' parsing from a 'pair' to a 'struct' to make further improvements easier Convert attribute 'target' parsing from a 'pair' to a 'struct' to make further improvements easier The attribute 'target' parse function previously returned a pair. Convert this to a 'pair' in order to add more functionality, and improve usability. Differential Revision: https://reviews.llvm.org/D35574 llvm-svn: 308357	2017-07-18 20:41:02 +00:00
Martin Storsjo	022e782e75	[AArch64] Add support for __builtin_ms_va_list on aarch64 Move builtins from the x86 specific scope into the global scope. Their use is still limited to x86_64 and aarch64 though. This allows wine on aarch64 to properly handle variadic functions. Differential Revision: https://reviews.llvm.org/D34475 llvm-svn: 308218	2017-07-17 20:49:45 +00:00
Martin Storsjo	d1daa95e11	Update use of llvm::CallingConv:X86_64_Win64 after LLVM commit r308208 llvm-svn: 308209	2017-07-17 20:05:56 +00:00
Hiroshi Inoue	c5e54ddab3	fix trivial typos in comments; NFC llvm-svn: 307007	2017-07-03 08:49:44 +00:00
Yaxun Liu	e9e5c4f975	CodeGen: Fix invalid bitcast for coerced function argument Clang assumes coerced function argument is in address space 0, which is not always true and results in invalid bitcasts. This patch fixes failure in OpenCL conformance test api/get_kernel_arg_info with amdgcn---amdgizcl triple, where non-zero alloca address space is used. Differential Revision: https://reviews.llvm.org/D34777 llvm-svn: 306721	2017-06-29 18:47:45 +00:00
Akira Hatanaka	46dd7dbc8c	[CodeGen] Fix assertion failure in EmitCallArg. The assertion was failing when a method of a parameterized class was called and the types of the argument and parameter didn't match. To fix the failure, move the assertion in EmitCallArg to its only caller EmitCallArgs and require the argument and parameter types match only when the method is not parameterized. rdar://problem/32874473 Differential Revision: https://reviews.llvm.org/D34665 llvm-svn: 306494	2017-06-28 00:42:48 +00:00
Vedant Kumar	c34d343f15	[ubsan] Improve diagnostics for return value checks (clang) This patch makes ubsan's nonnull return value diagnostics more precise, which makes the diagnostics more useful when there are multiple return statements in a function. Example: 1 \|__attribute__((returns_nonnull)) char *foo() { 2 \| if (...) { 3 \| return expr_which_might_evaluate_to_null(); 4 \| } else { 5 \| return another_expr_which_might_evaluate_to_null(); 6 \| } 7 \|} // <- The current diagnostic always points here! runtime error: Null returned from Line 7, Column 2! With this patch, the diagnostic would point to either Line 3, Column 5 or Line 5, Column 5. This is done by emitting source location metadata for each return statement in a sanitized function. The runtime is passed a pointer to the appropriate metadata so that it can prepare and deduplicate reports. Compiler-rt patch (with more tests): https://reviews.llvm.org/D34298 Differential Revision: https://reviews.llvm.org/D34299 llvm-svn: 306163	2017-06-23 21:32:38 +00:00
Yaxun Liu	84744c152a	CodeGen: Cast temporary variable to proper address space In C++ all variables are in default address space. Previously change has been made to cast automatic variables to default address space. However that is not sufficient since all temporary variables need to be casted to default address space. This patch casts all temporary variables to default address space except those for passing indirect arguments since they are only used for load/store. This patch only affects target having non-zero alloca address space. Differential Revision: https://reviews.llvm.org/D33706 llvm-svn: 305711	2017-06-19 17:03:41 +00:00
Xinliang David Li	4ec3606835	Preserve cold attribute for function decls Differential Revision: http://reviews.llvm.org/D34133 llvm-svn: 305325	2017-06-13 21:14:07 +00:00
Galina Kistanova	0872d6c275	Added LLVM_FALLTHROUGH to address warning: this statement may fall through. NFC. llvm-svn: 304649	2017-06-03 06:30:46 +00:00
Pekka Jaaskelainen	fc2629a65a	[OpenCL] Makes kernels use the SPIR_KERNEL CC by default. Rationale: OpenCL kernels are called via an explicit runtime API with arguments set with clSetKernelArg(), not as normal sub-functions. Return SPIR_KERNEL by default as the kernel calling convention to ensure the fingerprint is fixed such way that each OpenCL argument gets one matching argument in the produced kernel function argument list to enable feasible implementation of clSetKernelArg() with aggregates etc. In case we would use the default C calling conv here, clSetKernelArg() might break depending on the target-specific conventions; different targets might split structs passed as values to multiple function arguments etc. https://reviews.llvm.org/D33639 llvm-svn: 304389	2017-06-01 07:18:49 +00:00
Reid Kleckner	ee4930b688	Re-land r301697 "[IR] Make add/remove Attributes use AttrBuilder instead of AttributeList" This time, I fixed, built, and tested clang. This reverts r301712. llvm-svn: 301981	2017-05-02 22:07:37 +00:00
Oren Ben Simhon	318a6eae06	[X86] Support of no_caller_saved_registers attribute Implements the Clang part for no_caller_saved_registers attribute as appears here: https://gcc.gnu.org/git/?p=gcc.git;a=commit;h=5ed3cc7b66af4758f7849ed6f65f4365be8223be. Differential Revision: https://reviews.llvm.org/D31871 llvm-svn: 301535	2017-04-27 12:01:00 +00:00
Reid Kleckner	9d16fa09c6	Prefer addAttr(Attribute::AttrKind) over the AttributeList overload This should simplify the call sites, which typically want to tweak one attribute at a time. It should also avoid creating ephemeral AttributeLists that live forever. llvm-svn: 300718	2017-04-19 17:28:52 +00:00
Reid Kleckner	cdd26794a9	Use less temporary AttributeLists NFC llvm-svn: 300628	2017-04-18 23:50:03 +00:00
Yaxun Liu	d7523283a7	CodeGen: Let byval parameter use alloca address space Differential Revision: https://reviews.llvm.org/D32133 llvm-svn: 300487	2017-04-17 20:10:44 +00:00
Matt Arsenault	502ad60c8f	Update for AllocaInst construction changes llvm-svn: 299889	2017-04-10 22:28:02 +00:00
Erich Keane	623efd8a75	Clang changes for alloc_align attribute GCC has the alloc_align attribute, which is similar to assume_aligned, except the attribute's parameter is the index of the integer parameter that needs aligning to. Differential Revision: https://reviews.llvm.org/D29599 llvm-svn: 299117	2017-03-30 21:48:55 +00:00
Chandler Carruth	45bbe0117b	Revert r298491 and r298494 which changed Clang's handling of 'nonnull' attributes. These patches don't work because we can't currently access the parameter information in a reliable way when building attributes. I thought this would be relatively straightforward to fix, but it seems not to be the case. Fixing this will requrie a substantial re-plumbing of machinery to allow attributes to be handled in this location, and several other fixes to the attribute machinery should probably be made at the same time. All of this will make the patch .... substantially more complicated. Reverting for now as there are active miscompiles caused by the current version. llvm-svn: 298695	2017-03-24 09:11:57 +00:00
Richard Smith	2c27df7603	Remove all uses of std::mem_fun and std::bind1st removed in C++17. llvm-svn: 298657	2017-03-23 23:17:58 +00:00
Chandler Carruth	421fa6c9e2	Remove an overly aggressive assert in r298491 and leave a comment explaining why we have to ignore errors here even though in other parts of codegen we can be more strict with builtins. Also add a test case based on the code in a TSan test that found this issue. llvm-svn: 298494	2017-03-22 10:38:07 +00:00
Chandler Carruth	9b3607f0a6	[nonnull] Teach Clang to attach the nonnull LLVM attribute to declarations and calls instead of just definitions, and then teach it to not attach such attributes even if the source code contains them. This follows the design direction discussed on cfe-dev here: http://lists.llvm.org/pipermail/cfe-dev/2017-January/052066.html The idea is that for C standard library builtins, even if the library vendor chooses to annotate their routines with __attribute__((nonnull)), we will ignore those attributes which pertain to pointer arguments that have an associated size. This allows the widespread (and seemingly reasonable) pattern of calling these routines with a null pointer and a zero size. I have only done this for the library builtins currently recognized by Clang, but we can now trivially add to this set. This will be controllable with -fno-builtin if anyone should care to do so. Note that this does not change the AST. As a consequence, warnings, static analysis, and source code rewriting are not impacted. This isn't even a regression on any platform as neither Clang nor LLVM have ever put 'nonnull' onto these arguments for declarations. All this patch does is enable it on other declarations while preventing us from ever accidentally enabling it on these libc functions due to a library vendor. It will also allow any other libraries using this annotation to gain optimizations based on the annotation even when only a declaration is visible. llvm-svn: 298491	2017-03-22 09:09:13 +00:00
Reid Kleckner	de86482ce0	Update Clang for LLVM rename AttributeSet -> AttributeList llvm-svn: 298394	2017-03-21 16:57:30 +00:00
Vedant Kumar	2b9f48afdd	[ubsan] Use the nicer nullability diagnostic handlers This is a follow-up to r297700 (Add a nullability sanitizer). It addresses some FIXME's re: using nullability-specific diagnostic handlers from compiler-rt, now that the necessary handlers exist. check-ubsan test updates to follow. llvm-svn: 297750	2017-03-14 16:48:29 +00:00
Vedant Kumar	42c17ec5ac	[ubsan] Add a nullability sanitizer Teach UBSan to detect when a value with the _Nonnull type annotation assumes a null value. Call expressions, initializers, assignments, and return statements are all checked. Because _Nonnull does not affect IRGen, the new checks are disabled by default. The new driver flags are: -fsanitize=nullability-arg (_Nonnull violation in call) -fsanitize=nullability-assign (_Nonnull violation in assignment) -fsanitize=nullability-return (_Nonnull violation in return stmt) -fsanitize=nullability (all of the above) This patch builds on top of UBSan's existing support for detecting violations of the nonnull attributes ('nonnull' and 'returns_nonnull'), and relies on the compiler-rt support for those checks. Eventually we will need to update the diagnostic messages in compiler-rt (there are FIXME's for this, which will be addressed in a follow-up). One point of note is that the nullability-return check is only allowed to kick in if all arguments to the function satisfy their nullability preconditions. This makes it necessary to emit some null checks in the function body itself. Testing: check-clang and check-ubsan. I also built some Apple ObjC frameworks with an asserts-enabled compiler, and verified that we get valid reports. Differential Revision: https://reviews.llvm.org/D30762 llvm-svn: 297700	2017-03-14 01:56:34 +00:00
Vedant Kumar	ed00ea084e	[ubsan] Extend the nonnull arg check to ObjC UBSan's nonnull argument check applies when a parameter has the "nonnull" attribute. The check currently works for FunctionDecls, but not for ObjCMethodDecls. This patch extends the check to work for ObjC. Differential Revision: https://reviews.llvm.org/D30599 llvm-svn: 296996	2017-03-06 05:28:22 +00:00
George Burgess IV	b7760210d3	Represent pass_object_size attrs in ExtParameterInfo The goal of this is to fix a bug in modules where we'd merge FunctionDecls that differed in their pass_object_size attributes. Since we can overload on the presence of pass_object_size attributes, this behavior is incorrect. We don't represent `N` in `pass_object_size(N)` as part of ExtParameterInfo, since it's an error to overload solely on the value of N. This means that we have a bug if we have two modules that declare functions that differ only in their pass_object_size attrs, like so: // In module A, from a.h void foo(char __attribute__((pass_object_size(0)))); // In module B, from b.h void foo(char __attribute__((pass_object_size(1)))); // In module C, in main.c #include "a.h" #include "b.h" At the moment, we'll merge the foo decls, when we should instead emit a diagnostic about an invalid overload. We seem to have similar (silent) behavior if we overload only on the return type of `foo` instead; I'll try to find a good place to put a FIXME (or I'll just file a bug) soon. This patch also fixes a bug where we'd not output the proper extended parameter info for declarations with pass_object_size attrs. llvm-svn: 296076	2017-02-24 02:49:47 +00:00
George Burgess IV	d0a9e807f3	[CodeGen] Fix ExtParameterInfo bugs in C++ CodeGen code. This patch makes use of the prefix/suffix ABI argument distinction that was introduced in r295870, so that we now emit ExtParameterInfo at the correct offset for member calls that have added ABI arguments. I don't see a good way to test the generated param info, since we don't actually seem to use it in CGFunctionInfo outside of Swift. Any suggestions/thoughts for how to better test this are welcome. :) This patch also fixes a small bug with inheriting constructors: if we decide not to pass args into an base class ctor, we would still generate ExtParameterInfo as though we did. The added test-case is for that behavior. llvm-svn: 296024	2017-02-23 22:07:35 +00:00
George Burgess IV	0d6592a899	[CodeGen] Don't reemit expressions for pass_object_size params. This fixes an assertion failure in cases where we had expression statements that declared variables nested inside of pass_object_size args. Since we were emitting the same ExprStmt twice (once for the arg, once for the @llvm.objectsize call), we were getting issues with redefining locals. This also means that we can be more lax about when we emit @llvm.objectsize for pass_object_size args: since we're reusing the arg's value itself, we don't have to care so much about side-effects. llvm-svn: 295935	2017-02-23 05:59:56 +00:00
George Burgess IV	75b34a9610	[CodeGen] Add param info for ctors with ABI args. This fixes a few assertion failures. Please see the added test case. llvm-svn: 295894	2017-02-22 22:38:25 +00:00
Simon Pilgrim	27cc054b1c	Fix spelling mistake - paramater -> parameter. NFCI. llvm-svn: 295183	2017-02-15 15:12:06 +00:00
Justin Lebar	b080b630b1	[CodeGen] [CUDA] Add the ability set default attrs on functions in linked modules. Summary: Now when you ask clang to link in a bitcode module, you can tell it to set attributes on that module's functions to match what we would have set if we'd emitted those functions ourselves. This is particularly important for fast-math attributes in CUDA compilations. Each CUDA compilation links in libdevice, a bitcode library provided by nvidia as part of the CUDA distribution. Without this patch, if we have a user-function F that is compiled with -ffast-math that calls a function G from libdevice, F will have the unsafe-fp-math=true (etc.) attributes, but G will have no attributes. Since F calls G, the inliner will merge G's attributes into F's. It considers the lack of an unsafe-fp-math=true attribute on G to be tantamount to unsafe-fp-math=false, so it "merges" these by setting unsafe-fp-math=false on F. This then continues up the call graph, until every function that (transitively) calls something in libdevice gets unsafe-fp-math=false set, thus disabling fastmath in almost all CUDA code. Reviewers: echristo Subscribers: hfinkel, llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D28538 llvm-svn: 293097	2017-01-25 21:29:48 +00:00
George Burgess IV	35cfca2e20	Clean up redundant isa<T> before getAs<T>. NFC. llvm-svn: 291264	2017-01-06 19:10:48 +00:00
George Burgess IV	e37633713d	Add the alloc_size attribute to clang, attempt 2. This is a recommit of r290149, which was reverted in r290169 due to msan failures. msan was failing because we were calling `isMostDerivedAnUnsizedArray` on an invalid designator, which caused us to read uninitialized memory. To fix this, the logic of the caller of said function was simplified, and we now have a `!Invalid` assert in `isMostDerivedAnUnsizedArray`, so we can catch this particular bug more easily in the future. Fingers crossed that this patch sticks this time. :) Original commit message: This patch does three things: - Gives us the alloc_size attribute in clang, which lets us infer the number of bytes handed back to us by malloc/realloc/calloc/any user functions that act in a similar manner. - Teaches our constexpr evaluator that evaluating some `const` variables is OK sometimes. This is why we have a change in test/SemaCXX/constant-expression-cxx11.cpp and other seemingly unrelated tests. Richard Smith okay'ed this idea some time ago in person. - Uniques some Blocks in CodeGen, which was reviewed separately at D26410. Lack of uniquing only really shows up as a problem when combined with our new eagerness in the face of const. llvm-svn: 290297	2016-12-22 02:50:20 +00:00
Chandler Carruth	d7738fe6ad	Revert r290149: Add the alloc_size attribute to clang. This commit fails MSan when running test/CodeGen/object-size.c in a confusing way. After some discussion with George, it isn't really clear what is going on here. We can make the MSan failure go away by testing for the invalid bit, but why things are invalid isn't clear. And yet, other code in the surrounding area is doing precisely this and testing for invalid. George is going to take a closer look at this to better understand the nature of the failure and recommit it, for now backing it out to clean up MSan builds. llvm-svn: 290169	2016-12-20 08:28:19 +00:00
George Burgess IV	a747027bc6	Add the alloc_size attribute to clang. This patch does three things: - Gives us the alloc_size attribute in clang, which lets us infer the number of bytes handed back to us by malloc/realloc/calloc/any user functions that act in a similar manner. - Teaches our constexpr evaluator that evaluating some `const` variables is OK sometimes. This is why we have a change in test/SemaCXX/constant-expression-cxx11.cpp and other seemingly unrelated tests. Richard Smith okay'ed this idea some time ago in person. - Uniques some Blocks in CodeGen, which was reviewed separately at D26410. Lack of uniquing only really shows up as a problem when combined with our new eagerness in the face of const. Differential Revision: https://reviews.llvm.org/D14274 llvm-svn: 290149	2016-12-20 01:05:42 +00:00
Filipe Cabecinhas	322ecd901b	[clang] Version support for UBSan handlers This adds a way for us to version any UBSan handler by itself. The patch overrides D21289 for a better implementation (we're able to rev up a single handler). After this, then we can land a slight modification of D19667+D19668. We probably don't want to keep all the versions in compiler-rt (maybe we want to deprecate on one release and remove the old handler on the next one?), but with this patch we will loudly fail to compile when mixing incompatible handler calls, instead of silently compiling and then providing bad error messages. Reviewers: kcc, samsonov, rsmith, vsk Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D21695 llvm-svn: 289444	2016-12-12 16:18:40 +00:00
Peter Collingbourne	b367c567d9	IRGen: Remove all uses of CreateDefaultAlignedLoad. Differential Revision: https://reviews.llvm.org/D27157 llvm-svn: 288083	2016-11-28 22:30:21 +00:00
John McCall	811b291d8c	Forward ns_consumed delegate arguments with a move. StartFunction enters a release cleanup for ns_consumed arguments in ARC, so we need to balance that somehow. We could teach StartFunction that it's emitting a delegating function, so that the cleanup is unnecessary, but that would be invasive and somewhat fraught. We could balance the consumed argument with an extra retain, but clearing the original variable should be easier to optimize and avoid some extra work at -O0. And there shouldn't be any difference as long as nothing else uses the argument, which should always be true for the places we emit delegate arguments. Fixes PR 27887. llvm-svn: 287291	2016-11-18 01:08:24 +00:00
Erich Keane	757d317c24	regcall: Implement regcall Calling Conv in clang This patch implements the register call calling convention, which ensures as many values as possible are passed in registers. CodeGen changes were committed in https://reviews.llvm.org/rL284108. Differential Revision: https://reviews.llvm.org/D25204 llvm-svn: 285849	2016-11-02 18:29:35 +00:00
Yaxun Liu	7d07ae7c85	[OpenCL] Mark group functions as convergent in opencl-c.h Certain OpenCL builtin functions are supposed to be executed by all threads in a work group or sub group. Such functions should not be made divergent during transformation. It makes sense to mark them with convergent attribute. The adding of convergent attribute is based on Ettore Speziale's work and the original proposal and patch can be found at https://www.mail-archive.com/cfe-commits@lists.llvm.org/msg22271.html. Differential Revision: https://reviews.llvm.org/D25343 llvm-svn: 285725	2016-11-01 18:45:32 +00:00
John McCall	b92ab1afd5	Refactor call emission to package the function pointer together with abstract information about the callee. NFC. The goal here is to make it easier to recognize indirect calls and trigger additional logic in certain cases. That logic will come in a later patch; in the meantime, I felt that this was a significant improvement to the code. llvm-svn: 285258	2016-10-26 23:46:34 +00:00
Justin Lebar	3e6449b4f4	[CUDA] Mark device functions as nounwind. Summary: This prevents clang from emitting 'invoke's and catch statements. Things previously mostly worked thanks to TryToMarkNoThrow() in CodeGenFunction. But this is not a proper IPO, and it doesn't properly handle cases like mutual recursion. Fixes bug 30593. Reviewers: tra Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D25166 llvm-svn: 283272	2016-10-04 23:41:49 +00:00
Sanjay Patel	0bb72c1424	[clang] make reciprocal estimate codegen a function attribute The motivation for the change is that we can't have pseudo-global settings for codegen living in TargetOptions because that doesn't work with LTO. Ideally, these reciprocal attributes will be moved to the instruction-level via FMF, metadata, or something else. But making them function attributes is at least an improvement over the current state. I'm committing this patch ahead of the related LLVM patch to avoid bot failures, but if that patch needs to be reverted, then this should be reverted too. Differential Revision: https://reviews.llvm.org/D24815 llvm-svn: 283251	2016-10-04 20:44:05 +00:00
Vedant Kumar	30914f3d1c	[ARC] Ignore qualifiers in copy-restore expressions When ARC is enabled, an ObjCIndirectCopyRestoreExpr models the passing of a function argument s.t: * The argument is copied into a temporary, * The temporary is passed into the function, and * After the function call completes, the temporary is move-assigned back to the original location of the argument. The argument type and the parameter type must agree "except possibly in qualification". This commit weakens an assertion in EmitCallArg() to actually reflect that. llvm-svn: 283116	2016-10-03 15:29:22 +00:00
Richard Smith	a560ccf2af	Switch to a different workaround for unimplementability of P0145R3 in MS ABIs. Instead of ignoring the evaluation order rule, ignore the "destroy parameters in reverse construction order" rule for the small number of problematic cases. This only causes incorrect behavior in the rare case where both parameters to an overloaded operator <<, >>, ->*, &&, \|\|, or comma are of class type with non-trivial destructor, and the program is depending on those parameters being destroyed in reverse construction order. We could do a little better here by reversing the order of parameter destruction for those functions (and reversing the argument evaluation order for all direct calls, not just those with operator syntax), but that is not a complete solution to the problem, as the same situation can be reached by an indirect function call. Approach reviewed off-line by rnk. llvm-svn: 282777	2016-09-29 21:30:12 +00:00
Richard Smith	762672a73a	Re-commit r282556, reverted in r282564, with a fix to CallArgList::addFrom to function correctly when targeting MS ABIs (this appears to have never mattered prior to this change). Update test case to always cover both 32-bit and 64-bit Windows ABIs, since they behave somewhat differently from each other here. Update test case to also cover operators , && and \|\|, which it appears are also affected by P0145R3 (they're not explicitly called out by the design document, but this is the emergent behavior of the existing wording). Original commit message: P0145R3 (C++17 evaluation order tweaks): evaluate the right-hand side of assignment and compound-assignment operators before the left-hand side. (Even if it's an overloaded operator.) This completes the implementation of P0145R3 + P0400R0 for all targets except Windows, where the evaluation order guarantees for <<, >>, and ->* are unimplementable as the ABI requires the function arguments are evaluated from right to left (because parameter destructors are run from left to right in the callee). llvm-svn: 282619	2016-09-28 19:09:10 +00:00
Richard Smith	4499145a5f	Revert r282556. This change made several bots unhappy. llvm-svn: 282564	2016-09-28 02:20:06 +00:00
Richard Smith	97a616d624	P0145R3 (C++17 evaluation order tweaks): evaluate the right-hand side of assignment and compound-assignment operators before the left-hand side. (Even if it's an overloaded operator.) This completes the implementation of P0145R3 + P0400R0 for all targets except Windows, where the evaluation order guarantees for <<, >>, and ->* are unimplementable as the ABI requires the function arguments are evaluated from right to left (because parameter destructors are run from left to right in the callee). llvm-svn: 282556	2016-09-27 23:44:22 +00:00
Nick Lewycky	d9bce5062e	Replace 'isProvablyNonNull' with existing utility llvm::IsKnownNonNull which handles more cases. Noticed by inspection. Because of how the IR generation works, this isn't expected to cause an observable difference. llvm-svn: 281979	2016-09-20 15:49:58 +00:00
Reid Kleckner	e5a321b5e8	[MS] Fix prologue this adjustment when 'this' is passed indirectly Move the logic for doing this from the ABI argument lowering into EmitParmDecl, which runs for all parameters. Our codegen is slightly suboptimal in this case, as we may leave behind a dead store after optimization, but it's 32-bit inalloca, and this fixes the bug in a robust way. Fixes PR30293 llvm-svn: 280836	2016-09-07 18:21:30 +00:00
Sjoerd Meijer	0a8d4216ad	This adds new options -fdenormal-fp-math and passes through option -ffast-math to CC1, which are translated to function attributes and can e.g. be mapped on build attributes FP_exceptions and FP_denormal. Setting these build attributes allows better selection of floating point libraries. Differential Revision: https://reviews.llvm.org/D23840 llvm-svn: 280064	2016-08-30 08:09:45 +00:00
Justin Bogner	882f861cc7	CodeGen: Rename a variable to better fit LLVM style. NFC llvm-svn: 279159	2016-08-18 21:46:54 +00:00
Saleem Abdulrasool	be25c486dc	CodeGen: use range based for loop, NFC llvm-svn: 279154	2016-08-18 21:40:06 +00:00
Yaxun Liu	ffb60901fe	[OpenCL] Handle -cl-fp32-correctly-rounded-divide-sqrt Let the driver pass the option to frontend. Do not set precision metadata for division instructions when this option is set. Set function attribute "correctly-rounded-divide-sqrt-fp-math" based on this option. Differential Revision: https://reviews.llvm.org/D22940 llvm-svn: 278155	2016-08-09 20:10:18 +00:00
Yaxun Liu	79c99fb7eb	[OpenCL] Add missing -cl-no-signed-zeros option into driver Add OCL option -cl-no-signed-zeros to driver options. Also added to opencl.cl testcases. Patch by Aaron En Ye Shi. Differential Revision: http://reviews.llvm.org/D22067 llvm-svn: 274923	2016-07-08 20:28:29 +00:00
Nikolay Haustov	8c6538b86d	AMDGPU: Set amdgpu_kernel calling convention for OpenCL kernels. Summary: Summary: Change Clang calling convention SpirKernel to OpenCLKernel. Set calling convention OpenCLKernel for amdgcn as well. Add virtual method .getOpenCLKernelCallingConv() to TargetCodeGenInfo and use it to set target calling convention for AMDGPU and SPIR. Update tests. Reviewers: rsmith, tstellarAMD, Anastasia, yaxunl Subscribers: kzhuravl, cfe-commits Differential Revision: http://reviews.llvm.org/D21367 llvm-svn: 274220	2016-06-30 09:06:33 +00:00
Richard Smith	5179eb7821	P0136R1, DR1573, DR1645, DR1715, DR1736, DR1903, DR1941, DR1959, DR1991: Replace inheriting constructors implementation with new approach, voted into C++ last year as a DR against C++11. Instead of synthesizing a set of derived class constructors for each inherited base class constructor, we make the constructors of the base class visible to constructor lookup in the derived class, using the normal rules for using-declarations. For constructors, UsingShadowDecl now has a ConstructorUsingShadowDecl derived class that tracks the requisite additional information. We create shadow constructors (not found by name lookup) in the derived class to model the actual initialization, and have a new expression node, CXXInheritedCtorInitExpr, to model the initialization of a base class from such a constructor. (This initialization is special because it performs real perfect forwarding of arguments.) In cases where argument forwarding is not possible (for inalloca calls, variadic calls, and calls with callee parameter cleanup), the shadow inheriting constructor is not emitted and instead we directly emit the initialization code into the caller of the inherited constructor. Note that this new model is not perfectly compatible with the old model in some corner cases. In particular: * if B inherits a private constructor from A, and C uses that constructor to construct a B, then we previously required that A befriends B and B befriends C, but the new rules require A to befriend C directly, and * if a derived class has its own constructors (and so its implicit default constructor is suppressed), it may still inherit a default constructor from a base class llvm-svn: 274049	2016-06-28 19:03:57 +00:00
David Majnemer	59f7792136	Use more ArrayRefs No functional change is intended, just a small refactoring. llvm-svn: 273647	2016-06-24 04:05:48 +00:00
George Burgess IV	419996ccb5	[CodeGen] Fix a segfault caused by pass_object_size. This patch fixes a bug where we'd segfault (in some cases) if we saw a variadic function with one or more pass_object_size arguments. Differential Revision: http://reviews.llvm.org/D17462 llvm-svn: 272971	2016-06-16 23:06:04 +00:00
Richard Smith	d62d498ed7	Remove nonsense and simplify. To forward a reference, we always just load the pointer-to-pointer representing the parameter. An aggregate rvalue representing a pointer does not make sense. We got away with this weirdness because CGCall happens to blindly load an RValue in aggregate form in this case, without checking whether an RValue for the type should be in scalar or aggregate form. llvm-svn: 272609	2016-06-14 01:13:21 +00:00
Marcin Koscielnicki	b31ee6db11	[SystemZ] Add -mbackchain option. This option, like the corresponding gcc option, is SystemZ-specific and enables storing frame backchain links, as specified in the ABI. Differential Revision: http://reviews.llvm.org/D19891 llvm-svn: 268575	2016-05-04 23:37:40 +00:00
Reid Kleckner	9d03109233	Fix argument expansion of reference fields of structs r268261 made Clang "expand" more struct arguments on Windows. It removed the check for 'RD->isCLike()', which was preventing us from attempting to expand structs with reference type fields. Our expansion code was attempting to load and pass each field of the type in turn. We were accidentally doing one to many loads on reference type fields. On the function prologue side, we can use EmitLValueForFieldInitialization, which obviously gets the address of the field. On the call side, I tweaked EmitRValueForField directly, since this is the only use of this method. Fixes PR27607 llvm-svn: 268321	2016-05-02 22:42:34 +00:00
Saleem Abdulrasool	10a4972a8d	revert SVN r265702, r265640 Revert the two changes to thread CodeGenOptions into the TargetInfo allocation and to fix the layering violation by moving CodeGenOptions into Basic. Code Generation is arguably not particularly "basic". This addresses Richard's post-commit review comments. This change purely does the mechanical revert and will be followed up with an alternate approach to thread the desired information into TargetInfo. llvm-svn: 265806	2016-04-08 16:52:00 +00:00
Saleem Abdulrasool	94cfc603d1	Basic: move CodeGenOptions from Frontend This is a mechanical move of CodeGenOptions from libFrontend to libBasic. This fixes the layering violation introduced earlier by threading CodeGenOptions into TargetInfo. It should also fix the modules based self-hosting builds. NFC. llvm-svn: 265702	2016-04-07 17:49:44 +00:00
Justin Lebar	d3a44f6885	[CUDA] Add -fcuda-flush-denormals-to-zero. Summary: Setting this flag causes all functions are annotated with the "nvvm-f32ftz" = "true" attribute. In addition, we annotate the module with "nvvm-reflect-ftz" set to 0 or 1, depending on whether -cuda-flush-denormals-to-zero is set. This is read by the NVVMReflect pass. Reviewers: tra, rnk Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18671 llvm-svn: 265435	2016-04-05 18:26:20 +00:00
John McCall	12f2352152	IRGen-level lowering for the Swift calling convention. llvm-svn: 265324	2016-04-04 18:33:08 +00:00
Adam Nemet	1e217bc25f	[PGO] More comments how function pointers for indirect calls are mapped to function names Summary: Hopefully this will make it easier for the next person to figure all this out... Reviewers: bogner, davidxl Subscribers: davidxl, cfe-commits Differential Revision: http://reviews.llvm.org/D18489 llvm-svn: 264681	2016-03-28 22:18:53 +00:00
Roman Levenstein	35aa5cecf2	Add attributes for preserve_mostcc/preserve_allcc calling conventions to the C/C++ front-end Till now, preserve_mostcc/preserve_allcc calling convention attributes were only available at the LLVM IR level. This patch adds attributes for preserve_mostcc/preserve_allcc calling conventions to the C/C++ front-end. The code was mostly written by Juergen Ributzka. I just added support for the AArch64 target and tests. Differential Revision: http://reviews.llvm.org/D18025 llvm-svn: 263647	2016-03-16 18:00:46 +00:00
Mehdi Amini	557c20a886	Remove compile time PreserveName in favor of a runtime cc1 -discard-value-names option Summary: This flag is enabled by default in the driver when NDEBUG is set. It is forwarded on the LLVMContext to discard all value names (but GlobalValue) for performance purpose. This an improved version of D18024 Reviewers: echristo, chandlerc Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18127 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 263394	2016-03-13 21:05:23 +00:00
Eric Christopher	02e3dd4b2e	Temporarily revert these patches: commit 60d9845f6a037122d9be9a6d92d4de617ef45b04 Author: Mehdi Amini <mehdi.amini@apple.com> Date: Fri Mar 11 18:48:02 2016 +0000 Fix clang crash: when CodeGenAction is initialized without a context, use the member and not the parameter From: Mehdi Amini <mehdi.amini@apple.com> git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@263273 91177308-0d34-0410-b5e6-96231b3b80d8 commit af7ce3bf04a75ad5124b457b805df26006bd215b Author: Mehdi Amini <mehdi.amini@apple.com> Date: Fri Mar 11 17:32:58 2016 +0000 Fix build: use -> with pointers and not . Silly typo. From: Mehdi Amini <mehdi.amini@apple.com> git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@263267 91177308-0d34-0410-b5e6-96231b3b80d8 commit d0eea119192814954e7368c77d0dc5a9eeec1fbb Author: Mehdi Amini <mehdi.amini@apple.com> Date: Fri Mar 11 17:15:44 2016 +0000 Remove compile time PreserveName switch based on NDEBUG Summary: Following r263086, we are now relying on a flag on the Context to discard Value names in release builds. Reviewers: chandlerc Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18024 From: Mehdi Amini <mehdi.amini@apple.com> git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@263257 91177308-0d34-0410-b5e6-96231b3b80d8 until we can fix the Release builds. This reverts commits 263257, 263267, 263273 llvm-svn: 263320	2016-03-12 01:47:11 +00:00
Mehdi Amini	e803fc3276	Remove compile time PreserveName switch based on NDEBUG Summary: Following r263086, we are now relying on a flag on the Context to discard Value names in release builds. Reviewers: chandlerc Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18024 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 263257	2016-03-11 17:15:44 +00:00
John McCall	f26e73df75	Add a coerce-and-expand ABIArgInfo as a generalization of some of the things we do with Expand / Direct. NFC for now, but this will be used by swiftcall expansion. llvm-svn: 263192	2016-03-11 04:30:43 +00:00
John McCall	c56a8b3284	Preserve ExtParameterInfos into CGFunctionInfo. As part of this, make the function-arrangement interfaces a little simpler and more semantic. NFC. llvm-svn: 263191	2016-03-11 04:30:31 +00:00
Amjad Aboud	faea560286	Resolved Bug 26414. https://llvm.org/bugs/show_bug.cgi?id=26414 Since interrupt handler must be returned with iret, tail call can't be used. Differential Revision: http://reviews.llvm.org/D17853 llvm-svn: 262830	2016-03-07 14:22:46 +00:00
Justin Lebar	ddd97faeec	[CUDA] Mark all CUDA device-side function defs, decls, and calls as convergent. Summary: This is important for e.g. the following case: void sync() { __syncthreads(); } void foo() { do_something(); sync(); do_something_else(): } Without this change, if the optimizer does not inline sync() (which it won't because __syncthreads is also marked as noduplicate, for now anyway), it is free to perform optimizations on sync() that it would not be able to perform on __syncthreads(), because sync() is not marked as convergent. Similarly, we need a notion of convergent calls, since in the case when we can't statically determine a call's target(s), we need to know whether it's safe to perform optimizations around the call. This change is conservative; the optimizer will remove these attrs where it can, see r260318, r260319. Reviewers: majnemer Subscribers: cfe-commits, jhen, echristo, tra Differential Revision: http://reviews.llvm.org/D17056 llvm-svn: 261779	2016-02-24 21:55:11 +00:00
David Majnemer	971d31be6f	[WinEH] Make sure terminate handlers have funclet operands Calls to the terminate handler must be annotated within the exception region they are within. llvm-svn: 261751	2016-02-24 17:02:45 +00:00
Akira Hatanaka	9d8ac61fec	[CodeGen] Fix an assert in CodeGenFunction::EmitFunctionEpilog The assert is triggered because isObjCRetainableType() is called on the canonicalized return type that has been stripped of the typedefs and attributes attached to it. To fix this assert, this commit gets the original return type from CurCodeDecl or BlockInfo and uses it instead of the canoicalized type. rdar://problem/24470031 Differential Revision: http://reviews.llvm.org/D16914 llvm-svn: 261151	2016-02-17 21:09:50 +00:00
Benjamin Kramer	0bb97746a8	RValue refs do not work that way. llvm-svn: 260823	2016-02-13 16:00:13 +00:00
Richard Smith	c99f11b373	Fix undefined behavior when compiling in C++14 due to sized operator delete being called with the wrong size: convert CGFunctionInfo to use TrailingObjects and ask TrailingObjects to provide a working 'operator delete' for us. llvm-svn: 260181	2016-02-09 01:05:04 +00:00
David Majnemer	3df77bc67c	[WinEH] Annotate calls to __RTtypeid with a funclet bundle Clang's CodeGen has several paths which end up invoking or calling a function. The one that we used for calls to __RTtypeid did not appropriately annotate the call with a funclet bundle. This fixes PR26329. llvm-svn: 258877	2016-01-26 23:14:47 +00:00
Betul Buyukkurt	518276a5fe	Clang changes for value profiling Differential Revision: http://reviews.llvm.org/D8940 llvm-svn: 258650	2016-01-23 22:50:44 +00:00
Sanjay Patel	846b63b436	fix formatting; NFC llvm-svn: 258097	2016-01-18 22:15:33 +00:00
Chad Rosier	7dbc9cf876	[Driver] Add support for -fno-builtin-foo options. Addresses PR4941 and rdar://6756912. http://reviews.llvm.org/D15195 llvm-svn: 256937	2016-01-06 14:35:46 +00:00
David Majnemer	0b17d44faf	[WinEH] Update clang to use operand bundles on call sites This updates clang to use bundle operands to associate an invoke with the funclet which it is contained within. Depends on D15517. Differential Revision: http://reviews.llvm.org/D15518 llvm-svn: 255675	2015-12-15 21:27:59 +00:00
David Majnemer	4e52d6f811	Update clang to use the updated LLVM EH instructions Depends on D15139. Reviewers: rnk Differential Revision: http://reviews.llvm.org/D15140 llvm-svn: 255423	2015-12-12 05:39:21 +00:00
George Burgess IV	3e3bb95b69	Add the `pass_object_size` attribute to clang. `pass_object_size` is our way of enabling `__builtin_object_size` to produce high quality results without requiring inlining to happen everywhere. A link to the design doc for this attribute is available at the Differential review link below. Differential Revision: http://reviews.llvm.org/D13263 llvm-svn: 254554	2015-12-02 21:58:08 +00:00
Samuel Antao	798f11cfb7	Preserve exceptions information during calls code generation. This patch changes the generation of CGFunctionInfo to contain the FunctionProtoType if it is available. This enables the code generation for call instructions to look into this type for exception information and therefore generate better quality IR - it will not create invoke instructions for functions that are know not to throw. llvm-svn: 253926	2015-11-23 22:04:44 +00:00
Akira Hatanaka	7828b1e604	Add support for function attribute 'disable_tail_calls'. The ``disable_tail_calls`` attribute instructs the backend to not perform tail call optimization inside the marked function. For example, int callee(int); int foo(int a) __attribute__((disable_tail_calls)) { return callee(a); // This call is not tail-call optimized. } Note that this attribute is different from 'not_tail_called', which prevents tail-call optimization to the marked function. rdar://problem/8973573 Differential Revision: http://reviews.llvm.org/D12547 llvm-svn: 252986	2015-11-13 00:42:21 +00:00
Eric Christopher	2b90a64e31	Extract out a function onto CodeGenModule for getting the map of features for a particular function, then use it to clean up some code. llvm-svn: 252819	2015-11-11 23:05:08 +00:00
Yaron Keren	30e4515a5f	Replace tab with 8 spaces, NFC. llvm-svn: 252426	2015-11-08 22:01:45 +00:00
Akira Hatanaka	c866762272	Add support for function attribute 'not_tail_called'. This attribute is used to prevent tail-call optimizations to the marked function. For example, in the following piece of code, foo1 will not be tail-call optimized: int __attribute__((not_tail_called)) foo1(int); int foo2(int a) { return foo1(a); // Tail-call optimization is not performed. } The attribute has effect only on statically bound calls. It has no effect on indirect calls. Also, virtual functions and objective-c methods cannot be marked as 'not_tail_called'. rdar://problem/22667622 Differential Revision: http://reviews.llvm.org/D12922 llvm-svn: 252369	2015-11-06 23:56:15 +00:00
Duncan P. N. Exon Smith	9f5260ab13	CodeGen: Remove implicit ilist iterator conversions, NFC Make ilist iterator conversions explicit in clangCodeGen. Eventually I'll remove them everywhere. llvm-svn: 252358	2015-11-06 23:00:41 +00:00
Reid Kleckner	a002bd544c	[WinEH] Mark calls inside cleanups as noinline This works around PR25162. The MSVC tables make it very difficult to correctly inline a C++ destructor that contains try / catch. We've attempted to address PR25162 in LLVM's backend, but it feels pretty infeasible. MSVC and ICC both appear to avoid inlining such complex destructors. Long term, we want to fix this by making the inliner smart enough to know when it is inlining into a cleanup, so it can inline simple destructors (~unique_ptr and ~vector) while avoiding destructors containing try / catch. llvm-svn: 251576	2015-10-28 23:06:42 +00:00
Benjamin Kramer	5b4296af77	Move global classes into anonymous namespaces. NFC. llvm-svn: 251528	2015-10-28 17:16:26 +00:00
John McCall	b04ecb753a	Unify the ObjC entrypoint caches. llvm-svn: 250918	2015-10-21 18:06:43 +00:00
Angel Garcia Gomez	637d1e6694	Roll-back r250822. Summary: It breaks the build for the ASTMatchers Subscribers: klimek, cfe-commits Differential Revision: http://reviews.llvm.org/D13893 llvm-svn: 250827	2015-10-20 13:23:58 +00:00
Angel Garcia Gomez	b5250d3448	Apply modernize-use-default to clang. Summary: Replace empty bodies of default constructors and destructors with '= default'. Reviewers: bkramer, klimek Subscribers: klimek, alexfh, cfe-commits Differential Revision: http://reviews.llvm.org/D13890 llvm-svn: 250822	2015-10-20 12:52:55 +00:00
Eric Christopher	15709991d0	Add an error when calling a builtin that requires features that don't match the feature set of the function that they're being called from. This ensures that we can effectively diagnose some[1] code that would instead ICE in the backend with a failure to select message. Example: __m128d foo(__m128d a, __m128d b) { return __builtin_ia32_addsubps(b, a); } compiled for normal x86_64 via: clang -target x86_64-linux-gnu -c would fail to compile in the back end because the normal subtarget features for x86_64 only include sse2 and the builtin requires sse3. [1] We're still not erroring on: __m128i bar(__m128i const *p) { return _mm_lddqu_si128(p); } where we should fail and error on an always_inline function being inlined into a function that doesn't support the subtarget features required. llvm-svn: 250473	2015-10-15 23:47:11 +00:00
Benjamin Kramer	c2d2b4259c	[CodeGen] Remove dead code. NFC. llvm-svn: 250418	2015-10-15 15:29:40 +00:00
Reid Kleckner	7c2f9e80f7	Don't emit exceptional stackrestore cleanups around inalloca functions The backend restores the stack pointer after recovering from an exception. This is similar to r245879, but it doesn't try to use the normal cleanup mechanism, so hopefully it won't cause the same breakage. llvm-svn: 249640	2015-10-08 00:17:45 +00:00
Charles Davis	c7d5c94f78	Support __builtin_ms_va_list. Summary: This change adds support for `__builtin_ms_va_list`, a GCC extension for variadic `ms_abi` functions. The existing `__builtin_va_list` support is inadequate for this because `va_list` is defined differently in the Win64 ABI vs. the System V/AMD64 ABI. Depends on D1622. Reviewers: rsmith, rnk, rjmccall CC: cfe-commits Differential Revision: http://reviews.llvm.org/D1623 llvm-svn: 247941	2015-09-17 20:55:33 +00:00
Piotr Padlewski	d679d7e924	Generating assumption loads of vptr after ctor call (fixed) Generating call assume(icmp %vtable, %global_vtable) after constructor call for devirtualization purposes. For more info go to: http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html Edit: Fixed version because of PR24479 and other bug caused in chrome. After this patch got reverted because of ScalarEvolution bug (D12719) Merged after John McCall big patch (Added Address). http://reviews.llvm.org/D11859 http://reviews.llvm.org/D12865 llvm-svn: 247646	2015-09-15 00:37:06 +00:00
Akira Hatanaka	aecca041c9	Record function attribute "stackrealign" instead of using backend option -force-align-stack. Also, make changes to the driver so that -mno-stack-realign is no longer an option exposed to the end-user that disallows stack realignment in the backend. Differential Revision: http://reviews.llvm.org/D11815 llvm-svn: 247451	2015-09-11 18:55:09 +00:00
David Majnemer	9df56372e8	[MS ABI] Make member pointers return true for isIncompleteType The type of a member pointer is incomplete if it has no inheritance model. This lets us reuse more general logic already embedded in clang. llvm-svn: 247346	2015-09-10 21:52:00 +00:00
Piotr Padlewski	4bed31b9bf	Revert "Generating assumption loads of vptr after ctor call (fixed)" It seems that there is small bug, and we can't generate assume loads when some virtual functions have internal visibiliy This reverts commit 982bb7d966947812d216489b3c519c9825cacbf2. llvm-svn: 247332	2015-09-10 20:18:30 +00:00
John McCall	9a2c1c9603	Don't crash when emitting a block under returns_nonnull. rdar://22071955 llvm-svn: 247228	2015-09-10 00:57:46 +00:00
Piotr Padlewski	255652e828	Generating assumption loads of vptr after ctor call (fixed) Generating call assume(icmp %vtable, %global_vtable) after constructor call for devirtualization purposes. For more info go to: http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html Edit: Fixed version because of PR24479. After this patch got reverted because of ScalarEvolution bug (D12719) Merged after John McCall big patch (Added Address). http://reviews.llvm.org/D11859 llvm-svn: 247199	2015-09-09 22:20:28 +00:00
David Majnemer	22ee1a7466	[MS ABI] Don't crash on references to pointers to members in args We know that a reference can always be dereferenced. However, we don't always know the number of bytes if the reference's pointee type is incomplete. This case was correctly handled but we didn't consider the case where the type is complete but we cannot calculate its size for ABI specific reasons. In this specific case, a member pointer's size is available only under certain conditions. This fixes PR24703. llvm-svn: 247188	2015-09-09 20:57:59 +00:00
Jakub Kuderski	f50ab0ffce	findDominatingStoreToReturn in CGCall.cpp didn't check if a candidate store instruction used the ReturnValue as pointer operand or value operand. This led to wrong code gen - in later stages (load-store elision code) the found store and its operand would be erased, causing ReturnValue to become a <badref>. The patch adds a check that makes sure that ReturnValue is a pointer operand of store instruction. Regression test is also added. This fixes PR24386. Differential Revision: http://reviews.llvm.org/D12400 llvm-svn: 247003	2015-09-08 10:36:42 +00:00
John McCall	7f416cc426	Compute and preserve alignment more faithfully in IR-generation. Introduce an Address type to bundle a pointer value with an alignment. Introduce APIs on CGBuilderTy to work with Address values. Change core APIs on CGF/CGM to traffic in Address where appropriate. Require alignments to be non-zero. Update a ton of code to compute and propagate alignment information. As part of this, I've promoted CGBuiltin's EmitPointerWithAlignment helper function to CGF and made use of it in a number of places in the expression emitter. The end result is that we should now be significantly more correct when performing operations on objects that are locally known to be under-aligned. Since alignment is not reliably tracked in the type system, there are inherent limits to this, but at least we are no longer confused by standard operations like derived-to-base conversions and array-to-pointer decay. I've also fixed a large number of bugs where we were applying the complete-object alignment to a pointer instead of the non-virtual alignment, although most of these were hidden by the very conservative approach we took with member alignment. Also, because IRGen now reliably asserts on zero alignments, we should no longer be subject to an absurd but frustrating recurring bug where an incomplete type would report a zero alignment and then we'd naively do a alignmentAtOffset on it and emit code using an alignment equal to the largest power-of-two factor of the offset. We should also now be emitting much more aggressive alignment attributes in the presence of over-alignment. In particular, field access now uses alignmentAtOffset instead of min. Several times in this patch, I had to change the existing code-generation pattern in order to more effectively use the Address APIs. For the most part, this seems to be a strict improvement, like doing pointer arithmetic with GEPs instead of ptrtoint. That said, I've tried very hard to not change semantics, but it is likely that I've failed in a few places, for which I apologize. ABIArgInfo now always carries the assumed alignment of indirect and indirect byval arguments. In order to cut down on what was already a dauntingly large patch, I changed the code to never set align attributes in the IR on non-byval indirect arguments. That is, we still generate code which assumes that indirect arguments have the given alignment, but we don't express this information to the backend except where it's semantically required (i.e. on byvals). This is likely a minor regression for those targets that did provide this information, but it'll be trivial to add it back in a later patch. I partially punted on applying this work to CGBuiltin. Please do not add more uses of the CreateDefaultAligned{Load,Store} APIs; they will be going away eventually. llvm-svn: 246985	2015-09-08 08:05:57 +00:00
Eric Christopher	86fdc85181	Migrate the target attribute parsing code to returning an instance every time it's called rather than attempting to cache the result. It's unlikely to be called frequently and the overhead of using it in the first place is already factored out. llvm-svn: 246706	2015-09-02 20:40:12 +00:00
Eric Christopher	bb0cef6e9c	Migrate the target attribute parsing code into an extension off of the main attribute and cache the results so we don't have to parse a single attribute more than once. This reapplies r246596 with a fix for an uninitialized class member, and a couple of cleanups and formatting changes. llvm-svn: 246610	2015-09-02 00:12:02 +00:00
Eric Christopher	bf91fbab3a	Revert "Migrate the target attribute parsing code into an extension off of" This is failing in release mode. Revert while I figure out what's happening. This reverts commit r246596. llvm-svn: 246598	2015-09-01 22:37:03 +00:00
Eric Christopher	21213a7040	Migrate the target attribute parsing code into an extension off of the main attribute and cache the results so we don't have to parse a single attribute more than once. llvm-svn: 246596	2015-09-01 22:03:58 +00:00
Eric Christopher	b57804a134	Use hasAttr, not getAttr if we're just checking for presence. llvm-svn: 246595	2015-09-01 22:03:56 +00:00
Eric Christopher	dec31befef	Revert "Pull the target attribute parsing out of CGCall and onto TargetInfo." This reverts commit r246468 while we figure out what to do about Basic and AST. llvm-svn: 246508	2015-08-31 23:19:55 +00:00
Eric Christopher	d40722e267	Pull the target attribute parsing out of CGCall and onto TargetInfo. Also: - Add a typedef to make working with the result easier. - Update callers to use the new function. - Make initFeatureMap out of line. llvm-svn: 246468	2015-08-31 18:39:22 +00:00
Steven Wu	5528da76ef	Revert r246214 and r246213 These two commits causes llvm LTO bootstrap to hang in ScalarEvolution. llvm-svn: 246282	2015-08-28 07:14:10 +00:00
Eric Christopher	ef1e295a8c	Merge the two feature map setting functions into a single function and replace all callers. llvm-svn: 246259	2015-08-28 02:13:58 +00:00
Eric Christopher	da89d6804c	Use an explicit assignment. llvm-svn: 246225	2015-08-27 22:20:03 +00:00
Piotr Padlewski	525f746710	Generating assumption loads of vptr after ctor call (fixed) Generating call assume(icmp %vtable, %global_vtable) after constructor call for devirtualization purposes. For more info go to: http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html Edit: Fixed version because of PR24479. http://reviews.llvm.org/D11859 llvm-svn: 246213	2015-08-27 21:35:37 +00:00
Eric Christopher	3751bce2a9	Target attribute syntax compatibility fix - gcc uses no- rather than mno-. llvm-svn: 246197	2015-08-27 20:05:48 +00:00
Eric Christopher	3a98b3c1a5	Rewrite the code generation handling for function feature and cpu attributes. A couple of changes here: a) Do less work in the case where we don't have a target attribute on the function. We've already canonicalized the attributes for the function - no need to do more work. b) Use the newer canonicalized feature adding functions from TargetInfo to do the work when we do have a target attribute. This enables us to diagnose some warnings in the case of conflicting written attributes (only ppc does this today) and also make sure to get all of the features for a cpu that's listed rather than just change the cpu. Updated all testcases accordingly and added a new testcase to verify that we'll error out on ppc if we have some incompatible options using the existing diagnosis framework there. llvm-svn: 246195	2015-08-27 19:59:34 +00:00
Nico Weber	8cdb3f90ef	Revert r245879. Speculative, might have caused crbug.com/524604 llvm-svn: 245965	2015-08-25 18:43:32 +00:00
David Majnemer	3cbfb65a52	[MS ABI] Don't emit stackrestore in cleanups The stackrestore intrinsic isn't meaningful inside of a cleanup funclet. llvm-svn: 245879	2015-08-24 21:34:21 +00:00
Piotr Padlewski	fa0e11efdd	Revert "Generating assumption loads of vptr after ctor call (fixed)" Reverting because of 245721 This reverts commit 552658e2b60543c928030b09cc9b5dfcb40c3f28. llvm-svn: 245727	2015-08-21 19:49:41 +00:00
Piotr Padlewski	910a059e42	Generating assumption loads of vptr after ctor call (fixed) Generating call assume(icmp %vtable, %global_vtable) after constructor call for devirtualization purposes. For more info go to: http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html Edit: Fixed version because of PR24479. http://reviews.llvm.org/D11859 llvm-svn: 245721	2015-08-21 18:28:00 +00:00
James Y Knight	7160857da3	Properly provide alignment of 'byval' arguments down to llvm. This is important in the case that the LLVM-inferred llvm-struct alignment is not the same as the clang-known C-struct alignment. Differential Revision: http://reviews.llvm.org/D12243 llvm-svn: 245719	2015-08-21 18:19:06 +00:00
David Blaikie	7e70d6803d	Devirtualize EHScopeStack::Cleanup's dtor because it's never destroyed polymorphically llvm-svn: 245378	2015-08-18 22:40:54 +00:00
Justin Bogner	3c32c83daa	Revert "Generating assumption loads of vptr after ctor call (fixed)" Bootstrap bots were failing: http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto_build/6382/ http://bb.pgr.jp/builders/clang-3stage-i686-linux/builds/2969 This reverts r245264. llvm-svn: 245267	2015-08-18 05:40:20 +00:00
Piotr Padlewski	bc7497abbb	Generating assumption loads of vptr after ctor call (fixed) Generating call assume(icmp %vtable, %global_vtable) after constructor call for devirtualization purposes. For more info go to: http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html Edit: Fixed version because of PR24479. http://reviews.llvm.org/D11859 llvm-svn: 245264	2015-08-18 03:52:00 +00:00
Eric Christopher	83dfb00fd7	Untabify. llvm-svn: 244695	2015-08-11 23:17:31 +00:00
Pete Cooper	57d3f14502	Use llvm::reverse to make a bunch of loops use foreach. NFC. In llvm commit r243581, a reverse range adapter was added which allows us to change code such as for (auto I = Fields.rbegin(), E = Fields.rend(); I != E; ++I) { in to for (const FieldDecl *I : llvm::reverse(Fields)) This commit changes a few of the places in clang which are eligible to use this new adapter. llvm-svn: 243663	2015-07-30 17:22:52 +00:00
David Blaikie	f05779e21c	Pass an iterator range to EmitCallArgs llvm-svn: 242824	2015-07-21 18:37:18 +00:00
David Majnemer	1bf0f8ede6	[MS Compat] Add support for __declspec(noalias) The attribute '__declspec(noalias)' communicates that the function only accesses memory pointed to by its pointer-typed arguments. llvm-svn: 242728	2015-07-20 22:51:52 +00:00
Benjamin Kramer	f48ee4482a	[AST] Cleanup ExprIterator. - Make it a proper random access iterator with a little help from iterator_adaptor_base - Clean up users of magic dereferencing. The iterator should behave like an Expr **. - Make it an implementation detail of Stmt. This allows inlining of the assertions. llvm-svn: 242608	2015-07-18 14:35:53 +00:00
Ulrich Weigand	6e2cea6f0c	Respect alignment when loading up a coerced function argument Code in CGCall.cpp that loads up function arguments that need to be coerced to a different type may in some cases ignore the fact that the source of the argument is not naturally aligned. This may cause incorrect code to be generated. In some places in CreateCoercedLoad, we already have setAlignment calls to address this, but I ran into one where it was missing, causing wrong code generation on SystemZ. However, in that location, we do not actually know what alignment of the source location we can rely on; the callers do not pass anything to this routine. This is already an issue in other places in CreateCoercedLoad; and the same problem exists for CreateCoercedStore. To avoid pessimising code, and to fix the FIXMEs already in place, this patch also adds an alignment argument to the CreateCoerced* routines and uses it instead of forcing an alignment of 1. The callers are changed to pass in the best information they have. This actually requires changes in a number of existing test cases since we now get better alignment in many places. Differential Revision: http://reviews.llvm.org/D11033 llvm-svn: 241898	2015-07-10 11:31:43 +00:00
Eric Christopher	d151addc00	Update target attribute support for post-commit feedback. Use const auto rather than duplicating the type name and fix the error message when the attribute is applied to an incorrect entity. llvm-svn: 241526	2015-07-06 23:52:01 +00:00
Eric Christopher	af4d608d13	Handle arbitrary whitespace in the target attribute support. This allows us to deal a bit more gracefully with inclusions done by macros, token pasting, or just code layout/formatting. llvm-svn: 241525	2015-07-06 23:51:59 +00:00
Akira Hatanaka	85365cd72a	Attach attribute "trap-func-name" to call sites of llvm.trap and llvm.debugtrap. This is needed to use clang's command line option "-ftrap-function" for LTO and enable changing the trap function name on a per-call-site basis. rdar://problem/21225723 Differential Revision: http://reviews.llvm.org/D10831 llvm-svn: 241306	2015-07-02 22:15:41 +00:00
Benjamin Kramer	9c218592c8	[CodeGen] Use llvm::join to simplify string joining. While there replace stable_sort of std::string with just sort, stability is not necessary for "simple" value types. No functional change intended. llvm-svn: 241299	2015-07-02 21:02:39 +00:00
Eric Christopher	2374a7cba8	Use a stable sort to guarantee target feature ordering in the IR in order to make testing somewhat more feasible. Has the advantage of making it easier to find target features as well. llvm-svn: 241134	2015-07-01 01:07:12 +00:00
Eric Christopher	2249b81697	Fix a TODO dealing with canonicalizing attributes on functions by using a string map to canonicalize. Fix up a couple of testcases that needed changing since we are no longer simply appending features to the list, but all of their mask dependencies as well. llvm-svn: 241129	2015-07-01 00:08:29 +00:00
Alexander Kornienko	ab9db51042	Revert r240270 ("Fixed/added namespace ending comments using clang-tidy"). llvm-svn: 240353	2015-06-22 23:07:51 +00:00
Alexander Kornienko	3d9d929e42	Fixed/added namespace ending comments using clang-tidy. NFC The patch is generated using this command: $ tools/extra/clang-tidy/tool/run-clang-tidy.py -fix \ -checks=-,llvm-namespace-comment -header-filter='llvm/.\|clang/.*' \ work/llvm/tools/clang To reduce churn, not touching namespaces spanning less than 10 lines. llvm-svn: 240270	2015-06-22 09:47:44 +00:00
Eric Christopher	2c4555ad1b	Fix "the the" in comments/documentation/etc. llvm-svn: 240110	2015-06-19 01:52:53 +00:00
Alexey Samsonov	1054420ba3	[CGCall] Fix potential invalid iterator decrement in findDominatingStoreToReturnValue. If llvm.lifetime.end turns out to be the first instruction in the last basic block, we can decrement the iterator twice, going past rend. At the moment, this can never happen because llvm.lifetime.end always goes immediately after bitcast, but relying on this is very brittle. llvm-svn: 239638	2015-06-12 21:05:32 +00:00
Eric Christopher	249e3762e5	Handle fpmath= in the target attribute. Right now we're ignoring the fpmath attribute since there's no backend support for a feature like this and to do so would require checking the validity of the strings and doing general subtarget feature parsing of valid and invalid features with the target attribute feature. llvm-svn: 239582	2015-06-12 01:36:00 +00:00
Eric Christopher	4dfe075f93	Handle -mno-<feature> in target attribute strings by replacing the -mno- with a -<feature> to match how we handle this in the rest of the frontend. llvm-svn: 239581	2015-06-12 01:35:58 +00:00
Eric Christopher	64a247b68b	Add support for tune= to the target attribute support by ignoring it. We don't currently support the -mtune option in any useful way so ignoring the annotation is fine. llvm-svn: 239580	2015-06-12 01:35:56 +00:00
Eric Christopher	11acf739f8	Add support for the the target attribute. Modeled after the gcc attribute of the same name, this feature allows source level annotations to correspond to backend code generation. In llvm particular parlance, this allows the adding of subtarget features and changing the cpu for a particular function based on source level hints. This has been added into the existing support for function level attributes without particular verification for any target outside of whether or not the backend will support the features/cpu given (similar to section, etc). llvm-svn: 239579	2015-06-12 01:35:52 +00:00
Akira Hatanaka	262a4c4ec0	Attach attribute "disable-tail-calls" to the functions in the IR. This commit adds back the code that seems to have been dropped unintentionally in r176985. rdar://problem/13752163 Differential Revision: http://reviews.llvm.org/D10100 llvm-svn: 239426	2015-06-09 19:04:36 +00:00
Leny Kholodov	6aab1117e8	[CodeGen] Reuse stack space from unused function results (with more accurate unused result detection) This patch fixes issues with unused result detection which were found in patch http://reviews.llvm.org/D9743. Differential Revision: http://reviews.llvm.org/D10042 llvm-svn: 239294	2015-06-08 10:23:49 +00:00
Nuno Lopes	1ba2d78b9a	ubsan: Check for null pointers given to certain builtins, such as memcpy, memset, memmove, and bzero. Reviewed by: Richard Smith Differential Revision: http://reviews.llvm.org/D9673 llvm-svn: 238657	2015-05-30 16:11:40 +00:00
Petar Jovanovic	1a3f965fe3	[MIPS] Re-land the change r238200 to fix extension of integer types Re-land the change r238200, but with modifications in the tests that should prevent new failures in some environments as reported with the original change on the mailing list. llvm-svn: 238253	2015-05-26 21:07:19 +00:00
Hans Wennborg	74df0df135	Revert r238200: "[MIPS] fix extension of integer types (function calls)" mips-unsigned-ext-var.c and mips-unsigned-extend.c fail in some builds. llvm-svn: 238237	2015-05-26 19:39:54 +00:00
Petar Jovanovic	9aa0f1657f	[MIPS] fix extension of integer types (function calls) On MIPS unsigned int type should not be zero extended but sign-extended. Patch by Strahinja Petrovic. Differential Revision: http://reviews.llvm.org/D9198 llvm-svn: 238200	2015-05-26 13:30:54 +00:00
David Blaikie	43f9bb7371	API update for streamlining of IRBuilder::CreateCall to just use ArrayRef/initializer_list+braced init llvm-svn: 237625	2015-05-18 22:14:03 +00:00
NAKAMURA Takumi	1a6756bba0	Revert r237385, "[CodeGen] Reuse stack space from unused function results" It broke clang stage2, at least tblgen. llvm-svn: 237418	2015-05-15 03:49:05 +00:00
Sergey Dmitrouk	3e96fc08da	[CodeGen] Reuse stack space from unused function results Summary: Space on stack allocated for unused structures returned by functions was unused even when it's lifetime didn't intersect with lifetime of any other objects that could use the same space. The test added also checks for named and auto objects. It seems to make sense to have this all in one place. Reviewers: aadg, rsmith, rjmccall, rnk Reviewed By: rnk Subscribers: asl, cfe-commits Differential Revision: http://reviews.llvm.org/D9743 llvm-svn: 237385	2015-05-14 19:58:03 +00:00
Justin Bogner	f43d8e1cce	InstrProf: This call does nothing, remove it llvm-svn: 236298	2015-05-01 01:02:17 +00:00
Eric Christopher	f37ab1ca73	Always add the target-cpu and target-features sets if they're non-null. This makes sure that the front end is specific about what they're expecting the backend to produce. Update a FIXME with the idea that the target-features could be more precise using backend knowledge. llvm-svn: 235936	2015-04-27 23:11:34 +00:00
David Majnemer	e154456d4a	[MS ABI] Fix the preferred alignment of member pointers Member pointers in the MS ABI have different alignment depending on whether they were created on the stack or live in a record. llvm-svn: 235681	2015-04-24 01:25:05 +00:00
David Majnemer	dc012fa266	Revert "Revert r234581, it might have caused a few miscompiles in Chromium." This reverts commit r234700. It turns out that the lifetime markers were not the cause of Chromium failing but a bug which was uncovered by optimizations exposed by the markers. llvm-svn: 235553	2015-04-22 21:38:15 +00:00
Nico Weber	1c565c31b1	Revert r234581, it might have caused a few miscompiles in Chromium. If the revert helps, I'll get a repro this Monday. Else I'll put the change back in. llvm-svn: 234700	2015-04-11 23:51:38 +00:00
Benjamin Kramer	c19cde119d	Don't rely on implicit CallSite construction. llvm-svn: 234600	2015-04-10 14:49:31 +00:00
Arnaud A. de Grandmaison	047a686d53	Remove threshold for inserting lifetime markers for named temporaries Now that TailRecursionElimination has been fixed with r222354, the threshold on size for lifetime marker insertion can be removed. This only affects named temporary though, as the patch for unnamed temporaries is still in progress. My previous commit (r222993) was not handling debuginfo correctly, but this could only be seen with some asan tests. Basically, lifetime markers are just instrumentation for the compiler's usage and should not affect debug information; however, the cleanup infrastructure was assuming it contained only destructors, i.e. actual code to be executed, and was setting the breakpoint for the end of the function to the closing '}', and not the return statement, in order to show some destructors have been called when leaving the function. This is wrong when the cleanups are only lifetime markers, and this is now fixed. llvm-svn: 234581	2015-04-10 10:13:52 +00:00
David Blaikie	2e80428dc5	clang-format my last commit (sorry, keep forgetting that) llvm-svn: 234129	2015-04-05 22:47:07 +00:00
David Blaikie	1ed728c499	[opaque pointer type] More GEP API migrations Looks like the VTable code in particular will need some work to pass around the pointee type explicitly. llvm-svn: 234128	2015-04-05 22:45:47 +00:00
David Blaikie	17ea266bac	[opaque pointer type] More GEP API migrations llvm-svn: 234109	2015-04-04 21:07:17 +00:00
David Blaikie	5e259a8c6d	[opaque pointer type] Explicitly specify some types for GEP Not all of them (there's still a fallback for this specific function that omits the type parameter) but it's some I bothered to do now. llvm-svn: 234063	2015-04-03 22:54:16 +00:00
Duncan P. N. Exon Smith	2809cc7493	DebugInfo: Use new LLVM API for DebugLoc Use the new API for `DebugLoc` added in r233573 before the old one disappears. llvm-svn: 233589	2015-03-30 20:01:41 +00:00
Eric Christopher	70c1665d83	Reapply r232888 after applying a fix for -msse4 code generation. As a note, any target that uses fake target features via command line options will have similar problems. llvm-svn: 233227	2015-03-25 23:14:47 +00:00
Daniel Jasper	17ae9f0206	Revert "Add CodeGen support for adding cpu attributes on functions based on" This breaks CodeGen for an internal target. I'll get repro instructions to you. llvm-svn: 232930	2015-03-23 05:52:28 +00:00
Eric Christopher	ea00c2a06f	Add CodeGen support for adding cpu attributes on functions based on the target-cpu, if different from the triple's cpu, and target-features as they're written that are passed down from the driver. Together with LLVM r232885 this should allow the LTO'ing of binaries that contain modules compiled with different code generation options on a subset of architectures with full backend support (x86, powerpc, aarch64). llvm-svn: 232888	2015-03-21 06:15:15 +00:00
David Majnemer	37fd66e78b	MS ABI: Generate default constructor closures The MS ABI utilizes a compiler generated function called the "vector constructor iterator" to construct arrays of objects with non-trivial constructors/destructors. For this to work, the constructor must follow a specific calling convention. A thunk must be created if the default constructor has default arguments, is variadic or is otherwise incompatible. This thunk is called the default constructor closure. N.B. Default constructor closures are only generated if the default constructor is exported because clang itself does not utilize vector constructor iterators. Failing to export the default constructor closure will result in link/load failure if a translation unit compiled with MSVC is on the import side. Differential Revision: http://reviews.llvm.org/D8331 llvm-svn: 232229	2015-03-13 22:36:55 +00:00
David Majnemer	dfa6d2067c	MS ABI: Implement copy-ctor closures, finish implementing throw This adds support for copy-constructor closures. These are generated when the C++ runtime has to call a copy-constructor with a particular calling convention or with default arguments substituted in to the call. Because the runtime has no mechanism to call the function with a different calling convention or know-how to evaluate the default arguments at run-time, we create a thunk which will do all the appropriate work and package it in a way the runtime can use. Differential Revision: http://reviews.llvm.org/D8225 llvm-svn: 231952	2015-03-11 18:36:39 +00:00
Mehdi Amini	b3d5209927	Update for LLVM API change: getOrEnforceKnownAlignment() requires a DataLayout From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 231739	2015-03-10 02:36:43 +00:00
Benjamin Kramer	f989042f18	Prefer SmallVector::append/insert over push_back loops. Clang edition. Same functionality, but hoists the vector growth out of the loop. llvm-svn: 229508	2015-02-17 16:48:30 +00:00
Reid Kleckner	11c033e8aa	SEH: Use the SEHTryEpilogueStack instead of a separate bool We don't need a bool to track this now that we have a stack for it. llvm-svn: 228982	2015-02-12 23:40:45 +00:00
Reid Kleckner	a593000f01	Add the 'noinline' attribute to call sites within __try bodies LLVM doesn't support non-call exceptions, so inlining makes it harder to catch such asynchronous exceptions. llvm-svn: 228876	2015-02-11 21:40:48 +00:00
Reid Kleckner	e7b3f7c70d	Emit landing pads for SEH even if nounwind is present Disabling exceptions applies nounwind to lots of functions. SEH catches asynch exceptions, so emit the landing pad anyway. llvm-svn: 228769	2015-02-11 00:00:21 +00:00
David Blaikie	38b2591469	DebugInfo: Refactor default arg handling into a common place (instead of handling in repeatedly for aggregate, complex, and scalar types) llvm-svn: 228591	2015-02-09 19:13:51 +00:00
Benjamin Kramer	0327866a53	CodeGen: Move DebugLocs. It's slightly cheaper than copying it, if the DebugLoc points to replaceable metadata every copy is recorded in a DenseMap, moving reduces the peak size of that map. llvm-svn: 228492	2015-02-07 13:15:54 +00:00
David Majnemer	631a90b6bc	Sema: Add support for __declspec(restrict) __declspec(restrict) and __attribute(malloc) are both handled identically by clang: they are allowed to the noalias LLVM attribute. Seeing as how noalias models the C99 notion of 'restrict', rename the internal clang attribute to Restrict from Malloc. llvm-svn: 228120	2015-02-04 07:23:21 +00:00
Derek Schuff	3970a7ec9b	Remove support for pnaclcall attribute Summary: It was used for interoperability with PNaCl's calling conventions, but it's no longer needed. Also Remove NaCl*ABIInfo which just existed to delegate to either the portable or native ABIInfo, and remove checkCallingConvention which was now a no-op override. Reviewers: jvoung Subscribers: jfb, llvm-commits Differential Revision: http://reviews.llvm.org/D7206 llvm-svn: 227362	2015-01-28 20:24:52 +00:00
David Blaikie	835afb205f	DebugInfo: Remove forced column-info workaround for inlined calls This workaround was to provide unique call sites to ensure LLVM's inline debug info handling would properly unique two calls to the same function on the same line. Instead, this has now been fixed in LLVM (r226736) and the workaround here can be removed. Originally committed in r176895, but this isn't a straight revert due to all the changes since then. I just searched for anything ForcedColumn* related and removed them. We could test this - but it didn't strike me as terribly valuable once we're no longer adding this workaround everything just works as expected & it's no longer a special case to test for. llvm-svn: 226738	2015-01-21 23:08:17 +00:00
Alexander Kornienko	21de0ae3d4	Re-apply "r226548 - Introduce SPIR calling conventions" reverted in r226558. The test was fixed after a discussion with the revision author: the check pattern was made more flexible as the "%call" part is not what we actually want to check strictly there. The original patch description: === Introduce SPIR calling conventions. This implements Section 3.7 from the SPIR 1.2 spec: SPIR kernels should use "spir_kernel" calling convention. Non-kernel functions use "spir_func" calling convention. All other calling conventions are disallowed. The patch works only for OpenCL source. Any other uses will need to ensure that kernels are assigned the spir_kernel calling convention correctly. === llvm-svn: 226561	2015-01-20 11:20:41 +00:00
Alexander Kornienko	22c9d67e34	Reverting r226548 as one of the tests fails in some configurations. Here's the fail log from our internal setup: === .../tools/clang/clang -cc1 -internal-isystem .../tools/clang/staging/include -nostdsysteminc .../tools/clang/test/CodeGenOpenCL/spir-calling-conv.cl -triple spir-unknown-unknown -emit-llvm -o - FileCheck .../tools/clang/test/CodeGenOpenCL/spir-calling-conv.cl .../tools/clang/test/CodeGenOpenCL/spir-calling-conv.cl:11:12: error: expected string not found in input // CHECK: %call = tail call spir_func i32 @get_dummy_id(i32 0) ^ <stdin>:6:52: note: scanning from here define spir_kernel void @foo(i32 addrspace(1)* %A) #0 { ^ <stdin>:7:2: note: possible intended match here %1 = tail call spir_func i32 @get_dummy_id(i32 0) #2 ^ === Here's a failure on a public CI server: http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto_check/1183/ llvm-svn: 226558	2015-01-20 10:55:33 +00:00
Sameer Sahasrabuddhe	450a58b8af	Introduce SPIR calling conventions. This implements Section 3.7 from the SPIR 1.2 spec: SPIR kernels should use "spir_kernel" calling convention. Non-kernel functions use "spir_func" calling convention. All other calling conventions are disallowed. The patch works only for OpenCL source. Any other uses will need to ensure that kernels are assigned the spir_kernel calling convention correctly. llvm-svn: 226548	2015-01-20 06:44:32 +00:00
David Blaikie	7d2a2ac57b	Recommit r225083 (reverted in r225361) now that calls to aggregate initializers from in class non-static data members are explicitly attributed to the desired line. The code setting the debug location being removed here was accidentally leaking a location into the call to the non-static data member's ctor call. Without it the call had no location and could cause assertion failures if it was inlined. Now that it has a location (and a correct one at that) this code should hopefully be no longer needed. It's possible of course that other parts of the debug info are also relying on the debug locations being set here to leak to where they're needed - so we might see the same assertions again & will have to investigate what the dependence was/is. But the chances are good that any of those are debug info line table quality bugs we've just not found yet anyway - so it'll be good to flush them out. llvm-svn: 226383	2015-01-18 00:14:21 +00:00
Nico Weber	eac50037fb	Revert r225085, it caused PR22096. PR22096 has several test cases that assert that look fairly different. I'm adding one of those as an automated test, but when relanding the other cases should probably be checked as well. llvm-svn: 225361	2015-01-07 18:23:08 +00:00
David Blaikie	fcee870c17	DebugInfo: Remove some now-unnecessary location handling around function arguments. r225000 generalized debug info line info handling for expressions such that this code is no longer necessary. This removes the last use of CGDebugInfo::getLocation, but not all the uses of CGDebugInfo::CurLoc, which is still used internally in CGDebugInfo. I'd like to do away with all of that & might succeed after a few more patches. llvm-svn: 225085	2015-01-02 19:49:10 +00:00
Peter Collingbourne	f770683f14	Implement the __builtin_call_with_static_chain GNU extension. The extension has the following syntax: __builtin_call_with_static_chain(Call, Chain) where Call must be a function call expression and Chain must be of pointer type This extension performs a function call Call with a static chain pointer Chain passed to the callee in a designated register. This is useful for calling foreign language functions whose ABI uses static chain pointers (e.g. to implement closures). Differential Revision: http://reviews.llvm.org/D6332 llvm-svn: 224167	2014-12-12 23:41:25 +00:00
Paul Robinson	0855695159	Instead of having -Os/-Oz add OptimizeForSize/MinSize first, and later having OptimizeNone remove them again, just don't add them in the first place if the function already has OptimizeNone. Note that MinSize can still appear due to attributes on different declarations; a future patch will address that. llvm-svn: 224047	2014-12-11 20:14:04 +00:00
Saleem Abdulrasool	32d1a96d69	CodeGen: further simplify assertion Use more of algorithm to simplify the assertion. Pointed out by David Blakie! llvm-svn: 222721	2014-11-25 03:49:50 +00:00
Saleem Abdulrasool	76ecafd523	CodeGen: use a range-based for loop Convert a debug assertion into a range-based loop form. NFC. llvm-svn: 222679	2014-11-24 20:14:26 +00:00
David Blaikie	82e95a3c79	Update for LLVM API change to make Small(Ptr)Set::insert return pair<iterator, bool> as per the C++ standard's associative container concept. llvm-svn: 222335	2014-11-19 07:49:47 +00:00
Alexey Samsonov	e396bfc064	Bundle conditions checked by UBSan with sanitizer kinds they implement. Summary: This change makes CodeGenFunction::EmitCheck() take several conditions that needs to be checked (all of them need to be true), together with sanitizer kinds these checks are for. This would allow to split one call into UBSan runtime into several calls in case different sanitizer kinds would have different recoverability settings. Tests should be fixed accordingly, I'm working on it. Test Plan: regression test suite. Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6219 llvm-svn: 221716	2014-11-11 22:03:54 +00:00
Alexey Samsonov	4c1a96f519	Propagate SanitizerKind into CodeGenFunction::EmitCheck() call. Make sure CodeGenFunction::EmitCheck() knows which sanitizer it emits check for. Make CheckRecoverableKind enum an implementation detail and move it away from header. Currently CheckRecoverableKind is determined by the type of sanitizer ("unreachable" and "return" are unrecoverable, "vptr" is always-recoverable, all the rest are recoverable). This will change in future if we allow to specify which sanitizers are recoverable, and which are not by -fsanitize-recover= flag. No functionality change. llvm-svn: 221635	2014-11-10 22:27:30 +00:00
Alexey Samsonov	edf99a92c0	Introduce a SanitizerKind enum to LangOptions. Use the bitmask to store the set of enabled sanitizers instead of a bitfield. On the negative side, it makes syntax for querying the set of enabled sanitizers a bit more clunky. On the positive side, we will be able to use SanitizerKind to eventually implement the new semantics for -fsanitize-recover= flag, that would allow us to make some sanitizers recoverable, and some non-recoverable. No functionality change. llvm-svn: 221558	2014-11-07 22:29:38 +00:00
Reid Kleckner	80944df6f4	Implement IRGen for the x86 vectorcall convention The most complex aspect of the convention is the handling of homogeneous vector and floating point aggregates. Reuse the homogeneous aggregate classification code that we use on PPC64 and ARM for this. This convention also has a C mangling, and we apparently implement that in both Clang and LLVM. Reviewed By: majnemer Differential Revision: http://reviews.llvm.org/D6063 llvm-svn: 221006	2014-10-31 22:00:51 +00:00
David Majnemer	0c0b6d9ac6	MS ABI: Properly call global delete when invoking virtual destructors Summary: The Itanium ABI approach of using offset-to-top isn't possible with the MS ABI, it doesn't have that kind of information lying around. Instead, we do the following: - Call the virtual deleting destructor with the "don't delete the object flag" set. The virtual deleting destructor will return a pointer to 'this' adjusted to the most derived class. - Call the global delete using the adjusted 'this' pointer. Reviewers: rnk Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D5996 llvm-svn: 220993	2014-10-31 20:09:12 +00:00
Reid Kleckner	e9f6a717dd	Fix ARM HVA classification of classes with non-virtual bases Reuse the PPC64 HVA detection algorithm for ARM and AArch64. This is a nice code deduplication, since they are roughly identical. A few virtual method extension points are needed to understand how big an HVA can be and what element types it can have for a given architecture. Also make the record expansion code work in the presence of non-virtual bases. Reviewed By: uweigand, asl Differential Revision: http://reviews.llvm.org/D6045 llvm-svn: 220972	2014-10-31 17:10:41 +00:00
Alexey Samsonov	035462c1cf	Get rid of SanitizerOptions::Disabled global. NFC. SanitizerOptions is not even a POD now, so having global variable of this type, is not nice. Instead, provide a regular constructor and clear() method, and let each CodeGenFunction has its own copy of SanitizerOptions it uses. llvm-svn: 220920	2014-10-30 19:33:44 +00:00
Reid Kleckner	d7857f05f4	Add frontend support for __vectorcall Wire it through everywhere we have support for fastcall, essentially. This allows us to parse the MSVC "14" CTP headers, but we will miscompile them because LLVM doesn't support __vectorcall yet. Reviewed By: Aaron Ballman Differential Revision: http://reviews.llvm.org/D5808 llvm-svn: 220573	2014-10-24 17:42:17 +00:00
Reid Kleckner	79b0fd7a48	Promote null pointer constants used as arguments to variadic functions Make it possible to pass NULL through variadic functions on 64-bit Windows targets. The Visual C++ headers define NULL to 0, when they should define it to 0LL on Win64 so that NULL is a pointer-sized integer. Fixes PR20949. Reviewers: thakis, rsmith Differential Revision: http://reviews.llvm.org/D5480 llvm-svn: 219456	2014-10-10 00:05:45 +00:00
Hal Finkel	1b0d24e03a	Initial support for the align_value attribute This adds support for the align_value attribute. This attribute is supported by Intel's compiler (versions 14.0+), and several of my HPC users have requested support in Clang. It specifies an alignment assumption on the values to which a pointer points, and is used by numerical libraries to encourage efficient generation of vector code. Of course, we already have an aligned attribute that can specify enhanced alignment for a type, so why is this additional attribute important? The problem is that if you want to specify that an input array of T is, say, 64-byte aligned, you could try this: typedef double aligned_double attribute((aligned(64))); void foo(aligned_double P) { double x = P[0]; // This is fine. double y = P[1]; // What alignment did those doubles have again? } the access here to P[1] causes problems. P was specified as a pointer to type aligned_double, and any object of type aligned_double must be 64-byte aligned. But if P[0] is 64-byte aligned, then P[1] cannot be, and this access causes undefined behavior. Getting round this problem requires a lot of awkward casting and hand-unrolling of loops, all of which is bad. With the align_value attribute, we can accomplish what we'd like in a well defined way: typedef double aligned_double_ptr attribute((align_value(64))); void foo(aligned_double_ptr P) { double x = P[0]; // This is fine. double y = P[1]; // This is fine too. } This attribute does not create a new type (and so it not part of the type system), and so will only "propagate" through templates, auto, etc. by optimizer deduction after inlining. This seems consistent with Intel's implementation (thanks to Alexey for confirming the various Intel-compiler behaviors). As a final note, I would have chosen to call this aligned_value, not align_value, for better naming consistency with the aligned attribute, but I think it would be more useful to users to adopt Intel's name. llvm-svn: 218910	2014-10-02 21:21:25 +00:00
Alexey Samsonov	153004f220	Use ClangToLLVMArgsMapping in CodeGenTypes::GetFunctionType(). NFC. This is the last piece of CGCall code that had implicit assumptions about the order in which Clang arguments are translated to LLVM ones (positions of inalloca argument, sret, this, padding arguments etc.) Now all of this data is encapsulated in ClangToLLVMArgsMapping. If this information would be required somewhere else, this class can be moved to a separate header or pulled into CGFunctionInfo. llvm-svn: 218634	2014-09-29 22:08:00 +00:00
Alexey Samsonov	34625dda07	Introduce CGFunctionInfo::getNumRequiredArgs(). NFC. Save the callers from necessity to special-case on variadic functions. llvm-svn: 218625	2014-09-29 21:21:48 +00:00
Alexey Samsonov	52c0f6adb6	Speedup ClangToLLVMArgMapping construction. NFC. Add a method to calculate the number of arguments given QualType expnads to. Use this method in ClangToLLVMArgMapping calculation. This number may be cached in CodeGenTypes for efficiency, if needed. llvm-svn: 218623	2014-09-29 20:30:22 +00:00
Alexey Samsonov	8a0bad0bfc	Refactor ABIArgInfo::Expand implementation (NFC). Hoist the logic which determines the way QualType is expanded into a separate method. Remove a bunch of copy-paste and simplify getTypesFromArgs() / ExpandTypeFromArgs() / ExpandTypeToArgs() methods. llvm-svn: 218615	2014-09-29 18:41:28 +00:00
Hal Finkel	ee90a223ea	Support the assume_aligned function attribute In addition to __builtin_assume_aligned, GCC also supports an assume_aligned attribute which specifies the alignment (and optional offset) of a function's return value. Here we implement support for the assume_aligned attribute by making use of the @llvm.assume intrinsic. llvm-svn: 218500	2014-09-26 05:04:30 +00:00
Alexey Samsonov	90452df7b1	Report source location of returns_nonnull attribute in UBSan reports. llvm-svn: 217400	2014-09-08 20:17:19 +00:00
Alexey Samsonov	8e1162c71d	Implement nonnull-attribute sanitizer Summary: This patch implements a new UBSan check, which verifies that function arguments declared to be nonnull with __attribute__((nonnull)) are actually nonnull in runtime. To implement this check, we pass FunctionDecl to CodeGenFunction::EmitCallArgs (where applicable) and if function declaration has nonnull attribute specified for a certain formal parameter, we compare the corresponding RValue to null as soon as it's calculated. Test Plan: regression test suite Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits, rnk Differential Revision: http://reviews.llvm.org/D5082 llvm-svn: 217389	2014-09-08 17:22:45 +00:00
Rafael Espindola	8d2a19b478	Handle constructors and destructors a bit more uniformly in CodeGen. There were code paths that are duplicated for constructors and destructors just because we have both CXXCtorType and CXXDtorsTypes. This patch introduces an unified enum and reduces code deplication a bit. llvm-svn: 217383	2014-09-08 16:01:27 +00:00
Hans Wennborg	d71907dd07	Don't emit prologues or epilogues for naked functions (PR18791, PR20028) For naked functions with parameters, Clang would still emit stores in the prologue that would clobber the stack, because LLVM doesn't set up a stack frame. (This shows up in -O0 compiles, because the stores are optimized away otherwise.) For example: __attribute__((naked)) int f(int x) { asm("movl $42, %eax"); asm("retl"); } Would result in: _Z1fi: movl 12(%esp), %eax movl %eax, (%esp) <--- Oops. movl $42, %eax retl Differential Revision: http://reviews.llvm.org/D5183 llvm-svn: 217198	2014-09-04 22:16:33 +00:00
Reid Kleckner	c34735148f	Make all virtual member pointers use variadic musttail calls This avoids encoding information about the function prototype into the thunk at the cost of some function prototype bitcast gymnastics. Fixes PR20653. llvm-svn: 216782	2014-08-29 21:43:29 +00:00
James Molloy	90d6101410	Use store size instead of alloc size when coercing. Previously, EnterStructPointerForCoercedAccess used Alloc size when determining how to convert. This was problematic, because there were situations were the alloc size was larger than the store size. For example, if the first element of a structure were i24 and the destination type were i32, the old code would generate a GEP and a load i24. The code should compare store sizes to ensure the whole object is loaded. I have attached a test case. This patch modifies the output of arm64-be-bitfield.c test case, but the new IR seems to be equivalent, and after -O3, the compiler generates identical ARM assembly. (asr x0, x0, #54) Patch by Thomas Jablin! llvm-svn: 216722	2014-08-29 10:17:52 +00:00
Alexey Samsonov	9fc9bf83a8	Properly handle multiple nonnull attributes in CodeGen llvm-svn: 216638	2014-08-28 00:53:20 +00:00
Richard Smith	00cc1c09c3	Fix regression in r216520: don't apply nonnull to non-pointer function parameters in the IR. llvm-svn: 216574	2014-08-27 18:56:18 +00:00
Oliver Stannard	2bfdc5b517	Move some ARM-specific code from CGCall.cpp to TargetInfo.cpp This tidies up some ARM-specific code added by r208417 to move it out of the target-independent parts of clang into TargetInfo.cpp. This also has the advantage that we can now flatten struct arguments to variadic AAPCS functions. llvm-svn: 216535	2014-08-27 10:43:15 +00:00
Craig Topper	5fc8fc2d31	Simplify creation of a bunch of ArrayRefs by using None, makeArrayRef or just letting them be implicitly created. llvm-svn: 216528	2014-08-27 06:28:36 +00:00
Alexey Samsonov	91cf455af1	CGCall: Factor out the logic mapping call arguments to LLVM IR arguments. Summary: This refactoring introduces ClangToLLVMArgMapping class, which encapsulates the information about the order in which function arguments listed in CGFunctionInfo should be passed to actual LLVM IR function, such as: 1) positions of sret, if there is any 2) position of inalloca argument, if there is any 3) position of helper padding argument for each call argument 4) positions of regular argument (there can be many if it's expanded). Simplify several related methods (ConstructAttributeList, EmitFunctionProlog and EmitCall): now they don't have to maintain iterators over the list of LLVM IR function arguments, dealing with all the sret/inalloca/this complexities, and just use expected positions of LLVM IR arguments stored in ClangToLLVMArgMapping. This may increase the running time of EmitFunctionProlog, as we have to traverse expandable arguments twice, but in further refactoring we will be able to speed up EmitCall by passing already calculated CallArgsToIRArgsMapping to ConstructAttributeList, thus avoiding traversing expandable argument there. No functionality change. Test Plan: regression test suite Reviewers: majnemer, rnk Reviewed By: rnk Subscribers: cfe-commits, rjmccall, timurrrr Differential Revision: http://reviews.llvm.org/D4938 llvm-svn: 216251	2014-08-22 01:06:06 +00:00
Alexey Samsonov	e5ef3ca932	Simplify some CodeGenTypes::arrangeXXX functions. No functionality change llvm-svn: 215606	2014-08-13 23:55:54 +00:00
Alexey Samsonov	3551e311f0	Simplify a few loops over CallArgList/FunctionArgList. NFC llvm-svn: 215571	2014-08-13 20:06:24 +00:00
Alexey Samsonov	de443c5002	[UBSan] Add returns-nonnull sanitizer. Summary: This patch adds a runtime check verifying that functions annotated with "returns_nonnull" attribute do in fact return nonnull pointers. It is based on suggestion by Jakub Jelinek: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20140623/223693.html. Test Plan: regression test suite Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D4849 llvm-svn: 215485	2014-08-13 00:26:40 +00:00
Reid Kleckner	ab2090d107	MS ABI: Use musttail for vtable thunks that pass arguments by value This moves some memptr specific code into the generic thunk emission codepath. Fixes PR20053. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D4613 llvm-svn: 214004	2014-07-26 01:34:32 +00:00
Hal Finkel	16e394a36c	Cleanup comparisons to VariableArrayType::Static for non-VLAs The enum is part of ArrayType, so there is no functional change, but comparing to ArrayType::Static for non-VLAs makes more sense. llvm-svn: 213446	2014-07-19 02:13:40 +00:00
Hal Finkel	48d53e2c4c	Use the dereferenceable attribute on C99 array parameters with static In C99, an array parameter declarator might have the form: direct-declarator '[' 'static' type-qual-list[opt] assign-expr ']' where the static keyword indicates that the caller will always provide a pointer to the beginning of an array with at least the number of elements specified by the assignment expression. For constant sizes, we can use the new dereferenceable attribute to pass this information to the optimizer. For VLAs, we don't know the size, but (for addrspace(0)) do know that the pointer must be nonnull (and so we can use the nonnull attribute). llvm-svn: 213444	2014-07-19 01:41:07 +00:00
Hal Finkel	a2347baaec	Mark C++ reference parameters as dereferenceable Because references must be initialized using some evaluated expression, they must point to something, and a callee can assume the reference parameter is dereferenceable. Taking advantage of a new attribute just added to LLVM, mark them as such. Because dereferenceability in addrspace(0) implies nonnull in the backend, we don't need both attributes. However, we need to know the size of the object to use the dereferenceable attribute, so for incomplete types we still emit only nonnull. llvm-svn: 213386	2014-07-18 15:52:10 +00:00
Hal Finkel	d8442b1b21	Add nonnull in CodeGen for __attribute__((returns_nonnull)) As a follow-up to r212835, also add the LLVM nonnull function attribute when __attribute__((returns_nonnull)) is provided. llvm-svn: 212874	2014-07-12 04:51:04 +00:00
Hal Finkel	82504f03ce	Add nonnull in CodeGen for __attribute__((nonnull)) We now have an LLVM-level nonnull attribute that can be applied to function parameters, and we emit it for reference types (as of r209723), but did not emit it when an __attribute__((nonnull)) was provided. Now we will. llvm-svn: 212835	2014-07-11 17:35:21 +00:00
Reid Kleckner	afba553ede	MS ABI: "Fix" passing non-POD structs by value to variadic functions Of course, such code is horribly broken and will explode on impact. That said, ATL does it, and we have to support them, at least a little bit. Fixes PR20191. llvm-svn: 212508	2014-07-08 02:24:27 +00:00
Nick Lewycky	9b46eb8112	Add 'nonnull' parameter or return attribute when producing an llvm pointer type in a function type where the C++ type is a reference. Update the tests. llvm-svn: 209723	2014-05-28 09:56:42 +00:00
Craig Topper	8a13c4180e	[C++11] Use 'nullptr'. CodeGen edition. llvm-svn: 209272	2014-05-21 05:09:00 +00:00
Peter Collingbourne	41af7c2fdc	Implement the flatten attribute. This is a GNU attribute that causes calls within the attributed function to be inlined where possible. It is implemented by giving such calls the alwaysinline attribute. Differential Revision: http://reviews.llvm.org/D3816 llvm-svn: 209217	2014-05-20 17:12:51 +00:00
Peter Collingbourne	b4728c12e8	Implement the no_split_stack attribute. This is a GNU attribute that allows split stacks to be turned off on a per-function basis. Differential Revision: http://reviews.llvm.org/D3817 llvm-svn: 209167	2014-05-19 22:14:34 +00:00
Reid Kleckner	966abe7614	MS ABI: Use musttail for thunk IR generation This allows us to perfectly forward non-trivial arguments that use inalloca. We still can't forward non-trivial arguments through thunks when we have a covariant return type with a non-trivial adjustment. This would require emitting an extra copy, which is non-conforming anyway. llvm-svn: 208927	2014-05-15 23:01:46 +00:00
Reid Kleckner	37abaca3c2	MS ABI: Pass 'sret' as the second parameter of instance methods Summary: MSVC always passes 'sret' after 'this', unlike GCC. This required changing a number of places in Clang that assumed the sret parameter was always first in LLVM IR. This fixes win64 MSVC ABI compatibility for methods returning structs. Reviewers: rsmith, majnemer Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D3618 llvm-svn: 208458	2014-05-09 22:46:15 +00:00
James Molloy	6f244b6f78	Reapply r208417 (olista01 'ARM: HFAs must be passed in consecutive registers'). Bots are now pacified. llvm-svn: 208425	2014-05-09 16:21:39 +00:00
James Molloy	1aa0d5f3b2	Revert r208417 (olista01 'ARM: HFAs must be passed in consecutive registers'). This is a followon commit from r208413 which broke the LLVM bots. llvm-svn: 208422	2014-05-09 16:17:09 +00:00
Oliver Stannard	19f3b4f2ce	ARM: HFAs must be passed in consecutive registers This is the clang counterpart to 208413, which ensures that Homogeneous Floating-point Aggregates are passed in consecutive registers on ARM. llvm-svn: 208417	2014-05-09 15:14:56 +00:00
James Molloy	491cefbe7a	When doing int<->ptr coercion for big-endian, calculate the shift amount correctly. Previously we calculated the shift amount based upon DataLayout::getTypeAllocSizeInBits. This will only work for legal types - types such as i24 that are created as part of structs for bitfields will return "32" from that function. Change to using getTypeSizeInBits. It turns out that AArch64 didn't run across this problem because it always returned [1 x i64] as the type for a bitfield, whereas ARM64 returns i64 so goes down this (better, but wrong) codepath. llvm-svn: 208231	2014-05-07 17:41:15 +00:00
Reid Kleckner	e39ee21551	MS ABI x64: Pass small objects with dtors but no copy ctors directly Passing objects directly (in registers or memory) creates a second copy of the object in the callee. The callee always destroys its copy, but we also have to destroy any temporary created in the caller. In other words, copy elision of these kinds of objects is impossible. Objects larger than 8 bytes with non-trivial dtors and trivial copy ctors are still passed indirectly, and we can still elide copies of them. Fixes PR19640. llvm-svn: 207889	2014-05-03 00:33:28 +00:00
Reid Kleckner	ac64060c80	MS ABI x64: Don't destroy arguments twice on x64 We were destroying them in the callee, and then again in the caller. We should use an EH-only cleanup and disable it at the point of the call for win64, even though we don't use inalloca. llvm-svn: 207733	2014-05-01 03:07:18 +00:00
Reid Kleckner	fb873af67e	Update Clang for LLVM split stack API changes in r205997 Patch by Alex Crichton! llvm-svn: 205998	2014-04-10 22:59:13 +00:00
Reid Kleckner	9df1d975b8	Avoid crashing when failing to emit a thunk If we crash, we raise a crash handler dialog, and that's really annoying. Even though we can't emit correct IR until we have musttail, don't crash. llvm-svn: 205948	2014-04-10 01:40:15 +00:00
David Majnemer	32b57b0a4c	MS ABI: Use the proper type for inalloca args Summary: The definition of a type later in a translation unit may change it's type from {}* to (%struct.foo). Earlier function definitions may use the former while more recent definitions might use the later. This is fine until they interact with one another (like one calling the other). In these cases, a bitcast is needed because the inalloca must match the function call but the store to the lvalue which initializes the argument slot has to match the rvalue's type. This technique is along the same lines with what the other, non-inalloca, codepaths perform. This fixes PR19287. Reviewers: rnk CC: cfe-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3224 llvm-svn: 205217	2014-03-31 16:12:47 +00:00
Tim Northover	e77cc39aff	ObjC: allow targets to decide when to use stret for blocks. This was originally part of the ARM64 patch, but seems semantically separate. llvm-svn: 205097	2014-03-29 13:28:05 +00:00
Aaron Ballman	ec47bc2bae	[C++11] Replacing CGFunctionInfo arg iterators with iterator_range arguments(). Updating all of the usages of the iterators with range-based for loops. llvm-svn: 204068	2014-03-17 18:10:01 +00:00
Aaron Ballman	36a7fa80ab	[C++11] Replacing CallArgList writeback iterators with iterator_range writebacks(). Updating all of the usages of the iterators with range-based for loops, and removing the no-longer-needed iterator versions. llvm-svn: 204062	2014-03-17 17:22:27 +00:00
Craig Topper	4f12f10de4	[C++11] Add 'override' keyword to virtual methods that override their base class. llvm-svn: 203643	2014-03-12 06:41:41 +00:00
Chandler Carruth	4d01fff492	[C++11] Update Clang for the change to LLVM's Use-Def chain iterators in r203364: what was use_iterator is now user_iterator, and there is a use_iterator for directly iterating over the uses. This also switches to use the range-based APIs where appropriate. llvm-svn: 203365	2014-03-09 03:16:50 +00:00
Aaron Ballman	e8a8baef44	[C++11] Replacing RecordDecl iterators field_begin() and field_end() with iterator_range fields(). Updating all of the usages of the iterators with range-based for loops. llvm-svn: 203355	2014-03-08 20:12:42 +00:00
Aaron Ballman	43b68bebe7	[C++11] Replacing ObjCMethodDecl iterators param_begin() and param_end() with iterator_range params(). Updating all of the usages of the iterators with range-based for loops. llvm-svn: 203255	2014-03-07 17:50:17 +00:00
Chandler Carruth	c80ceea90b	[Modules] Update to reflect the move of CallSite into the IR library in LLVM r202816. llvm-svn: 202817	2014-03-04 11:02:08 +00:00
Reid Kleckner	fab1e89de9	MS ABI: Return sret parameters when using inalloca Previously the X86 backend would look for the sret attribute and handle this for us. inalloca takes that all away, so we have to do the return ourselves now. llvm-svn: 202097	2014-02-25 00:59:14 +00:00
Aaron Ballman	7c19ab17c7	Exposing the noduplicate attribute within Clang, which marks functions so that the optimizer does not duplicate code. Patch thanks to Marcello Maggioni! llvm-svn: 201941	2014-02-22 16:59:24 +00:00
Reid Kleckner	8ae1627733	Remove local type use in template. llvm-svn: 200598	2014-02-01 00:23:22 +00:00
Reid Kleckner	314ef7bafd	[ms-cxxabi] Use inalloca on win32 when passing non-trivial C++ objects When a non-trivial parameter is present, clang now gathers up all the parameters that lack inreg and puts them into a packed struct. MSVC always aligns each parameter to 4 bytes and no more, so this is a pretty simple struct to lay out. On win64, non-trivial records are passed indirectly. Prior to this change, clang was incorrectly using byval on win64. I'm able to self-host a working clang with this change and additional LLVM patches. Reviewers: rsmith Differential Revision: http://llvm-reviews.chandlerc.com/D2636 llvm-svn: 200597	2014-02-01 00:04:45 +00:00
Reid Kleckner	4982b82b73	[ms-cxxabi] Use x86_cdeclmethodcc for __cdecl methods on win32 This fixes PR15768, where the sret parameter and the 'this' parameter are in the wrong order. Instance methods compiled by MSVC never return records in registers, they always return indirectly through an sret pointer. That sret pointer always comes after the 'this' parameter, for both __cdecl and __thiscall methods. Unfortunately, the same is true for other calling conventions, so we'll have to change the overall approach here relatively soon. Reviewers: rsmith Differential Revision: http://llvm-reviews.chandlerc.com/D2664 llvm-svn: 200587	2014-01-31 22:54:50 +00:00
Alp Toker	314cc81b8c	Rename getResultType() on function and method declarations to getReturnType() A return type is the declared or deduced part of the function type specified in the declaration. A result type is the (potentially adjusted) type of the value of an expression that calls the function. Rule of thumb: * Declarations have return types and parameters. * Expressions have result types and arguments. llvm-svn: 200082	2014-01-25 16:55:45 +00:00
Alp Toker	9cacbabd33	Rename FunctionProtoType accessors from 'arguments' to 'parameters' Fix a perennial source of confusion in the clang type system: Declarations and function prototypes have parameters to which arguments are supplied, so calling these 'arguments' was a stretch even in C mode, let alone C++ where default arguments, templates and overloading make the distinction important to get right. Readability win across the board, especially in the casting, ADL and overloading implementations which make a lot more sense at a glance now. Will keep an eye on the builders and update dependent projects shortly. No functional change. llvm-svn: 199686	2014-01-20 20:26:09 +00:00
Justin Bogner	06bd6d04e0	CodeGen: Introduce CodeGenPGO::setCurrentRegionUnreachable There are a number of places where we do PGO.setCurrentRegionCount(0) directly after an unconditional branch. Give this operation a name so that it's clearer why we're doing this. llvm-svn: 199138	2014-01-13 21:24:18 +00:00
Justin Bogner	ef512b9929	CodeGen: Initial instrumentation based PGO implementation llvm-svn: 198640	2014-01-06 22:27:43 +00:00
Aaron Ballman	0362a6d466	Implement the MSABI and SysVABI calling conventions for Objective-C method declarations. This appears to be an omission from r189644. llvm-svn: 197584	2013-12-18 16:23:37 +00:00
Reid Kleckner	89077a1b00	[ms-cxxabi] The 'most derived' ctor parameter usually comes last Unlike Itanium's VTTs, the 'most derived' boolean or bitfield is the last parameter for non-variadic constructors, rather than the second. For variadic constructors, the 'most derived' parameter comes after the 'this' parameter. This affects constructor calls and constructor decls in a variety of places. Reviewers: timurrrr Differential Revision: http://llvm-reviews.chandlerc.com/D2405 llvm-svn: 197518	2013-12-17 19:46:40 +00:00
Reid Kleckner	739756c0f9	[ms-cxxabi] Construct and destroy call arguments in the correct order Summary: MSVC destroys arguments in the callee from left to right. Because C++ objects have to be destroyed in the reverse order of construction, Clang has to construct arguments from right to left and destroy arguments from left to right. This patch fixes the ordering by reversing the order of evaluation of all call arguments under the MS C++ ABI. Fixes PR18035. Reviewers: rsmith Differential Revision: http://llvm-reviews.chandlerc.com/D2275 llvm-svn: 196402	2013-12-04 19:23:12 +00:00
Mark Lacey	a8e7df3602	Add CodeGenABITypes.h for use in LLDB. CodeGenABITypes is a wrapper built on top of CodeGenModule that exposes some of the functionality of CodeGenTypes (held by CodeGenModule), specifically methods that determine the LLVM types appropriate for function argument and return values. I addition to CodeGenABITypes.h, CGFunctionInfo.h is introduced, and the definitions of ABIArgInfo, RequiredArgs, and CGFunctionInfo are moved into this new header from the private headers ABIInfo.h and CGCall.h. Exposing this functionality is one part of making it possible for LLDB to determine the actual ABI locations of function arguments and return values, making it possible for it to determine this for any supported target without hard-coding ABI knowledge in the LLDB code. llvm-svn: 193717	2013-10-30 21:53:58 +00:00
Mark Lacey	2345575db3	Make CodeGenTypes data members private. No functionality differences. llvm-svn: 192390	2013-10-10 20:57:00 +00:00
Mark Lacey	5ea993bb59	Use the CGCXXABI member on CodeGenTypes. CodeGenTypes already has a reference to a CGCXXABI. Use this directly rather than going through CodeGenModule to get to the same information. This is consistent with other references to CGCXXABI in CodeGenTypes functions defined in CGCall.cpp. llvm-svn: 191854	2013-10-02 20:35:23 +00:00
Nick Lewycky	2d84e84236	Thread a SourceLocation into the EmitCheck for "load_invalid_value". This occurs when scalars are loaded / undergo lvalue-to-rvalue conversion. llvm-svn: 191808	2013-10-02 02:29:49 +00:00
Nick Lewycky	5fa40c3b9e	No functionality change. Reflow lines that could fit on one line. Break lines that had 80-column violations. Remove spurious emacs mode markers on .cpp files. llvm-svn: 191797	2013-10-01 21:51:38 +00:00
Benjamin Kramer	60509af49a	Fix constructor-related typos. Noticed by Roman Divacky. llvm-svn: 190311	2013-09-09 14:48:42 +00:00
Charles Davis	b5a214e4f3	Add ms_abi and sysv_abi attribute handling. Based on a patch by Benno Rice! llvm-svn: 189644	2013-08-30 04:39:01 +00:00
Reid Kleckner	78af0708b7	Delete CC_Default and use the target default CC everywhere Summary: Makes functions with implicit calling convention compatible with function types with a matching explicit calling convention. This fixes things like calls to qsort(), which has an explicit __cdecl attribute on the comparator in Windows headers. Clang will now infer the calling convention from the declarator. There are two cases when the CC must be adjusted during redeclaration: 1. When defining a non-inline static method. 2. When redeclaring a function with an implicit or mismatched convention. Fixes PR13457, and allows clang to compile CommandLine.cpp for the Microsoft C++ ABI. Excellent test cases provided by Alexander Zinenko! Reviewers: rsmith Differential Revision: http://llvm-reviews.chandlerc.com/D1231 llvm-svn: 189412	2013-08-27 23:08:25 +00:00
Bill Wendling	17d1b61480	Only add this attribute when it's set. If it's not there, the assumption is that it's off. llvm-svn: 189064	2013-08-22 21:16:51 +00:00
Timur Iskhodzhanov	88fd439a24	Abstract out virtual calls and virtual function prologue code generation; implement them for -cxx-abi microsoft llvm-svn: 188870	2013-08-21 06:25:03 +00:00
Bill Wendling	d8f4950862	Use function attributes to indicate if we don't want to realign the stack. llvm-svn: 187617	2013-08-01 21:41:02 +00:00
Bill Wendling	f69f594512	Use the new boolean to StringRef function to generate the proper StringRefs. llvm-svn: 187251	2013-07-26 21:51:11 +00:00
Bill Wendling	a9cc8c0385	Replace the "NoFramePointerElimNonLeaf" target option with a function attribute. llvm-svn: 187092	2013-07-25 00:32:41 +00:00
Bill Wendling	b321972fdf	Use the updated name for the attribute. llvm-svn: 186864	2013-07-22 20:15:41 +00:00
Bill Wendling	021c8ded04	Use function attributes to pass along the stack protector buffer size instead of making it a target option. llvm-svn: 186218	2013-07-12 22:26:07 +00:00

... 5 6 7 8 9 ...

1073 Commits