teak-llvm

mirror of https://github.com/Gericom/teak-llvm.git synced 2025-06-23 05:25:50 -04:00

Author	SHA1	Message	Date
Nico Weber	f2a39a7b4e	Revert r234786, it contained a bunch of stuff I did not mean to commit. llvm-svn: 234787	2015-04-13 20:03:03 +00:00
Nico Weber	b31abb05fb	Revert r234613 (and follow-ups 234614, 234616, 234618), it caused PR23216. llvm-svn: 234786	2015-04-13 20:01:20 +00:00
Nico Weber	1c565c31b1	Revert r234581, it might have caused a few miscompiles in Chromium. If the revert helps, I'll get a repro this Monday. Else I'll put the change back in. llvm-svn: 234700	2015-04-11 23:51:38 +00:00
Reid Kleckner	11859afd5f	[SEH] Re-land r234532, but use internal linkage for all SEH helpers Even though these symbols are in a comdat group, the Microsoft linker really wants them to have internal linkage. I'm planning to tweak the mangling in a follow-up change. This is a straight revert with a 1-line fix. llvm-svn: 234613	2015-04-10 17:34:52 +00:00
Alexey Bataev	794ba0dcb7	[OPENMP] Codegen for 'reduction' clause in 'parallel' directive. Emit code for the reduction clause. The following code should be emitted for reductions: static kmp_critical_name lock = { 0 }; void reduce_func(void lhs[<n>], void rhs[<n>]) { ... (Type<i> )lhs[i] = RedOp<i>((Type<i> )lhs[i], (Type<i> )rhs[i]); ... } ... void RedList[<n>] = {&<RHSExprs>[0], ..., &<RHSExprs>[<n> - 1]}; switch (__kmpc_reduce{_nowait}(<loc>, <gtid>, <n>, sizeof(RedList), RedList, reduce_func, &<lock>)) { case 1: ... <LHSExprs>[i] = RedOp<i>(<LHSExprs>[i], <RHSExprs>[i]); ... __kmpc_end_reduce{_nowait}(<loc>, <gtid>, &<lock>); break; case 2: ... Atomic(<LHSExprs>[i] = RedOp<i>(<LHSExprs>[i], *<RHSExprs>[i])); ... break; default: ; } Reduction variables are a kind of private variable: they have private copies, but their initial values are chosen according to the reduction operation. Differential Revision: http://reviews.llvm.org/D8915 llvm-svn: 234583	2015-04-10 10:43:45 +00:00
Arnaud A. de Grandmaison	047a686d53	Remove threshold for inserting lifetime markers for named temporaries Now that TailRecursionElimination has been fixed with r222354, the threshold on size for lifetime marker insertion can be removed. This only affects named temporary though, as the patch for unnamed temporaries is still in progress. My previous commit (r222993) was not handling debuginfo correctly, but this could only be seen with some asan tests. Basically, lifetime markers are just instrumentation for the compiler's usage and should not affect debug information; however, the cleanup infrastructure was assuming it contained only destructors, i.e. actual code to be executed, and was setting the breakpoint for the end of the function to the closing '}', and not the return statement, in order to show some destructors have been called when leaving the function. This is wrong when the cleanups are only lifetime markers, and this is now fixed. llvm-svn: 234581	2015-04-10 10:13:52 +00:00
Alexey Bataev	6f1ffc069b	[OPENMP] Refactoring of codegen for OpenMP directives. Refactored API of OpenMPRuntime for compatibility with combined directives. Differential Revision: http://reviews.llvm.org/D8859 llvm-svn: 234564	2015-04-10 04:50:10 +00:00
Nico Weber	bd51a6a99f	Revert r234532 for a bit, it very likely caused http://crbug.com/475768 llvm-svn: 234563	2015-04-10 04:33:03 +00:00
Reid Kleckner	0dbecf2b78	[SEH] Outline finally blocks using the new variable capture support WinEHPrepare was going to have to pattern match the control flow merge and split that the old lowering used, and that wasn't really feasible. Now we can teach WinEHPrepare to pattern match this, which is much simpler: %fp = call i8* @llvm.frameaddress(i32 0) call void @func(iN [01], i8* %fp) This prototype happens to match the prototype used by the Win64 SEH personality function, so this is really simple. llvm-svn: 234532	2015-04-09 20:37:24 +00:00
Reid Kleckner	31a1bb0c14	Reland "[SEH] Implement filter capturing in CodeGen" The test should be fixed. It was failing in NDEBUG builds due to a missing '*' character in a regex. In asserts builds, the pattern matched a single digit value, which became a double digit value in NDEBUG builds. Go figure. This reverts commit r234261. llvm-svn: 234447	2015-04-08 22:23:48 +00:00
Daniel Jasper	303c3ac925	Revert "[SEH] Implement filter capturing in CodeGen" Test fails: http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto_check/3182/ llvm-svn: 234306	2015-04-07 10:07:47 +00:00
Reid Kleckner	0ada50f17f	[SEH] Implement filter capturing in CodeGen While capturing filters aren't very common, we'd like to outline __finally blocks in the frontend to simplify -O0 EH preparation and reduce code size. Finally blocks usually have captures, and this is the first step towards that. Currently we don't support capturing 'this' or VLAs. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D8825 llvm-svn: 234261	2015-04-06 23:51:44 +00:00
David Blaikie	1ed728c499	[opaque pointer type] More GEP API migrations Looks like the VTable code in particular will need some work to pass around the pointee type explicitly. llvm-svn: 234128	2015-04-05 22:45:47 +00:00
David Blaikie	fb901c7abf	[opaque pointer type] more GEP API migrations llvm-svn: 234097	2015-04-04 15:12:29 +00:00
Ulrich Weigand	3a610ebf1e	[SystemZ] Support transactional execution on zEC12 The zEC12 provides the transactional-execution facility. This is exposed to users via a set of builtin routines on other compilers. This patch adds clang support to enable those builtins. In particular, the patch: - enables the transactional-execution feature by default on zEC12 - allows overriding the presence of that feature via the -mhtm/-mno-htm options - adds a predefined macro __HTM__ if the feature is enabled - adds support for the transactional-execution GCC builtins - adds Sema checking to verify the __builtin_tabort abort code - adds the s390intrin.h header file (for GCC compatibility) - adds s390 sections to the htmintrin.h and htmxlintrin.h header files Since this is the first use of target-specific intrinsics on the platform, the patch creates the include/clang/Basic/BuiltinsSystemZ.def file and hooks it up in TargetBuiltins.h and lib/Basic/Targets.cpp. An associated LLVM patch adds the required LLVM IR intrinsics. For reference, the transactional-execution instructions are documented in the z/Architecture Principles of Operation for the zEC12: http://publibfp.boulder.ibm.com/cgi-bin/bookmgr/download/DZ9ZR009.pdf The associated builtins are documented in the GCC manual: http://gcc.gnu.org/onlinedocs/gcc/S_002f390-System-z-Built-in-Functions.html The htmxlintrin.h intrinsics provided for compatibility with the IBM XL compiler are documented in the "z/OS XL C/C++ Programming Guide". llvm-svn: 233804	2015-04-01 12:54:25 +00:00
Alexey Bataev	b4505a7229	[OPENMP] Codegen for 'atomic update' construct. Adds atomic update codegen for the following forms of expressions: x binop= expr; x++; ++x; x--; --x; x = x binop expr; x = expr binop x; If x and expr are integer and binop is associative or x is a LHS in a RHS of the assignment expression, and atomics are allowed for type of x on the target platform atomicrmw instruction is emitted. Otherwise compare-and-swap sequence is emitted: bb: ... atomic load <x> cont: <expected> = phi [ <x>, label %bb ], [ <new_failed>, %cont ] <desired> = <expected> binop <expr> <res> = cmpxchg atomic &<x>, desired, expected <new_failed> = <res>.field1; br <res>field2, label %exit, label %cont exit: ... Differential Revision: http://reviews.llvm.org/D8536 llvm-svn: 233513	2015-03-30 05:20:59 +00:00
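The compare-and-swap fallback described in the 'atomic update' commit above can be sketched in portable C++. This is only an illustration of the retry loop, not the code Clang emits; `atomic_update` and `atomic_update_demo` are hypothetical names introduced here, and `std::atomic` stands in for the IR-level `cmpxchg`.

```cpp
#include <atomic>
#include <cassert>

// Hypothetical helper (not a Clang or OpenMP runtime API): atomically performs
// x = binop(x, expr) with a compare-and-swap retry loop.
template <typename T, typename BinOp>
T atomic_update(std::atomic<T> &x, T expr, BinOp binop) {
  T expected = x.load();              // initial atomic load of x
  T desired = binop(expected, expr);  // <desired> = <expected> binop <expr>
  // On failure compare_exchange_weak stores the value it observed into
  // `expected`, so the next attempt recomputes `desired` from fresh data.
  while (!x.compare_exchange_weak(expected, desired)) {
    desired = binop(expected, expr);
  }
  return desired;
}

// Usage sketch: std::atomic<int> has no fetch-style member for multiplication,
// so the update goes through the retry loop.
inline void atomic_update_demo() {
  std::atomic<int> x{6};
  int result = atomic_update(x, 7, [](int a, int b) { return a * b; });
  assert(result == 42);
  assert(x.load() == 42);
}
```

When the operation maps onto a single read-modify-write instruction (for example integer addition), a direct `fetch_add` would play the role of the `atomicrmw` path mentioned in the commit message.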
Alexey Bataev	a63048e4fd	[OPENMP] Codegen for 'copyprivate' clause ('single' directive). If at least one 'copyprivate' clause is associated with the single directive, the following code is generated: ``` i32 did_it = 0; \\ for 'copyprivate' clause if(__kmpc_single(ident_t , gtid)) { SingleOpGen(); __kmpc_end_single(ident_t , gtid); did_it = 1; \\ for 'copyprivate' clause } <copyprivate_list>[0] = &var0; ... <copyprivate_list>[n] = &varn; call __kmpc_copyprivate(ident_t , gtid, <copyprivate_list_size>, <copyprivate_list>, <copy_func>, did_it); ... void<copy_func>(void LHSArg, void RHSArg) { Dst = (void [n])(LHSArg); Src = (void * [n])(RHSArg); Dst[0] = Src[0]; ... Dst[n] = Src[n]; } ``` All list items from all 'copyprivate' clauses are gathered into a single <copyprivate list> (<copyprivate_list_size> is the size in bytes of this list), and <copy_func> is used to propagate values of private or threadprivate variables from the 'single' region to the other implicit threads of the outer 'parallel' region. Differential Revision: http://reviews.llvm.org/D8410 llvm-svn: 232932	2015-03-23 06:18:07 +00:00
Peter Collingbourne	d2926c91d5	Implement bad cast checks using control flow integrity information. This scheme checks that pointer and lvalue casts are made to an object of the correct dynamic type; that is, the dynamic type of the object must be a derived class of the pointee type of the cast. The checks are currently only introduced where the class being casted to is a polymorphic class. Differential Revision: http://reviews.llvm.org/D8312 llvm-svn: 232241	2015-03-14 02:42:25 +00:00
Benjamin Kramer	7f1f6b5370	Disambiguate call for GCC. llvm-svn: 232122	2015-03-12 23:46:55 +00:00
Benjamin Kramer	51680bccda	CodeGen: Base the conditional cleanup machinery on variadic templates This is complicated by the fact that we can't simply use side-effecting calls in an argument list without losing all guarantees about the order they're emitted. To keep things deterministic we use tuples and brace initialization, which thankfully guarantees evaluation order. No functionality change intended. llvm-svn: 232121	2015-03-12 23:41:40 +00:00
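The evaluation-order point in the commit above can be illustrated with a small, self-contained sketch. It assumes nothing about the actual CodeGen classes: `trace`, `record`, and `record_all` are hypothetical names. The initializer clauses of a braced-init-list, including those produced by a pack expansion, are evaluated left to right, whereas the arguments of an ordinary function call have no such guarantee.

```cpp
#include <cassert>
#include <tuple>
#include <vector>

// Hypothetical trace buffer used to observe the order of side effects.
std::vector<int> &trace() {
  static std::vector<int> t;
  return t;
}

// Side-effecting call: logs its argument, then returns it.
int record(int id) {
  trace().push_back(id);
  return id;
}

// Pack expansion inside a braced-init-list: the record(...) calls run
// strictly left to right, so the trace order is deterministic.
template <typename... Ids>
std::tuple<Ids...> record_all(Ids... ids) {
  return std::tuple<Ids...>{record(ids)...};
}

// Usage sketch.
inline void record_all_demo() {
  trace().clear();
  auto saved = record_all(1, 2, 3);
  assert((trace() == std::vector<int>{1, 2, 3}));
  assert(std::get<0>(saved) == 1);
}
```

Writing the same expansion as plain call arguments, for example `f(record(ids)...)`, would leave the order of the `record` calls up to the compiler.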
Alexey Bataev	2df54a07bf	[OPENMP] Initial codegen for 'omp sections' and 'omp section' directives. If only one section is found in the sections region, it is emitted just like single region. Otherwise it is emitted as a static non-chunked loop. #pragma omp sections { #pragma omp section {1} ... #pragma omp section {n} } is translated to something like i32 <iter_var> i32 <last_iter> = 0 i32 <lower_bound> = 0 i32 <upper_bound> = n-1 i32 <stride> = 1 call void @__kmpc_for_static_init_4(<loc>, i32 <gtid>, i32 34/static non-chunked/, i32* <last_iter>, i32* <lower_bound>, i32* <upper_bound>, i32* <stride>, i32 1/increment always 1/, i32 1/chunk always 1/) <upper_bound> = min(<upper_bound>, n-1) <iter_var> = <lb> check: br <iter_var> <= <upper_bound>, label cont, label exit continue: switch (IV) { case 0: {1}; break; ... case <NumSection> - 1: {n}; break; } ++<iter_var> br label check exit: call void @__kmpc_for_static_fini(<loc>, i32 <gtid>) Differential Revision: http://reviews.llvm.org/D8244 llvm-svn: 232021	2015-03-12 08:53:29 +00:00
David Majnemer	7c23707174	MS ABI: Implement support for throwing a C++ exception Throwing a C++ exception, under the MS ABI, is implemented using three components: - ThrowInfo structure which contains information like CV qualifiers, what destructor to call and a pointer to the CatchableTypeArray. - In a significant departure from the Itanium ABI, copying by-value occurs in the runtime and not at the catch site. This means we need to enumerate all possible types that this exception could be caught as and encode the necessary information to convert from the exception object's type to the catch handler's type. This includes complicated derived to base conversions and the execution of copy-constructors. N.B. This implementation doesn't support the execution of a copy-constructor from within the runtime for now. Adding support for that functionality is quite difficult due to things like default argument expressions which may evaluate arbitrary code hiding in the copy-constructor's parameters. Differential Revision: http://reviews.llvm.org/D8066 llvm-svn: 231328	2015-03-05 00:46:22 +00:00
Alexey Bataev	8cbe0a6b62	[OPENMP] Fixed codegen for directives without function outlining. Fixed crash on codegen for directives like 'omp for', 'omp single' etc. inside of the 'omp parallel', 'omp task' etc. regions. llvm-svn: 230621	2015-02-26 10:27:34 +00:00
Peter Collingbourne	a4ccff3281	Implement Control Flow Integrity for virtual calls. This patch introduces the -fsanitize=cfi-vptr flag, which enables a control flow integrity scheme that checks that virtual calls take place using a vptr of the correct dynamic type. More details in the new docs/ControlFlowIntegrity.rst file. It also introduces the -fsanitize=cfi flag, which is currently a synonym for -fsanitize=cfi-vptr, but will eventually cover all CFI checks implemented in Clang. Differential Revision: http://reviews.llvm.org/D7424 llvm-svn: 230055	2015-02-20 20:30:56 +00:00
Aaron Ballman	abc1892057	Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; Clang edition. llvm-svn: 229339	2015-02-15 22:54:08 +00:00
Aaron Ballman	673476684e	Removing LLVM_EXPLICIT, as MSVC 2012 was the last reason for requiring the macro. NFC; Clang edition. llvm-svn: 229336	2015-02-15 22:00:28 +00:00
David Majnemer	a5b195a1dc	Revert "Revert r229082 for a bit, it caused PR22577." This reverts commit r229123. It was a red herring, the bug was present without r229082. llvm-svn: 229205	2015-02-14 01:35:12 +00:00
Nico Weber	7ce96b853d	Revert r229082 for a bit, it caused PR22577. llvm-svn: 229123	2015-02-13 16:27:00 +00:00
David Majnemer	abc482effc	MS ABI: Implement /volatile:ms The /volatile:ms semantics turn volatile loads and stores into atomic acquire and release operations. This distinction is important because volatile memory operations do not form a happens-before relationship with non-atomic memory. This means that a volatile store is not sufficient for implementing a mutex unlock routine. Differential Revision: http://reviews.llvm.org/D7580 llvm-svn: 229082	2015-02-13 07:55:47 +00:00
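To make the mutex-unlock remark in the commit above concrete, here is a minimal spinlock sketch in standard C++ that uses the acquire and release orderings which /volatile:ms attaches to volatile loads and stores. `SpinLock` and `guarded_increment` are hypothetical names for illustration only, and the sketch uses `std::atomic` rather than `volatile`.

```cpp
#include <atomic>
#include <cassert>

// Hypothetical spinlock: acquire on lock, release on unlock.
class SpinLock {
  std::atomic<bool> locked_{false};

public:
  // Returns true if the lock was free and is now held by the caller.
  bool try_lock() {
    return !locked_.exchange(true, std::memory_order_acquire);
  }
  void lock() {
    while (!try_lock()) {
      // Busy-wait until the current holder releases the lock.
    }
  }
  // The release store makes writes done while holding the lock visible to
  // the next thread that acquires it; a plain volatile store under standard
  // C++ semantics would not give that guarantee.
  void unlock() { locked_.store(false, std::memory_order_release); }
};

// Usage sketch: increments a counter under the lock.
int guarded_increment(SpinLock &lock, int &counter) {
  lock.lock();
  int value = ++counter;
  lock.unlock();
  assert(value == counter);
  return value;
}
```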
Reid Kleckner	11c033e8aa	SEH: Use the SEHTryEpilogueStack instead of a separate bool We don't need a bool to track this now that we have a stack for it. llvm-svn: 228982	2015-02-12 23:40:45 +00:00
Nico Weber	5779f84000	[ms] Implement codegen for __leave. Reviewed at http://reviews.llvm.org/D7575 llvm-svn: 228977	2015-02-12 23:16:11 +00:00
Richard Smith	527473df0d	Fix typoo. llvm-svn: 228963	2015-02-12 21:23:20 +00:00
Nico Weber	1bebad1b86	Wrap to 80 columns. No behavior change. llvm-svn: 228880	2015-02-11 22:33:32 +00:00
Reid Kleckner	a593000f01	Add the 'noinline' attribute to call sites within __try bodies LLVM doesn't support non-call exceptions, so inlining makes it harder to catch such asynchronous exceptions. llvm-svn: 228876	2015-02-11 21:40:48 +00:00
Reid Kleckner	e7b3f7c70d	Emit landing pads for SEH even if nounwind is present Disabling exceptions applies nounwind to lots of functions. SEH catches asynch exceptions, so emit the landing pad anyway. llvm-svn: 228769	2015-02-11 00:00:21 +00:00
Reid Kleckner	aca01db706	Implement IRGen for SEH __finally and AbnormalTermination Previously we would simply double-emit the body of the __finally block, but that doesn't work when it contains any kind of Decl, which we can't double emit. This fixes that by emitting the block once and branching into a shared code region and then branching back out. llvm-svn: 228222	2015-02-04 22:37:07 +00:00
David Blaikie	4d52443c0e	DebugInfo: Attribute cleanup code to the end of the scope, not the end of the function. Now if you break on a dtor and go 'up' in your debugger (or you get an asan failure in a dtor) during an exception unwind, you'll have more context. Instead of all dtors appearing to be called from the '}' of the function, they'll be attributed to the end of the scope of the variable, the same as the non-exceptional dtor call. This doesn't /quite/ remove all uses of CurEHLocation (which might be nice to remove, for a few reasons) - it's still used to choose the location for some other work in the landing pad. It'd be nice to attribute that code to the same location as the exception calls within the block and to remove CurEHLocation. llvm-svn: 228181	2015-02-04 19:47:54 +00:00
David Majnemer	fd1e739a44	CodeGen: Copy-ctors must obey the destination's alignment requirement We would synthesize memcpy intrinsics when emitting calls to trivial C++ constructors but we wouldn't take into account the alignment of the destination. llvm-svn: 228061	2015-02-03 23:04:06 +00:00
Alexander Musman	df7a8e2bc8	Support ‘omp for’ with static chunked schedule kind. Differential Revision: http://reviews.llvm.org/D7006 llvm-svn: 226795	2015-01-22 08:49:35 +00:00
Reid Kleckner	1d59f99f5c	Initial support for Win64 SEH IR emission The lowering looks a lot like normal EH lowering, with the exception that the exceptions are caught by executing filter expression code instead of matching typeinfo globals. The filter expressions are outlined into functions which are used in landingpad clauses where typeinfo would normally go. Major aspects that still need work: - Non-call exceptions in __try bodies won't work yet. The plan is to outline the __try block in the frontend to keep things simple. - Filter expressions cannot use local variables until capturing is implemented. - __finally blocks will not run after exceptions. Fixing this requires work in the LLVM SEH preparation pass. The IR lowering looks like this: // C code: bool safe_div(int n, int d, int r) { __try { r = normal_div(n, d); } __except(_exception_code() == EXCEPTION_INT_DIVIDE_BY_ZERO) { return false; } return true; } ; LLVM IR: define i32 @filter(i8* %e, i8* %fp) { %ehptrs = bitcast i8* %e to i32 %ehrec = load i32 %ehptrs %code = load i32* %ehrec %matches = icmp eq i32 %code, i32 u0xC0000094 %matches.i32 = zext i1 %matches to i32 ret i32 %matches.i32 } define i1 zeroext @safe_div(i32 %n, i32 %d, i32* %r) { %rr = invoke i32 @normal_div(i32 %n, i32 %d) to label %normal unwind to label %lpad normal: store i32 %rr, i32* %r ret i1 1 lpad: %ehvals = landingpad {i8, i32} personality i32 (...) @__C_specific_handler catch i8* bitcast (i32 (i8, i8)* @filter to i8) %ehptr = extractvalue {i8, i32} %ehvals, i32 0 %sel = extractvalue {i8, i32} %ehvals, i32 1 %filter_sel = call i32 @llvm.eh.seh.typeid.for(i8 bitcast (i32 (i8, i8)* @filter to i8*)) %matches = icmp eq i32 %sel, %filter_sel br i1 %matches, label %eh.except, label %eh.resume eh.except: ret i1 false eh.resume: resume } Reviewers: rjmccall, rsmith, majnemer Differential Revision: http://reviews.llvm.org/D5607 llvm-svn: 226760	2015-01-22 01:36:17 +00:00
David Blaikie	835afb205f	DebugInfo: Remove forced column-info workaround for inlined calls This workaround was to provide unique call sites to ensure LLVM's inline debug info handling would properly unique two calls to the same function on the same line. Instead, this has now been fixed in LLVM (r226736) and the workaround here can be removed. Originally committed in r176895, but this isn't a straight revert due to all the changes since then. I just searched for anything ForcedColumn* related and removed them. We could test this - but it didn't strike me as terribly valuable once we're no longer adding this workaround everything just works as expected & it's no longer a special case to test for. llvm-svn: 226738	2015-01-21 23:08:17 +00:00
David Blaikie	a0a1a8726f	Add comment after API changes in r225090 Code review suggestion by Eric Christopher. llvm-svn: 226395	2015-01-18 02:48:07 +00:00
David Blaikie	66e4197f07	Reapply r225000 (reverted in r225555): DebugInfo: Generalize debug info location handling (and follow-up commits). Several pieces of code were relying on implicit debug location setting which usually lead to incorrect line information anyway. So I've fixed those (in r225955 and r225845) separately which should pave the way for this commit to be cleanly reapplied. The reason these implicit dependencies resulted in crashes with this patch is that the debug location would no longer implicitly leak from one place to another, but be set back to invalid. Once a call with no/invalid location was emitted, if that call was ever inlined it could produce invalid debugloc chains and assert during LLVM's codegen. There may be further cases of such bugs in this patch - they're hard to flush out with regression testing, so I'll keep an eye out for reports and investigate/fix them ASAP if they come up. Original commit message: Reapply "DebugInfo: Generalize debug info location handling" Originally committed in r224385 and reverted in r224441 due to concerns this change might've introduced a crash. Turns out this change fixes the crash introduced by one of my earlier more specific location handling changes (those specific fixes are reverted by this patch, in favor of the more general solution). Recommitted in r224941 and reverted in r224970 after it caused a crash when building compiler-rt. Looks to be due to this change zeroing out the debug location when emitting default arguments (which were meant to inherit their outer expression's location) thus creating call instructions without locations - these create problems for inlining and must not be created. That is fixed and tested in this version of the change. Original commit message: This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. 
This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 225956	2015-01-14 07:38:27 +00:00
David Blaikie	f142580dea	Sink a parameter into the callee since it's always the same expression in terms of another parameter llvm-svn: 225856	2015-01-14 00:04:42 +00:00
David Blaikie	f353d3ecd0	Revert "DebugInfo: Generalize debug info location handling" and related commits This reverts commit r225000, r225021, r225083, r225086, r225090. The root change (r225000) still has several issues where it's caused calls to be emitted without debug locations. This causes assertion failures if/when those calls are inlined. I'll work up some test cases and fixes before recommitting this. llvm-svn: 225555	2015-01-09 23:00:28 +00:00
David Blaikie	b9a23c9155	DebugInfo: Provide a less subtle way to set the debug location of simple ret instructions un-XFAILing the test XFAIL'd in r225086 after it regressed in r225083. llvm-svn: 225090	2015-01-02 22:07:26 +00:00
David Blaikie	84fe79cfc3	Reapply "DebugInfo: Generalize debug info location handling" Originally committed in r224385 and reverted in r224441 due to concerns this change might've introduced a crash. Turns out this change fixes the crash introduced by one of my earlier more specific location handling changes (those specific fixes are reverted by this patch, in favor of the more general solution). Recommitted in r224941 and reverted in r224970 after it caused a crash when building compiler-rt. Looks to be due to this change zeroing out the debug location when emitting default arguments (which were meant to inherit their outer expression's location) thus creating call instructions without locations - these create problems for inlining and must not be created. That is fixed and tested in this version of the change. Original commit message: This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. 
This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 225000	2014-12-30 19:39:33 +00:00
David Blaikie	608a24501c	Revert "DebugInfo: Generalize debug info location handling" Asserting when building compiler-rt when using a GCC host compiler. Reverting while I investigate. This reverts commit r224941. llvm-svn: 224970	2014-12-29 23:49:00 +00:00
David Blaikie	3945d1bd99	Reapply "DebugInfo: Generalize debug info location handling" Originally committed in r224385 and reverted in r224441 due to concerns this change might've introduced a crash. Turns out this change fixes the crash introduced by one of my earlier more specific location handling changes (those specific fixes are reverted by this patch, in favor of the more general solution). Original commit message: This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). 
This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 224941	2014-12-29 18:18:45 +00:00
Alexey Bataev	7cb1789011	Fix for PR21915: assert on multidimensional VLA in function arguments. Fixed assertion on type checking for arguments and parameters on function call if arguments are pointers to VLA Differential Revision: http://reviews.llvm.org/D6655 llvm-svn: 224504	2014-12-18 06:54:53 +00:00
David Blaikie	06b2c54db9	Revert "DebugInfo: Generalize debug info location handling" Fails an ASan bootstrap - I'll try to reproduce locally & sort that out before recommitting. This reverts commit r224385. llvm-svn: 224441	2014-12-17 18:02:04 +00:00
David Blaikie	bf22a4eaee	DebugInfo: Generalize debug info location handling This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 224385	2014-12-16 22:49:17 +00:00
Alexey Bataev	f841bd9fcd	[OPENMP] Bugfix for processing of global variables in OpenMP regions. Currently, if a global variable is marked as a private OpenMP variable, the compiler crashes in the debug version or generates incorrect code in the release version. This happens because the original global variable is used in the OpenMP region instead of the generated private copy, since global variables are currently not captured in the OpenMP region. This patch adds capturing of global variables iff a private copy of the global variable must be used in the OpenMP region. Differential Revision: http://reviews.llvm.org/D6259 llvm-svn: 224323	2014-12-16 07:00:22 +00:00
Alexander Musman	c638868bdf	First patch with codegen of the 'omp for' directive. It implements the simplest case, which is used when no chunk_size is specified in the schedule(static) or no 'schedule' clause is specified - the iteration space is divided by the library into chunks that are approximately equal in size, and at most one chunk is distributed to each thread. In this case, we do not need an outer loop in each thread - each thread requests once which iterations range it should handle (using __kmpc_for_static_init runtime call) and then runs the inner loop on this range. Differential Revision: http://reviews.llvm.org/D5865 llvm-svn: 224233	2014-12-15 07:07:06 +00:00
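The static, non-chunked division described in the commit above can be sketched as a pure function. This is one plausible way to split an iteration space into near-equal contiguous ranges, with at most one range per thread; the exact partition is chosen by the OpenMP runtime and may differ. `Range` and `static_range` are hypothetical names and are not part of the `__kmpc_for_static_init` interface.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Inclusive iteration range handed to one thread; `empty` is set when the
// thread receives no iterations.
struct Range {
  std::int64_t lower;
  std::int64_t upper;
  bool empty;
};

// Hypothetical helper: splits iterations [0, n) across num_threads threads.
// The first (n % num_threads) threads get one extra iteration each.
Range static_range(std::int64_t n, int num_threads, int tid) {
  assert(n >= 0 && num_threads > 0 && tid >= 0 && tid < num_threads);
  std::int64_t base = n / num_threads;
  std::int64_t extra = n % num_threads;
  std::int64_t lower = tid * base + std::min<std::int64_t>(tid, extra);
  std::int64_t size = base + (tid < extra ? 1 : 0);
  return {lower, lower + size - 1, size == 0};
}
```

With 10 iterations and 4 threads this yields the ranges 0..2, 3..5, 6..7 and 8..9; each thread asks for its bounds once and then runs only the inner loop over that range, which is why no outer loop is needed in this case.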
Alexey Bataev	452d8e1133	Bugfix for Codegen of atomic load/store/other ops. Currently clang fires assertions on x86-64 on any atomic operations for long double operands. Patch fixes codegen for such operations. Differential Revision: http://reviews.llvm.org/D6499 llvm-svn: 224230	2014-12-15 05:25:25 +00:00
Peter Collingbourne	f770683f14	Implement the __builtin_call_with_static_chain GNU extension. The extension has the following syntax: __builtin_call_with_static_chain(Call, Chain) where Call must be a function call expression and Chain must be of pointer type This extension performs a function call Call with a static chain pointer Chain passed to the callee in a designated register. This is useful for calling foreign language functions whose ABI uses static chain pointers (e.g. to implement closures). Differential Revision: http://reviews.llvm.org/D6332 llvm-svn: 224167	2014-12-12 23:41:25 +00:00
David Blaikie	7f138811cd	DebugInfo: Correct the location of initializations of auto. llvm-svn: 223839	2014-12-09 22:04:13 +00:00
David Blaikie	538deffd2d	DebugInfo: Emit the correct location for initialization of a complex variable Especially useful for sanitizer reports. llvm-svn: 223825	2014-12-09 20:52:24 +00:00
David Blaikie	73ca56942d	DebugInfo: Correctly identify the location of C++ member initializer list elements This particularly helps the fidelity of ASan reports (which can occur even in these examples - if, for example, one uses placement new over a buffer of insufficient size - now ASan will correctly identify which member's initialization went over the end of the buffer). This doesn't cover all types of members - more coming. llvm-svn: 223726	2014-12-09 00:32:22 +00:00
Saleem Abdulrasool	a14ac3f437	CodeGen: refactor ARM builtin handling Create a helper function to construct a value for the ARM hint intrinsic rather than inlining the construction. In order to avoid the use of the sentinel value, inline the use of intrinsic instruction retrieval. NFC. llvm-svn: 223338	2014-12-04 04:52:37 +00:00
Nico Weber	aad4af6d50	Fix incorrect codegen for devirtualized calls to virtual overloaded operators. Consider this program: struct A { virtual void operator-() { printf("base\n"); } }; struct B final : public A { virtual void operator-() override { printf("derived\n"); } }; int main() { B* b = new B; -static_cast<A&>(*b); } Before this patch, clang saw the virtual call to A::operator-(), figured out that it can be devirtualized, and then just called A::operator-() directly, without going through the vtable. Instead, it should've looked up which operator-() the call devirtualizes to and should've called that. For regular virtual member calls, clang gets all this right already. So instead of giving EmitCXXOperatorMemberCallee() all the logic that EmitCXXMemberCallExpr() already has, cut the latter function into two pieces, call the second piece EmitCXXMemberOrOperatorMemberCallExpr(), and use it also to generate code for calls to virtual member operators. This way, virtual overloaded operators automatically don't get devirtualized if they have covariant returns (like it was done for regular calls in r218602), etc. This also happens to fix (or at least improve) codegen for explicit constructor calls (`A a; a.A::A()`) in MS mode with -fsanitize-address-field-padding=1. (This adjustment for virtual operator calls seems still wrong with the MS ABI.) llvm-svn: 223185	2014-12-03 01:21:41 +00:00
Arnaud A. de Grandmaison	f3470cc979	Revert "Remove threshold for lifetime marker insertion of named temporaries" Revert r222993 while I investigate some MemorySanitizer failures. llvm-svn: 222995	2014-12-01 09:30:16 +00:00
Arnaud A. de Grandmaison	f2730e2d22	Remove threshold for lifetime marker insertion of named temporaries Now that TailRecursionElimination has been fixed with r222354, the threshold on size for lifetime marker insertion can be removed. This only affects named temporary though, as the patch for unnamed temporaries is still in progress. llvm-svn: 222993	2014-12-01 09:13:54 +00:00
Alexey Samsonov	e396bfc064	Bundle conditions checked by UBSan with sanitizer kinds they implement. Summary: This change makes CodeGenFunction::EmitCheck() take several conditions that need to be checked (all of them need to be true), together with the sanitizer kinds these checks are for. This would allow splitting one call into the UBSan runtime into several calls in case different sanitizer kinds have different recoverability settings. Tests should be fixed accordingly, I'm working on it. Test Plan: regression test suite. Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6219 llvm-svn: 221716	2014-11-11 22:03:54 +00:00
Alexey Samsonov	a041610f11	[Sanitizer] Refactor sanitizer options in LangOptions. Get rid of ugly SanitizerOptions class thrust into LangOptions: * Make SanitizeAddressFieldPadding a regular language option, and rely on default behavior to initialize/reset it. * Make SanitizerBlacklistFile a regular member LangOptions. * Introduce the helper class "SanitizerSet" to represent the set of enabled sanitizers and make it a member of LangOptions. It is exactly the entity we want to cache and modify in CodeGenFunction, for instance. We'd also be able to reuse SanitizerSet in CodeGenOptions for storing the set of recoverable sanitizers, and in the Driver to represent the set of sanitizers turned on/off by the commandline flags. No functionality change. llvm-svn: 221653	2014-11-11 01:26:14 +00:00
Alexey Samsonov	4c1a96f519	Propagate SanitizerKind into CodeGenFunction::EmitCheck() call. Make sure CodeGenFunction::EmitCheck() knows which sanitizer it emits check for. Make CheckRecoverableKind enum an implementation detail and move it away from header. Currently CheckRecoverableKind is determined by the type of sanitizer ("unreachable" and "return" are unrecoverable, "vptr" is always-recoverable, all the rest are recoverable). This will change in future if we allow to specify which sanitizers are recoverable, and which are not by -fsanitize-recover= flag. No functionality change. llvm-svn: 221635	2014-11-10 22:27:30 +00:00
Reid Kleckner	c311aba247	Silence a warning from MSVC "14" by making an enum unsigned It says there is a narrowing conversion when we assign it to an unsigned 3 bit bitfield. Also, use unsigned instead of size_t for the Size field of the struct in question. Otherwise they won't run together in MSVC or clang-cl. llvm-svn: 221019	2014-10-31 23:33:56 +00:00
David Majnemer	0c0b6d9ac6	MS ABI: Properly call global delete when invoking virtual destructors Summary: The Itanium ABI approach of using offset-to-top isn't possible with the MS ABI, it doesn't have that kind of information lying around. Instead, we do the following: - Call the virtual deleting destructor with the "don't delete the object flag" set. The virtual deleting destructor will return a pointer to 'this' adjusted to the most derived class. - Call the global delete using the adjusted 'this' pointer. Reviewers: rnk Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D5996 llvm-svn: 220993	2014-10-31 20:09:12 +00:00
Alexey Samsonov	035462c1cf	Get rid of SanitizerOptions::Disabled global. NFC. SanitizerOptions is not even a POD now, so having a global variable of this type is not nice. Instead, provide a regular constructor and clear() method, and let each CodeGenFunction have its own copy of the SanitizerOptions it uses. llvm-svn: 220920	2014-10-30 19:33:44 +00:00
Alexey Bataev	330de03083	Improved capturing variable-length array types in CapturedStmt. An updated implementation of VLA type capturing based on the previously committed solution for lambdas. This version captures the whole VLA type instead of the particular variables that are part of the VLA size expression, and allows the previously calculated size of the VLA type to be used in captured regions. Required for OpenMP. Differential Revision: http://reviews.llvm.org/D5099 llvm-svn: 220850	2014-10-29 12:21:55 +00:00
Fariborz Jahanian	9ad94aa280	Objective-C. revert patch for rdar://17554063. llvm-svn: 220812	2014-10-28 18:28:16 +00:00
Aaron Ballman	560aa94ede	Fixing the MSVC build by removing friendship with CodeGenFunction; NFC. llvm-svn: 220293	2014-10-21 13:39:56 +00:00
Alexey Bataev	03b340a3a5	[OPENMP] Codegen for 'private' clause in 'parallel' directive. This patch generates some helper variables which are used as private copies of the corresponding original variables inside an OpenMP 'parallel' directive. These generated variables are initialized by default (with the default constructor, if any). In the outlined function, references to original variables are replaced by references to these private helper variables. At the end of the initialization of the private variables, an implicit barrier is set by calling the __kmpc_barrier(...) runtime function to be sure that all threads were initialized using original values of the variables. Differential Revision: http://reviews.llvm.org/D4752 llvm-svn: 220262	2014-10-21 03:16:40 +00:00
Kostya Serebryany	293dc9be6e	Insert poisoned paddings between fields in C++ classes so that AddressSanitizer can find intra-object-overflow bugs Summary: The general approach is to add extra paddings after every field in AST/RecordLayoutBuilder.cpp, then add code to CTORs/DTORs that poisons the paddings (CodeGen/CGClass.cpp). Everything is done under the flag -fsanitize-address-field-padding. The blacklist file (-fsanitize-blacklist) allows to avoid the transformation for given classes or source files. See also https://code.google.com/p/address-sanitizer/wiki/IntraObjectOverflow Test Plan: run SPEC2006 and some of the Chromium tests with -fsanitize-address-field-padding Reviewers: samsonov, rnk, rsmith Reviewed By: rsmith Subscribers: majnemer, cfe-commits Differential Revision: http://reviews.llvm.org/D5687 llvm-svn: 219961	2014-10-16 20:54:52 +00:00
Hal Finkel	6fae849597	Moving CGF::EmitAlignmentAssumption to IRBuilder The functionality contained in CodeGenFunction::EmitAlignmentAssumption has been moved to IRBuilder (so that it can also be used by LLVM-level code). Remove this now-duplicate implementation in favor of the IRBuilder code. llvm-svn: 219877	2014-10-15 23:45:08 +00:00
Alexey Samsonov	eb47d8a2c8	Sanitize upcasts and conversion to virtual base. This change adds UBSan check to upcasts. Namely, when we perform derived-to-base conversion, we: 1) check that the pointer-to-derived has suitable alignment and underlying storage, if this pointer is non-null. 2) if vptr-sanitizer is enabled, and we perform conversion to virtual base, we check that pointer-to-derived has a matching vptr. llvm-svn: 219642	2014-10-13 23:59:00 +00:00
Benjamin Kramer	c52193f4c7	Unfriend CGOpenMPRegionInfo so it can go into an anonymous namespace. Also remove some unnecessary virtual keywords. NFC. llvm-svn: 219497	2014-10-10 13:57:57 +00:00
Alexey Bataev	1809571c76	Code reformatting and improvement for OpenMP. Moved CGOpenMPRegionInfo from CGOpenMPRuntime.h to CGOpenMPRuntime.cpp file and reworked the code for this change. Also added processing of ThreadID variable passed as an argument in outlined functions in parallel and task directives. llvm-svn: 219490	2014-10-10 12:19:54 +00:00
Alexey Bataev	435ad7ba5e	Code improvements in OpenMP CodeGen. This patch makes class OMPPrivateScope a common class for all private variables. Reworked processing of firstprivate variables (now it is based on OMPPrivateScope too). llvm-svn: 219486	2014-10-10 09:48:26 +00:00
Nick Lewycky	5d1159ebe9	Revert r218865 because it introduced PR21236, a crash in codegen emitting the try block. llvm-svn: 219470	2014-10-10 04:05:00 +00:00
Reid Kleckner	79b0fd7a48	Promote null pointer constants used as arguments to variadic functions Make it possible to pass NULL through variadic functions on 64-bit Windows targets. The Visual C++ headers define NULL to 0, when they should define it to 0LL on Win64 so that NULL is a pointer-sized integer. Fixes PR20949. Reviewers: thakis, rsmith Differential Revision: http://reviews.llvm.org/D5480 llvm-svn: 219456	2014-10-10 00:05:45 +00:00
Alexey Bataev	13314bf526	[OPENMP] 'omp teams' directive basic support. Includes parsing and semantic analysis for 'omp teams' directive support from OpenMP 4.0. Adds additional analysis to 'omp target' directive with 'omp teams' directive. llvm-svn: 219385	2014-10-09 04:18:56 +00:00
Alexey Bataev	4a5bb772c3	[OPENMP] Codegen for 'firstprivate' clause. This patch generates some helper variables that are used as private copies of the corresponding original variables inside an OpenMP 'parallel' directive. These generated variables are initialized by copy using values of the original variables (with the copy constructor, if any). For arrays, an initializer is generated for a single element and in the codegen procedure this initial value is automatically propagated between all elements of the private copy. In the outlined function, references to original variables are replaced by references to these private helper variables. At the end of the initialization of the private variables, an implicit barrier is generated by calling the __kmpc_barrier(...) runtime function to be sure that all threads were initialized using original values of the variables. Differential Revision: http://reviews.llvm.org/D5140 llvm-svn: 219306	2014-10-08 14:01:46 +00:00
Alexey Bataev	8068b643c4	Revert commit r219297. Still troubles with OpenMP/parallel_firstprivate_codegen.cpp (now in ARM buildbots). llvm-svn: 219298	2014-10-08 12:00:22 +00:00
Alexey Bataev	3854f63aaf	[OPENMP] Codegen for 'firstprivate' clause. This patch generates some helper variables that are used as private copies of the corresponding original variables inside an OpenMP 'parallel' directive. These generated variables are initialized by copy using values of the original variables (with the copy constructor, if any). For arrays, an initializer is generated for a single element and in the codegen procedure this initial value is automatically propagated between all elements of the private copy. In the outlined function, references to original variables are replaced by references to these private helper variables. At the end of the initialization of the private variables, an implicit barrier is generated by calling the __kmpc_barrier(...) runtime function to be sure that all threads were initialized using original values of the variables. Differential Revision: http://reviews.llvm.org/D5140 llvm-svn: 219297	2014-10-08 11:35:04 +00:00
Alexey Bataev	bdef50e1ad	Revert back r219295. To fix issues with test OpenMP/parallel_firstprivate_codegen.cpp llvm-svn: 219296	2014-10-08 11:12:35 +00:00
Alexey Bataev	e7a5517a58	[OPENMP] Codegen for 'firstprivate' clause. This patch generates some helper variables that are used as private copies of the corresponding original variables inside an OpenMP 'parallel' directive. These generated variables are initialized by copy using values of the original variables (with the copy constructor, if any). For arrays, an initializer is generated for a single element and in the codegen procedure this initial value is automatically propagated between all elements of the private copy. In the outlined function, references to original variables are replaced by references to these private helper variables. At the end of the initialization of the private variables, an implicit barrier is generated by calling the __kmpc_barrier(...) runtime function to be sure that all threads were initialized using original values of the variables. Differential Revision: http://reviews.llvm.org/D5140 llvm-svn: 219295	2014-10-08 10:42:55 +00:00
Renato Golin	9804fa5d48	Revert "[OPENMP] 'omp teams' directive basic support. Includes parsing and semantic analysis for 'omp teams' directive support from OpenMP 4.0. Adds additional analysis to 'omp target' directive with 'omp teams' directive." This reverts commit r219197 because it broke ARM self-hosting buildbots with segmentation fault errors in many tests. llvm-svn: 219289	2014-10-08 09:06:45 +00:00
Reid Kleckner	453e056467	Fix IRGen for referencing a static local before emitting its decl Summary: Previously CodeGen assumed that static locals were emitted before they could be accessed, which is true for automatic storage duration locals. However, it is possible to have CodeGen emit a nested function that uses a static local before emitting the function that defines the static local, breaking that assumption. Fix it by creating the static local upon access and ensuring that the deferred function body gets emitted. We may not be able to emit the initializer properly from outside the function body, so don't try. Fixes PR18020. See also previous attempts to fix static locals in PR6769 and PR7101. Reviewers: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D4787 llvm-svn: 219265	2014-10-08 01:07:54 +00:00
Alexey Bataev	941bbec6f4	[OPENMP] 'omp teams' directive basic support. Includes parsing and semantic analysis for 'omp teams' directive support from OpenMP 4.0. Adds additional analysis to 'omp target' directive with 'omp teams' directive. llvm-svn: 219197	2014-10-07 10:13:33 +00:00
Alexander Musman	d196ef2124	[OPENMP] Small refactoring of EmitOMPSimdLoop helper routine. No functional changes intended. Renamed EmitOMPSimdLoop to EmitOMPInnerLoop, I plan to re-use it to emit inner loop in the future patches for CodeGen of the worksharing loop directives (omp for, omp for simd). llvm-svn: 219195	2014-10-07 08:57:09 +00:00
David Majnemer	b3341ea453	MS ABI: Implement thread_local for global variables Summary: This adds support for the C++11 feature, thread_local global variables. The ABI Clang implements is an improvement of the MSVC ABI. Sadly, further improvements could be made but not without sacrificing ABI compatibility. The feature is implemented as follows: - All thread_local initialization routines are pointed to from the .CRT$XDU section. - All non-weak thread_local variables have their initialization routines called from a single function instead of getting their own .CRT$XDU section entry. This is done to open up optimization opportunities to the compiler. - All weak thread_local variables have their own .CRT$XDU section entry. This entry is in a COMDAT with the global variable it is initializing; this ensures that we will initialize the global exactly once. - Destructors are registered in the initialization function using __tlregdtor. Differential Revision: http://reviews.llvm.org/D5597 llvm-svn: 219074	2014-10-05 05:05:40 +00:00
Arnaud A. de Grandmaison	42d314d1ba	Emit lifetime.start / lifetime.end markers for unnamed temporary objects. This will give more information to the optimizers so that they can reuse stack slots and reduce stack usage. llvm-svn: 218865	2014-10-02 12:19:51 +00:00
Alexander Musman	a5f070aec0	[OPENMP] Loop collapsing and codegen for 'omp simd' directive. This patch implements collapsing of the loops (in particular, in the presence of the 'collapse' clause). It calculates the number of iterations N and the expressions necessary to calculate the nested loop counter values based on a new iteration variable (that goes from 0 to N-1) in Sema. It also adds Codegen for 'omp simd', which uses (and tests) this feature. Differential Revision: http://reviews.llvm.org/D5184 llvm-svn: 218743	2014-10-01 06:03:56 +00:00
Alexander Musman	e4e893bb36	[OPENMP] Parsing/Sema of directive omp parallel for simd llvm-svn: 218299	2014-09-23 09:33:00 +00:00
Alexey Bataev	0bd520b767	[OPENMP] Initial parsing/sema analysis of 'target' directive. llvm-svn: 218110	2014-09-19 08:19:49 +00:00
David Majnemer	9928106536	MS ABI: Don't ICE for pointers to pointers to members of incomplete classes CodeGen would try to come up with an LLVM IR type for a pointer to member type on the way to forming an LLVM IR type for a pointer to pointer to member type. However, if the pointer to member representation has not been locked in yet, we would not be able to come up with a pointer to member IR type. In these cases, make the pointer to member type an incomplete type. This will make the pointer to pointer to member type a pointer to an incomplete type. If the class eventually obtains an inheritance model, we will make the pointer to member type represent the actual inheritance model. Differential Revision: http://reviews.llvm.org/D5373 llvm-svn: 218084	2014-09-18 22:05:54 +00:00
Alexander Musman	f82886e502	Parsing/Sema of directive omp for simd llvm-svn: 218029	2014-09-18 05:12:34 +00:00
Alexey Samsonov	8e1162c71d	Implement nonnull-attribute sanitizer Summary: This patch implements a new UBSan check, which verifies that function arguments declared to be nonnull with __attribute__((nonnull)) are actually nonnull in runtime. To implement this check, we pass FunctionDecl to CodeGenFunction::EmitCallArgs (where applicable) and if function declaration has nonnull attribute specified for a certain formal parameter, we compare the corresponding RValue to null as soon as it's calculated. Test Plan: regression test suite Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits, rnk Differential Revision: http://reviews.llvm.org/D5082 llvm-svn: 217389	2014-09-08 17:22:45 +00:00
Hal Finkel	bcc06085a8	Add __builtin_assume and __builtin_assume_aligned using @llvm.assume. This makes use of the recently-added @llvm.assume intrinsic to implement a __builtin_assume(bool) intrinsic (to provide additional information to the optimizer). This hooks up __assume in MS-compatibility mode to mirror __builtin_assume (the semantics have been intentionally kept compatible), and implements GCC's __builtin_assume_aligned as assume((p - o) & mask == 0). LLVM now contains special logic to deal with assumptions of this form. llvm-svn: 217349	2014-09-07 22:58:14 +00:00
Reid Kleckner	9b3e3dfc54	MS inline asm: Allow __asm blocks to set a return value If control falls off the end of a function after an __asm block, MSVC assumes that the inline assembly filled the EAX and possibly EDX registers with an appropriate return value. This functionality is used in inline functions returning 64-bit integers in system headers, so we need some amount of compatibility. This is implemented in Clang by adding extra output constraints to every inline asm block, and storing the resulting output registers into the return value slot. If we see an asm block somewhere in the function body, we emit a normal epilogue instead of marking the end of the function with a return type unreachable. Normal returns in functions not using this functionality will overwrite the return value slot, and in most cases LLVM should be able to eliminate the dead stores. Fixes PR17201. Reviewed By: majnemer Differential Revision: http://reviews.llvm.org/D5177 llvm-svn: 217187	2014-09-04 20:04:38 +00:00
Alexey Samsonov	cbe875a507	Kill one of EmitCallArgs overloads. NFC. llvm-svn: 216635	2014-08-28 00:22:11 +00:00
Craig Topper	3cb91b2ad1	Fix some cases where ArrayRefs were being passed by reference. llvm-svn: 216527	2014-08-27 06:28:16 +00:00
Alexey Samsonov	525bf650cc	Pass actual CXXConstructExpr instead of argument iterators into EmitSynthesizedCXXCopyCtorCall. No functionality change. llvm-svn: 216410	2014-08-25 21:58:56 +00:00
Alexey Samsonov	a5bf76bdf3	Pass actual CallExpr instead of CallExpr-specific iterators into EmitCXXMemberOrOperatorCall methods. In the end we want to make declaration visible in EmitCallArgs() method, that would allow us to alter CodeGen depending on function/parameter attributes. No functionality change. llvm-svn: 216404	2014-08-25 20:17:35 +00:00
David Blaikie	93be0b24b8	DebugInfo: Scope for condition variables more narrowly than the loop variable. for loops introduce two scopes - one for the outer loop variable and its initialization, and another for the body of the loop, including any variable declared inside the loop condition. llvm-svn: 216288	2014-08-22 21:37:04 +00:00
Alexey Samsonov	91cf455af1	CGCall: Factor out the logic mapping call arguments to LLVM IR arguments. Summary: This refactoring introduces ClangToLLVMArgMapping class, which encapsulates the information about the order in which function arguments listed in CGFunctionInfo should be passed to actual LLVM IR function, such as: 1) positions of sret, if there is any 2) position of inalloca argument, if there is any 3) position of helper padding argument for each call argument 4) positions of regular argument (there can be many if it's expanded). Simplify several related methods (ConstructAttributeList, EmitFunctionProlog and EmitCall): now they don't have to maintain iterators over the list of LLVM IR function arguments, dealing with all the sret/inalloca/this complexities, and just use expected positions of LLVM IR arguments stored in ClangToLLVMArgMapping. This may increase the running time of EmitFunctionProlog, as we have to traverse expandable arguments twice, but in further refactoring we will be able to speed up EmitCall by passing already calculated CallArgsToIRArgsMapping to ConstructAttributeList, thus avoiding traversing expandable argument there. No functionality change. Test Plan: regression test suite Reviewers: majnemer, rnk Reviewed By: rnk Subscribers: cfe-commits, rjmccall, timurrrr Differential Revision: http://reviews.llvm.org/D4938 llvm-svn: 216251	2014-08-22 01:06:06 +00:00
Alexey Samsonov	70b9c01bd4	Pass expressions instead of argument ranges to EmitCall/EmitCXXConstructorCall. Summary: This is a first small step towards passing generic "Expr" instead of ArgBeg/ArgEnd pair into EmitCallArgs() family of methods. Having "Expr" will allow us to get the corresponding FunctionDecl and its ParmVarDecls, thus allowing us to alter CodeGen depending on the function/parameter attributes. No functionality change. Test Plan: regression test suite Reviewers: rnk Reviewed By: rnk Subscribers: aemerson, cfe-commits Differential Revision: http://reviews.llvm.org/D4915 llvm-svn: 216214	2014-08-21 20:26:47 +00:00
Fariborz Jahanian	91b2fa2a9a	ext_vector IRGen. Patch to allow indexing into ext_vector_type's 'hi/lo' components when used as lvalue. rdar://18031917 pr20697 llvm-svn: 215991	2014-08-19 17:17:40 +00:00
Benjamin Kramer	2f5db8b3db	Header guard canonicalization, clang part. Modifications made by clang-tidy with minor tweaks. llvm-svn: 215557	2014-08-13 16:25:19 +00:00
Fariborz Jahanian	413297c53d	Objective-C ARC. First patch toward generating new APIs for Objective-C's array and dictionary literals. rdar://17554063. This is wip. llvm-svn: 214983	2014-08-06 18:13:46 +00:00
Fariborz Jahanian	bcd82afad6	Introduce f[no-]max-unknown-pointer-align=[number] option to instruct the code generator to not enforce a higher alignment than the given number (of bytes) when accessing memory via an opaque pointer or reference. Patch reviewed by John McCall (with post-commit review pending). rdar://16254558 llvm-svn: 214911	2014-08-05 18:37:48 +00:00
Reid Kleckner	fe5b4ed822	Remove separator parameter from static local naming code It was always set to ".", which was duplicated in a few places. llvm-svn: 214792	2014-08-04 22:35:30 +00:00
Reid Kleckner	ab2090d107	MS ABI: Use musttail for vtable thunks that pass arguments by value This moves some memptr specific code into the generic thunk emission codepath. Fixes PR20053. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D4613 llvm-svn: 214004	2014-07-26 01:34:32 +00:00
Reid Kleckner	3f76ac7daa	Remove an extra parameter and C++11 for loop-ify this code llvm-svn: 214003	2014-07-26 01:30:05 +00:00
Reid Kleckner	19819446eb	MS ABI: Don't push destructor cleanups for aggregate parameters in thunks The target method of the thunk will perform the cleanup. This can't be tested in 32-bit x86 yet because passing something by value would create an inalloca, and we refuse to generate broken code for that. llvm-svn: 213976	2014-07-25 21:39:46 +00:00
Alexey Bataev	0162e459ef	[OPENMP] Initial parsing and sema analysis for 'atomic' directive. llvm-svn: 213639	2014-07-22 10:10:35 +00:00
Alexey Bataev	9fb6e647e7	[OPENMP] Initial parsing and sema analysis for 'ordered' directive. llvm-svn: 213616	2014-07-22 06:45:04 +00:00
Arnaud A. de Grandmaison	6e24a46572	Revert "Emit lifetime.start / lifetime.end markers for unnamed temporary objects." This commit did break the sanitizer-x86 bot. Revert it while investigating. llvm-svn: 213579	2014-07-21 19:47:02 +00:00
Arnaud A. de Grandmaison	17a83cf4b6	Emit lifetime.start / lifetime.end markers for unnamed temporary objects. This will give more information to the optimizers so that they can reuse stack slots. llvm-svn: 213576	2014-07-21 18:54:21 +00:00
Alexey Bataev	6125da9258	[OPENMP] Initial parsing and sema analysis for 'flush' directive. llvm-svn: 213512	2014-07-21 11:26:11 +00:00
Alexander Musman	d9ed09f7a5	[OPENMP] Parsing/Sema of the OpenMP directive 'critical'. llvm-svn: 213510	2014-07-21 09:42:05 +00:00
Arnaud A. de Grandmaison	18bc4fff48	Revert "Emit lifetime.start / lifetime.end markers for unnamed temporary objects." This reverts commit dbf785a6432f78a8ec229665876647c4cc610d3d, while I am investigating a buildbot failure. llvm-svn: 213380	2014-07-18 14:23:58 +00:00
Arnaud A. de Grandmaison	1be89f4977	Emit lifetime.start / lifetime.end markers for unnamed temporary objects. This will give more information to the optimizers so that they can reuse stack slots. llvm-svn: 213379	2014-07-18 13:36:33 +00:00
Alexey Bataev	2df347ad96	[OPENMP] Initial parsing and sema analysis for 'taskwait' directive. llvm-svn: 213363	2014-07-18 10:17:07 +00:00
Alexey Bataev	4d1dfeabc9	[OPENMP] Initial parsing and sema analysis for 'barrier' directive. llvm-svn: 213360	2014-07-18 09:11:51 +00:00
Alexey Bataev	68446b7253	[OPENMP] Initial parsing and sema analysis of 'taskyield' directive. llvm-svn: 213355	2014-07-18 07:47:19 +00:00
Alexey Samsonov	24cad99307	[UBSan] Add !nosanitize metadata to the code generated by UBSan. This is used to mark the instructions emitted by Clang to implement a variety of UBSan checks. Generally, we don't want to instrument these instructions with other sanitizers (like ASan). Reviewed in http://reviews.llvm.org/D4544 llvm-svn: 213291	2014-07-17 18:46:27 +00:00
Alexander Musman	80c2289a03	[OPENMP] Parsing/Sema analysis of directive 'master' llvm-svn: 213237	2014-07-17 08:54:58 +00:00
Alexey Bataev	9c2e8ee72f	[OPENMP] Parsing and sema analysis for 'omp task' directive. llvm-svn: 212804	2014-07-11 11:25:16 +00:00
David Blaikie	1b5adb82d9	Fix the dtor location issues in PR20038 harder. Originally committed in r211722, this fixed one case of dtor calls being emitted without locations (this causes problems for debug info if the call is then inlined), this caught only some of the cases. Instead of trying to re-enable the location before the cleanup, simply re-enable the location immediately after the unconditional branches in question using a scoped device to ensure the no-location state doesn't leak out arbitrarily. llvm-svn: 212761	2014-07-10 20:42:59 +00:00
Alexey Bataev	84d0b3efee	[OPENMP] Parsing and sema analysis for 'omp parallel sections' directive. llvm-svn: 212516	2014-07-08 08:12:03 +00:00
Alexey Samsonov	ac4afe49e7	[Sanitizer] Remove brittle cache variable and slightly simplify blacklisting code. Now CodeGenFunction is responsible for looking at sanitizer blacklist (in CodeGenFunction::StartFunction) and turning off instrumentation, if necessary. No functionality change. llvm-svn: 212501	2014-07-07 23:59:57 +00:00
Alexey Bataev	4acb859fbd	[OPENMP] Added initial support for 'omp parallel for'. llvm-svn: 212453	2014-07-07 13:01:15 +00:00
Nico Weber	9b982078e9	Add an AST node for __leave statements, hook it up. Codegen is still missing (and I won't work on that), but __leave is now as implemented as __try and friends. llvm-svn: 212425	2014-07-07 00:12:30 +00:00
Logan Chien	e9c8ccbf8f	Remove CleanupHackLevel from CGException. This patch removes the dead code, and refines the getEHResumeBlock() slightly. The CleanupHackLevel was a hack to the old exception handling intrinsics, which have several issues with function inliner. Since LLVM 3.0, the new landingpad and resume instructions are added to LLVM IR. With the new exception handling mechanism, most of the issues are fixed now. We should always use these instructions to implement the exception handling code nowadays, and we don't need the hack any more. Besides, the `CleanupHackLevel` is a compile-time constant, thus other cases have been considered as dead code for a while. llvm-svn: 212097	2014-07-01 11:47:10 +00:00
Alexey Bataev	aca7fcf276	Use of variable-length arrays in captured statements and OpenMP constructs. Differential Revision: http://reviews.llvm.org/D4067 llvm-svn: 212010	2014-06-30 02:55:54 +00:00
Craig Topper	00bbdcf9b3	Remove llvm:: from uses of ArrayRef. llvm-svn: 211987	2014-06-28 23:22:23 +00:00
Alexey Bataev	d1e40fbfe1	[OPENMP] Initial parsing and sema analysis for 'single' directive. llvm-svn: 211774	2014-06-26 12:05:45 +00:00
Alexey Bataev	1e0498a92d	[OPENMP] Initial parsing and sema analysis for 'section' directive. llvm-svn: 211767	2014-06-26 08:21:58 +00:00
Alexey Bataev	d3f8dd2d15	[OPENMP] Initial support for 'sections' directive. llvm-svn: 211685	2014-06-25 11:44:49 +00:00
Matt Arsenault	56f008d538	Add R600 builtin codegen. llvm-svn: 211631	2014-06-24 20:45:01 +00:00
Tim Northover	6ea28bdef5	ARM: remove dead CodeGen functions. These two are no longer being used by NEON codegen. llvm-svn: 211586	2014-06-24 12:07:44 +00:00
Alexey Bataev	f29276edb7	[OPENMP] Initial support for '#pragma omp for' (fixed incompatibility with MSVC). llvm-svn: 211140	2014-06-18 04:14:57 +00:00
Rafael Espindola	a566efbec9	Revert "[OPENMP] Initial support for '#pragma omp for'." This reverts commit r211096. Looks like it broke the msvc build: SemaOpenMP.cpp(140) : error C4519: default template arguments are only allowed on a class template llvm-svn: 211113	2014-06-17 17:20:53 +00:00
Alexey Bataev	c77dd5257a	[OPENMP] Initial support for '#pragma omp for'. llvm-svn: 211096	2014-06-17 11:49:22 +00:00
Aaron Ballman	b06b15aa28	Adding a new #pragma for the vectorize and interleave optimization hints. Patch thanks to Tyler Nowicki! llvm-svn: 210330	2014-06-06 12:40:24 +00:00
Richard Smith	760520bcb7	Add __builtin_operator_new and __builtin_operator_delete, which act like calls to the normal non-placement ::operator new and ::operator delete, but allow optimizations like new-expressions and delete-expressions do. llvm-svn: 210137	2014-06-03 23:27:44 +00:00
Richard Smith	06a67e2c6f	When emitting a multidimensional array new, emit the initializers for the trailing elements as a single loop, rather than sometimes emitting a nest of several loops. This fixes a bug where CodeGen would sometimes try to emit an expression with the wrong type for the element being initialized. Plus various other minor cleanups to the IR produced for array new initialization. llvm-svn: 210079	2014-06-03 06:58:52 +00:00
Tim Northover	573cbee543	AArch64/ARM64: rename ARM64 components to AArch64 This keeps Clang consistent with backend naming conventions. llvm-svn: 209579	2014-05-24 12:52:07 +00:00
Tim Northover	25e8a6754e	AArch64/ARM64: update Clang after AArch64 removal. A few (mostly CodeGen) parts of Clang were tightly coupled to the AArch64 backend. Now that it's gone, they will not even compile. I've also deduplicated RUN lines in many of the AArch64 tests. This might improve "make check-all" time noticeably: some of those NEON tests were monsters. llvm-svn: 209578	2014-05-24 12:51:25 +00:00
Alexander Musman	515ad8c490	This patch adds a helper class (CGLoopInfo) for marking memory instructions with llvm.mem.parallel_loop_access metadata. It also adds a simple initial version of codegen for pragma omp simd (it will change in the future to support all the clauses). Differential revision: http://reviews.llvm.org/D3644 llvm-svn: 209411	2014-05-22 08:54:05 +00:00
Craig Topper	8a13c4180e	[C++11] Use 'nullptr'. CodeGen edition. llvm-svn: 209272	2014-05-21 05:09:00 +00:00
Renato Golin	230c5eb4bd	Non-allocatable Global Named Register This patch implements global named registers in Clang, lowering to the just created intrinsics in LLVM (@llvm.read/write_register). A new type of LValue had to be created (Register), which just adds support to carry the metadata node containing the name of the register. Two new methods to emit loads and stores interoperate with another to emit the named metadata node. No guarantees are being made and only non-allocatable global variable named registers are being supported. Local named register support is unchanged. llvm-svn: 209149	2014-05-19 18:15:42 +00:00
Rafael Espindola	42ae74531c	Don't indent in namespaces. llvm-svn: 208384	2014-05-09 00:57:59 +00:00
Alexey Bataev	9959db5fa9	[OPENMP] Initial codegen for '#pragma omp parallel' llvm-svn: 208077	2014-05-06 10:08:46 +00:00
Justin Bogner	81ab90f7ed	CodeGen: Handle CapturedStmt in instrumentation based profiling CapturedStmt was being ignored by instrumentation based profiling, and its counters attributed to the containing function. Instead, we need to treat this as a top level entity, like we do with blocks. llvm-svn: 206231	2014-04-15 00:50:54 +00:00
Adrian Prantl	22e66b434a	Cleanup: Add default arguments to CodeGenFunction::StartFunction. Thanks dblaikie for the suggestion! llvm-svn: 206012	2014-04-11 01:13:04 +00:00
Adrian Prantl	42d71b9906	Debug info: (Bugfix) Make sure artificial functions like _GLOBAL__I_a are not associated with any source lines. Previously, if the Location of a Decl was empty, EmitFunctionStart would just keep using CurLoc, which would sometimes be correct (e.g., thunks) but in other cases would just point to a hilariously random location. This patch fixes this by completely eliminating all uses of CurLoc from EmitFunctionStart and rather have clients explicitly pass in a SourceLocation for the function header and the function body. rdar://problem/14985269 llvm-svn: 205999	2014-04-10 23:21:53 +00:00
Tim Northover	a2ee433c8d	ARM64: initial clang support commit. This adds Clang support for the ARM64 backend. There are definitely still some rough edges, so please bring up any issues you see with this patch. As with the LLVM commit though, we think it'll be more useful for merging with AArch64 from within the tree. llvm-svn: 205100	2014-03-29 15:09:45 +00:00
Eli Bendersky	cb39943f6f	Proper handling of static local variables with address space qualifiers. Similar to the implementation for globals in r157167. Patch by Jingyue Wu. llvm-svn: 204677	2014-03-24 22:05:38 +00:00
Chandler Carruth	61743af166	[Modules] Update to reflect ValueHandle moving to the IR library in LLVM r202821. llvm-svn: 202822	2014-03-04 11:18:19 +00:00
Tim Northover	8fe03d6111	ARM & AArch64: use table for EmitCommonNeonBuiltinExpr This extends the intrinsic lookup table format slightly, and adds entries for use by the shared ARM/AArch64 definitions. The benefit is currently smaller than for the SISD intrinsics (there's more custom code implementing this set), but a few lines are saved and there's scope for future expansion. llvm-svn: 201848	2014-02-21 11:57:24 +00:00
Tim Northover	2d83796860	AArch64: refactor table-driven NEON lookup. This extracts the table-driven intrinsic lookup phase into a separate function, to be used by EmitCommonNeonBuiltinExpr soon. It also simplifies the logic used in that lookup, since VectorCastArgN and ScalarArgN were actually identical. llvm-svn: 201847	2014-02-21 11:57:20 +00:00
Bob Wilson	bf854f0f53	Change PGO instrumentation to compute counts in a separate AST traversal. Previously, we made one traversal of the AST prior to codegen to assign counters to the ASTs and then propagated the count values during codegen. This patch now adds a separate AST traversal prior to codegen for the -fprofile-instr-use option to propagate the count values. The counts are then saved in a map from which they can be retrieved during codegen. This new approach has several advantages: 1. It gets rid of a lot of extra PGO-related code that had previously been added to codegen. 2. It fixes a serious bug. My original implementation (which was mailed to the list but never committed) used 3 counters for every loop. Justin improved it to move 2 of those counters into the less-frequently executed breaks and continues, but that turned out to produce wrong count values in some cases. The solution requires visiting a loop body before the condition so that the count for the condition properly includes the break and continue counts. Changing codegen to visit a loop body first would be a fairly invasive change, but with a separate AST traversal, it is easy to control the order of traversal. I've added a testcase (provided by Justin) to make sure this works correctly. 3. It improves the instrumentation overhead, reducing the number of counters for a loop from 3 to 1. We no longer need dedicated counters for breaks and continues, since we can just use the propagated count values when visiting breaks and continues. To make this work, I needed to make a change to the way we count case statements, going back to my original approach of not including the fall-through in the counter values. This was necessary because there isn't always an AST node that can be used to record the fall-through count. Now case statements are handled the same as default statements, with the fall-through paths branching over the counter increments. While I was at it, I also went back to using this approach for do-loops -- omitting the fall-through count into the loop body simplifies some of the calculations and makes them behave the same as other loops. Whenever we start using this instrumentation for coverage, we'll need to add the fall-through counts into the counter values. llvm-svn: 201528	2014-02-17 19:21:09 +00:00
Fariborz Jahanian	7741101dce	[IRGen]. Fixes a crash in using Objective-C array properties by fixing shouldBindAsLValue to accept arrays (like record types) because we always manipulate them in memory. Patch suggested by John McCall. // rdar://15610943 llvm-svn: 201428	2014-02-14 19:37:25 +00:00
Reid Kleckner	314ef7bafd	[ms-cxxabi] Use inalloca on win32 when passing non-trivial C++ objects When a non-trivial parameter is present, clang now gathers up all the parameters that lack inreg and puts them into a packed struct. MSVC always aligns each parameter to 4 bytes and no more, so this is a pretty simple struct to lay out. On win64, non-trivial records are passed indirectly. Prior to this change, clang was incorrectly using byval on win64. I'm able to self-host a working clang with this change and additional LLVM patches. Reviewers: rsmith Differential Revision: http://llvm-reviews.chandlerc.com/D2636 llvm-svn: 200597	2014-02-01 00:04:45 +00:00
Tim Northover	027b4ee607	ARM & AArch64: move shared vld/vst intrinsics to common implementation. llvm-svn: 200526	2014-01-31 10:46:45 +00:00
Tim Northover	58c4474dea	ARM & AArch64: extend shared NEON implementation to first block. This extends the refactoring to the whole of the first block of trivial correspondences (as a fairly arbitrary boundary). llvm-svn: 200472	2014-01-30 14:48:01 +00:00
Tim Northover	ac85c341ae	ARM & AArch64: fully share NEON implementation of permutation intrinsics As a starting point, this moves the CodeGen for NEON permutation instructions (vtrn, vzip, vuzp) into a new shared function. llvm-svn: 200471	2014-01-30 14:47:57 +00:00
Justin Bogner	e25ffdf8a1	Revert "CodeGen: Simplify CodeGenFunction::EmitCaseStmt" I misunderstood the discussion on this. The complexity here is justified by the malloc overhead it saves. This reverts commit r199302. llvm-svn: 199700	2014-01-21 00:35:11 +00:00
Alp Toker	9cacbabd33	Rename FunctionProtoType accessors from 'arguments' to 'parameters' Fix a perennial source of confusion in the clang type system: Declarations and function prototypes have parameters to which arguments are supplied, so calling these 'arguments' was a stretch even in C mode, let alone C++ where default arguments, templates and overloading make the distinction important to get right. Readability win across the board, especially in the casting, ADL and overloading implementations which make a lot more sense at a glance now. Will keep an eye on the builders and update dependent projects shortly. No functional change. llvm-svn: 199686	2014-01-20 20:26:09 +00:00
Justin Bogner	4c5c99f91a	CodeGen: Simplify CodeGenFunction::EmitCaseStmt Way back in r129652 we tried to avoid emitting an empty block at -O0 for switch cases that did nothing but break. This led to a poor debugging experience as reported in PR9796, so we disabled the optimization for -O0 but left it in for higher optimization levels in r154420. Since the whole point of this was to improve -O0, it's silly to keep the complexity at all. llvm-svn: 199302	2014-01-15 07:30:30 +00:00
Adrian Prantl	e83b130def	Revert "Debug info: Ensure that the last stop point in a function is still within" This reverts commit r198461. llvm-svn: 198714	2014-01-07 22:05:52 +00:00
Adrian Prantl	c6758879b3	Revert "Debug info: Implement a cleaner version of r198461. For symmetry with" This reverts commit 198699 so we can get a cleaner patch. llvm-svn: 198713	2014-01-07 22:05:45 +00:00
Adrian Prantl	f5ff0dc29b	Debug info: Implement a cleaner version of r198461. For symmetry with C and C++ don't emit an extra lexical scope for the compound statement that is the body of an Objective-C method. rdar://problem/15010825 llvm-svn: 198699	2014-01-07 19:24:24 +00:00
Chandler Carruth	5553d0d4ca	Sort all the #include lines with LLVM's utils/sort_includes.py which encodes the canonical rules for LLVM's style. I noticed this had drifted quite a bit when cleaning up LLVM, so wanted to clean up Clang as well. llvm-svn: 198686	2014-01-07 11:51:46 +00:00
Justin Bogner	ef512b9929	CodeGen: Initial instrumentation based PGO implementation llvm-svn: 198640	2014-01-06 22:27:43 +00:00
Adrian Prantl	96e70d9148	Debug info: Ensure that the last stop point in a function is still within the lexical block formed by the compound statement that is the function body. rdar://problem/15010825 llvm-svn: 198461	2014-01-03 23:34:30 +00:00
Reid Kleckner	89077a1b00	[ms-cxxabi] The 'most derived' ctor parameter usually comes last Unlike Itanium's VTTs, the 'most derived' boolean or bitfield is the last parameter for non-variadic constructors, rather than the second. For variadic constructors, the 'most derived' parameter comes after the 'this' parameter. This affects constructor calls and constructor decls in a variety of places. Reviewers: timurrrr Differential Revision: http://llvm-reviews.chandlerc.com/D2405 llvm-svn: 197518	2013-12-17 19:46:40 +00:00
Reid Kleckner	739756c0f9	[ms-cxxabi] Construct and destroy call arguments in the correct order Summary: MSVC destroys arguments in the callee from left to right. Because C++ objects have to be destroyed in the reverse order of construction, Clang has to construct arguments from right to left and destroy arguments from left to right. This patch fixes the ordering by reversing the order of evaluation of all call arguments under the MS C++ ABI. Fixes PR18035. Reviewers: rsmith Differential Revision: http://llvm-reviews.chandlerc.com/D2275 llvm-svn: 196402	2013-12-04 19:23:12 +00:00
Hans Wennborg	88497d6157	[-cxx-abi microsoft] Emit thunks for pointers to virtual member functions Instead of storing the vtable offset directly in the function pointer and doing a branch to check for virtualness at each call site, the MS ABI generates a thunk for calling the function at a specific vtable offset, and puts that in the function pointer. This patch adds support for emitting such thunks. However, it doesn't support pointers to virtual member functions that are variadic, have an incomplete aggregate return type or parameter, or are overriding a function in a virtual base class. Differential Revision: http://llvm-reviews.chandlerc.com/D2104 llvm-svn: 194827	2013-11-15 17:24:45 +00:00
Kevin Qin	1718af6f0a	Implement aarch64 neon instruction class misc. llvm-svn: 194657	2013-11-14 02:45:18 +00:00
Richard Smith	b47c36f8e1	C++1y sized deallocation: if we have a use, but not a definition, of a sized deallocation function (and the corresponding unsized deallocation function has been declared), emit a weak discardable definition of the function that forwards to the corresponding unsized deallocation. This allows a C++ standard library implementation to provide both a sized and an unsized deallocation function, where the unsized one does not just call the sized one, for instance by putting both in the same object file within an archive. llvm-svn: 194055	2013-11-05 09:12:18 +00:00
Peter Collingbourne	b453cd64a7	Implement function type checker for the undefined behavior sanitizer. This uses function prefix data to store function type information at the function pointer. Differential Revision: http://llvm-reviews.chandlerc.com/D1338 llvm-svn: 193058	2013-10-20 21:29:19 +00:00
Amaury de la Vieuville	21bf6ed730	Do not emit undefined lshr/ashr for NEON shifts These IR instructions are undefined when the amount is equal to operand size, but NEON right shifts support such shifts. Work around that by emitting a different IR in these cases. llvm-svn: 191953	2013-10-04 13:13:15 +00:00
Nick Lewycky	2d84e84236	Thread a SourceLocation into the EmitCheck for "load_invalid_value". This occurs when scalars are loaded / undergo lvalue-to-rvalue conversion. llvm-svn: 191808	2013-10-02 02:29:49 +00:00
Faisal Vali	571df12581	Implement conversion to function pointer for generic lambdas without captures. The general strategy is to create template versions of the conversion function and static invoker and then during template argument deduction of the conversion function, create the corresponding call-operator and static invoker specializations, and when the conversion function is marked referenced generate the body of the conversion function using the corresponding static-invoker specialization. Similarly, Codegen does something similar - when asked to emit the IR for a specialized static invoker of a generic lambda, it forwards emission to the corresponding call operator. This patch has been reviewed in person both by Doug and Richard. Richard gave me the LGTM. A few minor changes: - per Richard's request I added a simple check to gracefully inform that captures (init, explicit or default) have not been added to generic lambdas just yet (instead of the assertion violation). - I removed a few lines of code that added the call operator's instantiated parameters to the currentinstantiationscope. Not only did it not handle parameter packs, but it is more relevant in the patch for nested lambdas which will follow this one, and fix that problem more comprehensively. - Doug had commented that the original implementation strategy of using the TypeSourceInfo of the call operator to create the static-invoker was flawed and allowed const as a member qualifier to creep into the type of the static-invoker. I currently kludge around it - but after my initial discussion with Doug, with a follow up session with Richard, I have added a FIXME so that a more elegant solution that involves the use of TrivialTypeSourceInfo call followed by the correct wiring of the template parameters to the functionprototypeloc is forthcoming. Thanks! llvm-svn: 191634	2013-09-29 08:45:24 +00:00
Reid Kleckner	543a16c06b	Emit an error when attempting to generate IR for SEH __try Currently we silently omit the code in the try and finally bodies, which is pretty bad. This way we fail loudly. llvm-svn: 190809	2013-09-16 21:46:30 +00:00
Yunzhong Gao	0ebf1bb150	Revert r189649 because it was breaking sanitizer bots. llvm-svn: 189660	2013-08-30 08:53:09 +00:00
Yunzhong Gao	be8d7ba93a	Fixing a bug where debug info for a local variable gets emitted at file scope. The patch was discussed in Phabricator. See: http://llvm-reviews.chandlerc.com/D1281 llvm-svn: 189649	2013-08-30 05:37:02 +00:00
David Blaikie	ebe87e1cfa	Revert "PR14569: Omit debug info for thunks" This reverts commit r189320. Alexey Samsonov and Dmitry Vyukov presented some arguments for keeping these around - though it still seems like those tasks could be solved by a tool just using the symbol table. In a very small number of cases, thunks may be inlined & debug info might be able to save profilers & similar tools from misclassifying those cases as part of the caller. The extra changes here plumb through the VarDecl for various cases to CodeGenFunction - this provides better fidelity through a few APIs but generally just causes the CGF::StartFunction to fall back to using the name of the IR function as the name in the debug info. The changes to debug-info-global-ctor-dtor.cpp seem like goodness. The two names that go missing (in favor of only emitting those names as linkage names) are names that can be demangled - emitting them only as the linkage name should encourage tools to do just that. Again, thanks to Dinesh Dwivedi for investigation/work on this issue. llvm-svn: 189421	2013-08-27 23:57:18 +00:00
David Blaikie	92848dee31	Simplify/clean up debug info suppression in CodeGenFunction CodeGenFunction is run on only one function - a new object is made for each new function. I would add an assertion/flag to this effect, but there's an exception: ObjC properties involve emitting helper functions that are all emitted by the same CodeGenFunction object, so such a check is not possible/correct. llvm-svn: 189277	2013-08-26 20:33:21 +00:00
Benjamin Kramer	7463ed7c89	CodeGen: Unify two implementations of canDevirtualizeMemberFunctionCall. They were mostly copy&paste of each other, move it to CodeGenFunction. Of course the two implementations have diverged over time; the one in CGExprCXX seems to be the more modern one so I picked that one and moved it to CGClass which feels like a better home for it. No intended functionality change. llvm-svn: 189203	2013-08-25 22:46:27 +00:00
Timur Iskhodzhanov	d8fa10db12	[CGF] Get rid of passing redundant VTable pointer around in CodeGenFunction::InitializeVTablePointer[s] llvm-svn: 188909	2013-08-21 17:33:16 +00:00
Timur Iskhodzhanov	88fd439a24	Abstract out virtual calls and virtual function prologue code generation; implement them for -cxx-abi microsoft llvm-svn: 188870	2013-08-21 06:25:03 +00:00
David Blaikie	4a9ec7b59d	PR16933: Don't try to codegen things after we've seen errors. Refactor the underlying code a bit to remove unnecessary calls to "hasErrorOccurred" & make them consistently at all the entry points to the IRGen ASTConsumer. llvm-svn: 188707	2013-08-19 21:02:26 +00:00
Adrian Prantl	ca64c3e136	Debug Info / EmitCallArgs: arguments may modify the debug location. Restore it after each argument is emitted. This fixes the scope info for inlined subroutines inside of function argument expressions. (E.g., anything STL). rdar://problem/12592135 llvm-svn: 187240	2013-07-26 20:42:57 +00:00
Timur Iskhodzhanov	03e8746f90	Simplify the CodeGenFunction::BuildVirtualCall family of functions llvm-svn: 186657	2013-07-19 08:14:45 +00:00
Craig Topper	5603df45df	Use SmallVectorImpl& for function arguments instead of SmallVector. llvm-svn: 185715	2013-07-05 19:34:19 +00:00
Stephen Lin	9dc6eef755	Restore r184205 and associated commits (after commit of r185290) This allows clang to use the backend parameter attribute 'returned' when generating 'this'-returning constructors and destructors in ARM and MSVC C++ ABIs. llvm-svn: 185291	2013-06-30 20:40:16 +00:00
Eli Friedman	c7ad5c4e29	Delete dead code. llvm-svn: 185119	2013-06-28 00:23:34 +00:00
Stephen Lin	19cee1871e	Revert r184205 and associated patches while investigating issue with broken buildbot (possible interaction with LTO) <rdar://problem/14209661> llvm-svn: 184384	2013-06-19 23:23:19 +00:00
Reid Kleckner	d29f1342c2	[CodeGen] Move EHScopeStack into its own header CGCleanup.h isn't meant to be included by all of CodeGen according to John. llvm-svn: 184321	2013-06-19 17:07:50 +00:00
Stephen Lin	a637fb8ccd	CodeGen: Have 'this'-returning constructors and destructors take advantage of the new backend 'returned' attribute. The backend will now use the generic 'returned' attribute to form tail calls where possible, as well as avoid save-restores of 'this' in some cases (specifically the cases that matter for the ARM C++ ABI). This patch also reverts a prior front-end only partial implementation of these optimizations, since it's no longer required. llvm-svn: 184205	2013-06-18 17:00:49 +00:00
Richard Smith	a1c9d4d932	Simplify: we don't need any special-case lifetime extension when initializing declarations of reference type; they're handled by the general case handling of MaterializeTemporaryExpr. llvm-svn: 183875	2013-06-12 23:38:09 +00:00
Richard Smith	cc1b96d356	PR12086, PR15117 Introduce CXXStdInitializerListExpr node, representing the implicit construction of a std::initializer_list<T> object from its underlying array. The AST representation of such an expression goes from an InitListExpr with a flag set, to a CXXStdInitializerListExpr containing a MaterializeTemporaryExpr containing an InitListExpr (possibly wrapped in a CXXBindTemporaryExpr). This more detailed representation has several advantages, the most important of which is that the new MaterializeTemporaryExpr allows us to directly model lifetime extension of the underlying temporary array. Using that, this patch drastically simplifies the IR generation of this construct, provides IR generation support for nested global initializer_list objects, fixes several bugs where the destructors for the underlying array would accidentally not get invoked, and provides constant expression evaluation support for std::initializer_list objects. llvm-svn: 183872	2013-06-12 22:31:48 +00:00
Richard Smith	736a947bdc	Reapply r183721, reverted in r183776, with a fix for a bug in the former (we were lacking ExprWithCleanups nodes in some cases where the new approach to lifetime extension needed them). Original commit message: Rework IR emission for lifetime-extended temporaries. Instead of trying to walk into the expression and dig out a single lifetime-extended entity and manually pull its cleanup outside the expression, instead keep a list of the cleanups which we'll need to emit when we get to the end of the full-expression. Also emit those cleanups early, as EH-only cleanups, to cover the case that the full-expression does not terminate normally. This allows IR generation to properly model temporary lifetime when multiple temporaries are extended by the same declaration. We have a pre-existing bug where an exception thrown from a temporary's destructor does not clean up lifetime-extended temporaries created in the same expression and extended to automatic storage duration; that is not fixed by this patch. llvm-svn: 183859	2013-06-12 20:42:33 +00:00
Eli Friedman	f045007f11	Add support for complex compound assignments where the LHS is a scalar. Fixes <rdar://problem/11224126> and PR12790. llvm-svn: 183821	2013-06-12 01:40:06 +00:00
Richard Smith	4a28f534e1	Revert r183721. It caused cleanups to be delayed too long in some cases. Testcase to follow. llvm-svn: 183776	2013-06-11 19:14:25 +00:00
Richard Smith	7c5d4dce49	Rework IR emission for lifetime-extended temporaries. Instead of trying to walk into the expression and dig out a single lifetime-extended entity and manually pull its cleanup outside the expression, instead keep a list of the cleanups which we'll need to emit when we get to the end of the full-expression. Also emit those cleanups early, as EH-only cleanups, to cover the case that the full-expression does not terminate normally. This allows IR generation to properly model temporary lifetime when multiple temporaries are extended by the same declaration. We have a pre-existing bug where an exception thrown from a temporary's destructor does not clean up lifetime-extended temporaries created in the same expression and extended to automatic storage duration; that is not fixed by this patch. llvm-svn: 183721	2013-06-11 02:41:00 +00:00
Eli Friedman	4871a46cc3	Make sure we don't emit invalid IR for StmtExprs with complex cleanups. Fixes <rdar://problem/14074868>. llvm-svn: 183699	2013-06-10 22:04:49 +00:00
Reid Kleckner	200fe22a13	[CodeGen] Move EHScopeStack to CGCleanup.h from CodeGenFunction.h No functionality change. CGCleanup.cpp provides the implementation for EHScopeStack, so it seems more consistent to place the class definition in CGCleanup.h. This should also help solve a header ordering problem that I have. llvm-svn: 183631	2013-06-09 16:45:02 +00:00
Reid Kleckner	d8cbeec178	[ms-cxxabi] Implement MSVC virtual base adjustment While we can't yet emit vbtables, this allows us to find virtual bases of objects constructed in other TUs. This makes iostream hello world work, since basic_ostream virtually inherits from basic_ios. Differential Revision: http://llvm-reviews.chandlerc.com/D795 llvm-svn: 182870	2013-05-29 18:02:47 +00:00
Adrian Prantl	dc237b52bc	Cleanup: Use a member variable to store the SourceLocation for EH code. rdar://problem/13888152 llvm-svn: 181957	2013-05-16 00:41:26 +00:00
David Blaikie	7d17010db5	Use only explicit bool conversion operator The most common (non-buggy) cases are where such objects are used as return expressions in bool-returning functions or as boolean function arguments. In those cases I've used (& added if necessary) a named function to provide the equivalent (or sometimes negative, depending on convenient wording) test. DiagnosticBuilder kept its implicit conversion operator owing to the prevalent use of it in return statements. One bug was found in ExprConstant.cpp involving a comparison of two PointerUnions (PointerUnion did not previously have an operator==, so instead both operands were converted to bool & then compared). A test is included in test/SemaCXX/constant-expression-cxx1y.cpp for the fix (adding operator== to PointerUnion in LLVM). llvm-svn: 181869	2013-05-15 07:37:26 +00:00
Ben Langmuir	3b4c30b7e7	CodeGen for CapturedStmts EmitCapturedStmt creates a captured struct containing all of the captured variables, and then emits a call to the outlined function. This is similar in principle to EmitBlockLiteral. GenerateCapturedFunction actually produces the outlined function. It is based on GenerateBlockFunction, but is much simpler. The function type is determined by the parameters that are in the CapturedDecl. Some changes have been added to this patch that were reviewed as part of the serialization patch and moving the parameters to the captured decl. Differential Revision: http://llvm-reviews.chandlerc.com/D640 llvm-svn: 181536	2013-05-09 19:17:11 +00:00
Richard Smith	ea85232c40	Don't crash in IRGen if a conditional with 'throw' in one of its branches is used as a branch condition. llvm-svn: 181368	2013-05-07 21:53:22 +00:00
Tim Northover	8ec8c4bf89	AArch64: teach Clang about __clear_cache intrinsic libgcc provides a __clear_cache intrinsic on AArch64, much like it does on 32-bit ARM. llvm-svn: 181111	2013-05-04 07:15:13 +00:00
Adrian Prantl	52bf3c4c3f	Reapply r180982 with repaired logic and an additional testcase. Un-break the gdb buildbot. - Use the debug location of the return expression for the cleanup code if the return expression is trivially evaluatable, regardless of the number of stop points in the function. - Ensure that any EH code in the cleanup still gets the line number of the closing } of the lexical scope. - Added a testcase with EH in the cleanup. rdar://problem/13442648 llvm-svn: 181056	2013-05-03 20:11:48 +00:00
John McCall	dec348f7db	Correctly emit certain implicit references to 'self' even within a lambda. Bug #1 is that CGF's CurFuncDecl was "stuck" at lambda invocation functions. Fix that by generally improving getNonClosureContext to look through lambdas and captured statements but only report code contexts, which is generally what's wanted. Audit uses of CurFuncDecl and getNonClosureAncestor for correctness. Bug #2 is that lambdas weren't specially mapping 'self' when inside an ObjC method. Fix that by removing the requirement for that and using the normal EmitDeclRefLValue path in LoadObjCSelf. rdar://13800041 llvm-svn: 181000	2013-05-03 07:33:41 +00:00
Adrian Prantl	857f92371a	Revert "Attempt to un-break the gdb buildbot." This reverts commit 180982. llvm-svn: 180990	2013-05-03 01:42:35 +00:00
Adrian Prantl	44f38013e2	Attempt to un-break the gdb buildbot. - Use the debug location of the return expression for the cleanup code if the return expression is trivially evaluatable, regardless of the number of stop points in the function. - Ensure that any EH code in the cleanup still gets the line number of the closing } of the lexical scope. - Added a testcase with EH in the cleanup. rdar://problem/13442648 llvm-svn: 180982	2013-05-03 00:44:13 +00:00
Adrian Prantl	3be10542af	Ensure that the line table for functions with cleanups is sequential. If there is cleanup code, the cleanup code gets the debug location of the closing '}'. The subsequent ret IR-instruction does not get a debug location. The return _expression_ will get the debug location of the return statement. If the function contains only a single, simple return statement, the cleanup code may become the first breakpoint in the function. In this case we set the debug location for the cleanup code to the location of the return statement. rdar://problem/13442648 llvm-svn: 180932	2013-05-02 17:30:20 +00:00
Benjamin Kramer	139cfc2e63	ArrayRefize code. No functionality change. llvm-svn: 180632	2013-04-26 21:32:52 +00:00
Richard Smith	852c9db72b	C++1y: Allow aggregates to have default initializers. Add a CXXDefaultInitExpr, analogous to CXXDefaultArgExpr, and use it both in CXXCtorInitializers and in InitListExprs to represent a default initializer. There's an additional complication here: because the default initializer can refer to the initialized object via its 'this' pointer, we need to make sure that 'this' points to the right thing within the evaluation. llvm-svn: 179958	2013-04-20 22:23:05 +00:00
Richard Smith	2fd1d7aee3	Implement CodeGen for C++11 thread_local, following the Itanium ABI specification as discussed on cxx-abi-dev. llvm-svn: 179858	2013-04-19 16:42:07 +00:00
John McCall	c8e0170578	Standardize accesses to the TargetInfo in IR-gen. Patch by Stephen Lin! llvm-svn: 179638	2013-04-16 22:48:15 +00:00
Tareq A. Siraj	24110cc733	Implement CapturedStmt AST. CapturedStmt can be used to implement generic function outlining as described in http://lists.cs.uiuc.edu/pipermail/cfe-dev/2013-January/027540.html. CapturedStmt is not exposed to the C api. Serialization and template support are pending. Author: Wei Pan <wei.pan@intel.com> Differential Revision: http://llvm-reviews.chandlerc.com/D370 llvm-svn: 179615	2013-04-16 18:53:08 +00:00
Manman Ren	c451e5766e	Initial support for struct-path aware TBAA. Added TBAABaseType and TBAAOffset in LValue. These two fields are initialized to the actual type and 0, and are updated in EmitLValueForField. Path-aware TBAA tags are enabled for EmitLoadOfScalar and EmitStoreOfScalar. Added command line option -struct-path-tbaa. llvm-svn: 178797	2013-04-04 21:53:22 +00:00
Manman Ren	092d9e8f3b	revert r178784 since it does not have a commit message llvm-svn: 178796	2013-04-04 21:51:07 +00:00
Manman Ren	037d2b252d	Index: include/clang/Driver/CC1Options.td =================================================================== --- include/clang/Driver/CC1Options.td (revision 178718) +++ include/clang/Driver/CC1Options.td (working copy) @@ -161,6 +161,8 @@ HelpText<"Use register sized accesses to bit-fields, when possible.">; def relaxed_aliasing : Flag<["-"], "relaxed-aliasing">, HelpText<"Turn off Type Based Alias Analysis">; +def struct_path_tbaa : Flag<["-"], "struct-path-tbaa">, + HelpText<"Turn on struct-path aware Type Based Alias Analysis">; def masm_verbose : Flag<["-"], "masm-verbose">, HelpText<"Generate verbose assembly output">; def mcode_model : Separate<["-"], "mcode-model">, Index: include/clang/Driver/Options.td =================================================================== --- include/clang/Driver/Options.td (revision 178718) +++ include/clang/Driver/Options.td (working copy) @@ -587,6 +587,7 @@ Flags<[CC1Option]>, HelpText<"Disable spell-checking">; def fno_stack_protector : Flag<["-"], "fno-stack-protector">, Group<f_Group>; def fno_strict_aliasing : Flag<["-"], "fno-strict-aliasing">, Group<f_Group>; +def fstruct_path_tbaa : Flag<["-"], "fstruct-path-tbaa">, Group<f_Group>; def fno_strict_enums : Flag<["-"], "fno-strict-enums">, Group<f_Group>; def fno_strict_overflow : Flag<["-"], "fno-strict-overflow">, Group<f_Group>; def fno_threadsafe_statics : Flag<["-"], "fno-threadsafe-statics">, Group<f_Group>, Index: include/clang/Frontend/CodeGenOptions.def =================================================================== --- include/clang/Frontend/CodeGenOptions.def (revision 178718) +++ include/clang/Frontend/CodeGenOptions.def (working copy) @@ -85,6 +85,7 @@ VALUE_CODEGENOPT(OptimizeSize, 2, 0) ///< If -Os (==1) or -Oz (==2) is specified. CODEGENOPT(RelaxAll , 1, 0) ///< Relax all machine code instructions. CODEGENOPT(RelaxedAliasing , 1, 0) ///< Set when -fno-strict-aliasing is enabled. 
+CODEGENOPT(StructPathTBAA , 1, 0) ///< Whether or not to use struct-path TBAA. CODEGENOPT(SaveTempLabels , 1, 0) ///< Save temporary labels. CODEGENOPT(SanitizeAddressZeroBaseShadow , 1, 0) ///< Map shadow memory at zero ///< offset in AddressSanitizer. Index: lib/CodeGen/CGExpr.cpp =================================================================== --- lib/CodeGen/CGExpr.cpp (revision 178718) +++ lib/CodeGen/CGExpr.cpp (working copy) @@ -1044,7 +1044,8 @@ llvm::Value CodeGenFunction::EmitLoadOfScalar(LValue lvalue) { return EmitLoadOfScalar(lvalue.getAddress(), lvalue.isVolatile(), lvalue.getAlignment().getQuantity(), - lvalue.getType(), lvalue.getTBAAInfo()); + lvalue.getType(), lvalue.getTBAAInfo(), + lvalue.getTBAABaseType(), lvalue.getTBAAOffset()); } static bool hasBooleanRepresentation(QualType Ty) { @@ -1106,7 +1107,9 @@ llvm::Value CodeGenFunction::EmitLoadOfScalar(llvm::Value Addr, bool Volatile, unsigned Alignment, QualType Ty, - llvm::MDNode TBAAInfo) { + llvm::MDNode TBAAInfo, + QualType TBAABaseType, + uint64_t TBAAOffset) { // For better performance, handle vector loads differently. if (Ty->isVectorType()) { llvm::Value V; @@ -1158,8 +1161,11 @@ Load->setVolatile(true); if (Alignment) Load->setAlignment(Alignment); - if (TBAAInfo) - CGM.DecorateInstruction(Load, TBAAInfo); + if (TBAAInfo) { + llvm::MDNode TBAAPath = CGM.getTBAAStructTagInfo(TBAABaseType, TBAAInfo, + TBAAOffset); + CGM.DecorateInstruction(Load, TBAAPath); + } if ((SanOpts->Bool && hasBooleanRepresentation(Ty)) \|\| (SanOpts->Enum && Ty->getAs<EnumType>())) { @@ -1217,7 +1223,8 @@ bool Volatile, unsigned Alignment, QualType Ty, llvm::MDNode TBAAInfo, - bool isInit) { + bool isInit, QualType TBAABaseType, + uint64_t TBAAOffset) { // Handle vectors differently to get better performance. 
if (Ty->isVectorType()) { @@ -1268,15 +1275,19 @@ llvm::StoreInst Store = Builder.CreateStore(Value, Addr, Volatile); if (Alignment) Store->setAlignment(Alignment); - if (TBAAInfo) - CGM.DecorateInstruction(Store, TBAAInfo); + if (TBAAInfo) { + llvm::MDNode TBAAPath = CGM.getTBAAStructTagInfo(TBAABaseType, TBAAInfo, + TBAAOffset); + CGM.DecorateInstruction(Store, TBAAPath); + } } void CodeGenFunction::EmitStoreOfScalar(llvm::Value value, LValue lvalue, bool isInit) { EmitStoreOfScalar(value, lvalue.getAddress(), lvalue.isVolatile(), lvalue.getAlignment().getQuantity(), lvalue.getType(), - lvalue.getTBAAInfo(), isInit); + lvalue.getTBAAInfo(), isInit, lvalue.getTBAABaseType(), + lvalue.getTBAAOffset()); } /// EmitLoadOfLValue - Given an expression that represents a value lvalue, this @@ -2494,9 +2505,12 @@ llvm::Value addr = base.getAddress(); unsigned cvr = base.getVRQualifiers(); + bool TBAAPath = CGM.getCodeGenOpts().StructPathTBAA; if (rec->isUnion()) { // For unions, there is no pointer adjustment. assert(!type->isReferenceType() && "union has reference member"); + // TODO: handle path-aware TBAA for union. + TBAAPath = false; } else { // For structs, we GEP to the field that the record layout suggests. unsigned idx = CGM.getTypes().getCGRecordLayout(rec).getLLVMFieldNo(field); @@ -2508,6 +2522,8 @@ if (cvr & Qualifiers::Volatile) load->setVolatile(true); load->setAlignment(alignment.getQuantity()); + // Loading the reference will disable path-aware TBAA. + TBAAPath = false; if (CGM.shouldUseTBAA()) { llvm::MDNode tbaa; if (mayAlias) @@ -2541,6 +2557,16 @@ LValue LV = MakeAddrLValue(addr, type, alignment); LV.getQuals().addCVRQualifiers(cvr); + if (TBAAPath) { + const ASTRecordLayout &Layout = + getContext().getASTRecordLayout(field->getParent()); + // Set the base type to be the base type of the base LValue and + // update offset to be relative to the base type. 
+ LV.setTBAABaseType(base.getTBAABaseType()); + LV.setTBAAOffset(base.getTBAAOffset() + + Layout.getFieldOffset(field->getFieldIndex()) / + getContext().getCharWidth()); + } // __weak attribute on a field is ignored. if (LV.getQuals().getObjCGCAttr() == Qualifiers::Weak) Index: lib/CodeGen/CGValue.h =================================================================== --- lib/CodeGen/CGValue.h (revision 178718) +++ lib/CodeGen/CGValue.h (working copy) @@ -157,6 +157,11 @@ Expr BaseIvarExp; + /// Used by struct-path-aware TBAA. + QualType TBAABaseType; + /// Offset relative to the base type. + uint64_t TBAAOffset; + /// TBAAInfo - TBAA information to attach to dereferences of this LValue. llvm::MDNode TBAAInfo; @@ -175,6 +180,10 @@ this->ImpreciseLifetime = false; this->ThreadLocalRef = false; this->BaseIvarExp = 0; + + // Initialize fields for TBAA. + this->TBAABaseType = Type; + this->TBAAOffset = 0; this->TBAAInfo = TBAAInfo; } @@ -232,6 +241,12 @@ Expr getBaseIvarExp() const { return BaseIvarExp; } void setBaseIvarExp(Expr V) { BaseIvarExp = V; } + QualType getTBAABaseType() const { return TBAABaseType; } + void setTBAABaseType(QualType T) { TBAABaseType = T; } + + uint64_t getTBAAOffset() const { return TBAAOffset; } + void setTBAAOffset(uint64_t O) { TBAAOffset = O; } + llvm::MDNode getTBAAInfo() const { return TBAAInfo; } void setTBAAInfo(llvm::MDNode N) { TBAAInfo = N; } Index: lib/CodeGen/CodeGenFunction.h =================================================================== --- lib/CodeGen/CodeGenFunction.h (revision 178718) +++ lib/CodeGen/CodeGenFunction.h (working copy) @@ -2211,7 +2211,9 @@ /// the LLVM value representation. 
llvm::Value EmitLoadOfScalar(llvm::Value Addr, bool Volatile, unsigned Alignment, QualType Ty, - llvm::MDNode TBAAInfo = 0); + llvm::MDNode TBAAInfo = 0, + QualType TBAABaseTy = QualType(), + uint64_t TBAAOffset = 0); /// EmitLoadOfScalar - Load a scalar value from an address, taking /// care to appropriately convert from the memory representation to @@ -2224,7 +2226,9 @@ /// the LLVM value representation. void EmitStoreOfScalar(llvm::Value Value, llvm::Value Addr, bool Volatile, unsigned Alignment, QualType Ty, - llvm::MDNode TBAAInfo = 0, bool isInit=false); + llvm::MDNode TBAAInfo = 0, bool isInit = false, + QualType TBAABaseTy = QualType(), + uint64_t TBAAOffset = 0); /// EmitStoreOfScalar - Store a scalar value to an address, taking /// care to appropriately convert from the memory representation to Index: lib/CodeGen/CodeGenModule.cpp =================================================================== --- lib/CodeGen/CodeGenModule.cpp (revision 178718) +++ lib/CodeGen/CodeGenModule.cpp (working copy) @@ -227,6 +227,20 @@ return TBAA->getTBAAStructInfo(QTy); } +llvm::MDNode CodeGenModule::getTBAAStructTypeInfo(QualType QTy) { + if (!TBAA) + return 0; + return TBAA->getTBAAStructTypeInfo(QTy); +} + +llvm::MDNode CodeGenModule::getTBAAStructTagInfo(QualType BaseTy, + llvm::MDNode AccessN, + uint64_t O) { + if (!TBAA) + return 0; + return TBAA->getTBAAStructTagInfo(BaseTy, AccessN, O); +} + void CodeGenModule::DecorateInstruction(llvm::Instruction Inst, llvm::MDNode TBAAInfo) { Inst->setMetadata(llvm::LLVMContext::MD_tbaa, TBAAInfo); Index: lib/CodeGen/CodeGenModule.h =================================================================== --- lib/CodeGen/CodeGenModule.h (revision 178718) +++ lib/CodeGen/CodeGenModule.h (working copy) @@ -501,6 +501,11 @@ llvm::MDNode getTBAAInfo(QualType QTy); llvm::MDNode getTBAAInfoForVTablePtr(); llvm::MDNode getTBAAStructInfo(QualType QTy); + /// Return the MDNode in the type DAG for the given struct type. 
+ llvm::MDNode getTBAAStructTypeInfo(QualType QTy); + /// Return the path-aware tag for given base type, access node and offset. + llvm::MDNode getTBAAStructTagInfo(QualType BaseTy, llvm::MDNode AccessN, + uint64_t O); bool isTypeConstant(QualType QTy, bool ExcludeCtorDtor); Index: lib/CodeGen/CodeGenTBAA.cpp =================================================================== --- lib/CodeGen/CodeGenTBAA.cpp (revision 178718) +++ lib/CodeGen/CodeGenTBAA.cpp (working copy) @@ -21,6 +21,7 @@ #include "clang/AST/Mangle.h" #include "clang/AST/RecordLayout.h" #include "clang/Frontend/CodeGenOptions.h" +#include "llvm/ADT/SmallSet.h" #include "llvm/IR/Constants.h" #include "llvm/IR/LLVMContext.h" #include "llvm/IR/Metadata.h" @@ -225,3 +226,87 @@ // For now, handle any other kind of type conservatively. return StructMetadataCache[Ty] = NULL; } + +/// Check if the given type can be handled by path-aware TBAA. +static bool isTBAAPathStruct(QualType QTy) { + if (const RecordType TTy = QTy->getAs<RecordType>()) { + const RecordDecl RD = TTy->getDecl()->getDefinition(); + // RD can be struct, union, class, interface or enum. + // For now, we only handle struct. + if (RD->isStruct() && !RD->hasFlexibleArrayMember()) + return true; + } + return false; +} + +llvm::MDNode * +CodeGenTBAA::getTBAAStructTypeInfo(QualType QTy) { + const Type Ty = Context.getCanonicalType(QTy).getTypePtr(); + assert(isTBAAPathStruct(QTy)); + + if (llvm::MDNode N = StructTypeMetadataCache[Ty]) + return N; + + if (const RecordType TTy = QTy->getAs<RecordType>()) { + const RecordDecl RD = TTy->getDecl()->getDefinition(); + + const ASTRecordLayout &Layout = Context.getASTRecordLayout(RD); + SmallVector <std::pair<uint64_t, llvm::MDNode>, 4> Fields; + // To reduce the size of MDNode for a given struct type, we only output + // once for all the fields with the same scalar types. + // Offsets for scalar fields in the type DAG are not used. 
+ llvm::SmallSet <llvm::MDNode, 4> ScalarFieldTypes; + unsigned idx = 0; + for (RecordDecl::field_iterator i = RD->field_begin(), + e = RD->field_end(); i != e; ++i, ++idx) { + QualType FieldQTy = i->getType(); + llvm::MDNode FieldNode; + if (isTBAAPathStruct(FieldQTy)) + FieldNode = getTBAAStructTypeInfo(FieldQTy); + else { + FieldNode = getTBAAInfo(FieldQTy); + // Ignore this field if the type already exists. + if (ScalarFieldTypes.count(FieldNode)) + continue; + ScalarFieldTypes.insert(FieldNode); + } + if (!FieldNode) + return StructTypeMetadataCache[Ty] = NULL; + Fields.push_back(std::make_pair( + Layout.getFieldOffset(idx) / Context.getCharWidth(), FieldNode)); + } + + // TODO: This is using the RTTI name. Is there a better way to get + // a unique string for a type? + SmallString<256> OutName; + llvm::raw_svector_ostream Out(OutName); + MContext.mangleCXXRTTIName(QualType(Ty, 0), Out); + Out.flush(); + // Create the struct type node with a vector of pairs (offset, type). + return StructTypeMetadataCache[Ty] = + MDHelper.createTBAAStructTypeNode(OutName, Fields); + } + + return StructMetadataCache[Ty] = NULL; +} + +llvm::MDNode +CodeGenTBAA::getTBAAStructTagInfo(QualType BaseQTy, llvm::MDNode AccessNode, + uint64_t Offset) { + if (!CodeGenOpts.StructPathTBAA) + return AccessNode; + + const Type BTy = Context.getCanonicalType(BaseQTy).getTypePtr(); + TBAAPathTag PathTag = TBAAPathTag(BTy, AccessNode, Offset); + if (llvm::MDNode N = StructTagMetadataCache[PathTag]) + return N; + + llvm::MDNode BNode = 0; + if (isTBAAPathStruct(BaseQTy)) + BNode = getTBAAStructTypeInfo(BaseQTy); + if (!BNode) + return StructTagMetadataCache[PathTag] = AccessNode; + + return StructTagMetadataCache[PathTag] = + MDHelper.createTBAAStructTagNode(BNode, AccessNode, Offset); +} Index: lib/CodeGen/CodeGenTBAA.h =================================================================== --- lib/CodeGen/CodeGenTBAA.h (revision 178718) +++ lib/CodeGen/CodeGenTBAA.h (working copy) @@ -35,6 +35,14 
@@ namespace CodeGen { class CGRecordLayout; + struct TBAAPathTag { + TBAAPathTag(const Type B, const llvm::MDNode A, uint64_t O) + : BaseT(B), AccessN(A), Offset(O) {} + const Type BaseT; + const llvm::MDNode AccessN; + uint64_t Offset; + }; + /// CodeGenTBAA - This class organizes the cross-module state that is used /// while lowering AST types to LLVM types. class CodeGenTBAA { @@ -46,8 +54,13 @@ // MDHelper - Helper for creating metadata. llvm::MDBuilder MDHelper; - /// MetadataCache - This maps clang::Types to llvm::MDNodes describing them. + /// MetadataCache - This maps clang::Types to scalar llvm::MDNodes describing + /// them. llvm::DenseMap<const Type , llvm::MDNode > MetadataCache; + /// This maps clang::Types to a struct node in the type DAG. + llvm::DenseMap<const Type , llvm::MDNode > StructTypeMetadataCache; + /// This maps TBAAPathTags to a tag node. + llvm::DenseMap<TBAAPathTag, llvm::MDNode > StructTagMetadataCache; /// StructMetadataCache - This maps clang::Types to llvm::MDNodes describing /// them for struct assignments. @@ -89,9 +102,49 @@ /// getTBAAStructInfo - Get the TBAAStruct MDNode to be used for a memcpy of /// the given type. llvm::MDNode getTBAAStructInfo(QualType QTy); + + /// Get the MDNode in the type DAG for given struct type QType. + llvm::MDNode getTBAAStructTypeInfo(QualType QType); + /// Get the tag MDNode for a given base type, the actual sclar access MDNode + /// and offset into the base type. 
+ llvm::MDNode getTBAAStructTagInfo(QualType BaseQType, + llvm::MDNode AccessNode, uint64_t Offset); }; } // end namespace CodeGen } // end namespace clang +namespace llvm { + +template<> struct DenseMapInfo<clang::CodeGen::TBAAPathTag> { + static clang::CodeGen::TBAAPathTag getEmptyKey() { + return clang::CodeGen::TBAAPathTag( + DenseMapInfo<const clang::Type >::getEmptyKey(), + DenseMapInfo<const MDNode >::getEmptyKey(), + DenseMapInfo<uint64_t>::getEmptyKey()); + } + + static clang::CodeGen::TBAAPathTag getTombstoneKey() { + return clang::CodeGen::TBAAPathTag( + DenseMapInfo<const clang::Type >::getTombstoneKey(), + DenseMapInfo<const MDNode >::getTombstoneKey(), + DenseMapInfo<uint64_t>::getTombstoneKey()); + } + + static unsigned getHashValue(const clang::CodeGen::TBAAPathTag &Val) { + return DenseMapInfo<const clang::Type >::getHashValue(Val.BaseT) ^ + DenseMapInfo<const MDNode >::getHashValue(Val.AccessN) ^ + DenseMapInfo<uint64_t>::getHashValue(Val.Offset); + } + + static bool isEqual(const clang::CodeGen::TBAAPathTag &LHS, + const clang::CodeGen::TBAAPathTag &RHS) { + return LHS.BaseT == RHS.BaseT && + LHS.AccessN == RHS.AccessN && + LHS.Offset == RHS.Offset; + } +}; + +} // end namespace llvm + #endif Index: lib/Driver/Tools.cpp =================================================================== --- lib/Driver/Tools.cpp (revision 178718) +++ lib/Driver/Tools.cpp (working copy) @@ -2105,6 +2105,8 @@ options::OPT_fno_strict_aliasing, getToolChain().IsStrictAliasingDefault())) CmdArgs.push_back("-relaxed-aliasing"); + if (Args.hasArg(options::OPT_fstruct_path_tbaa)) + CmdArgs.push_back("-struct-path-tbaa"); if (Args.hasFlag(options::OPT_fstrict_enums, options::OPT_fno_strict_enums, false)) CmdArgs.push_back("-fstrict-enums"); Index: lib/Frontend/CompilerInvocation.cpp =================================================================== --- lib/Frontend/CompilerInvocation.cpp (revision 178718) +++ lib/Frontend/CompilerInvocation.cpp (working copy) @@ -324,6 
+324,7 @@ Opts.UseRegisterSizedBitfieldAccess = Args.hasArg( OPT_fuse_register_sized_bitfield_access); Opts.RelaxedAliasing = Args.hasArg(OPT_relaxed_aliasing); + Opts.StructPathTBAA = Args.hasArg(OPT_struct_path_tbaa); Opts.DwarfDebugFlags = Args.getLastArgValue(OPT_dwarf_debug_flags); Opts.MergeAllConstants = !Args.hasArg(OPT_fno_merge_all_constants); Opts.NoCommon = Args.hasArg(OPT_fno_common); Index: test/CodeGen/tbaa.cpp =================================================================== --- test/CodeGen/tbaa.cpp (revision 0) +++ test/CodeGen/tbaa.cpp (working copy) @@ -0,0 +1,217 @@ +// RUN: %clang_cc1 -O1 -disable-llvm-optzns %s -emit-llvm -o - \| FileCheck %s +// RUN: %clang_cc1 -O1 -struct-path-tbaa -disable-llvm-optzns %s -emit-llvm -o - \| FileCheck %s -check-prefix=PATH +// Test TBAA metadata generated by front-end. + +#include <stdint.h> +typedef struct +{ + uint16_t f16; + uint32_t f32; + uint16_t f16_2; + uint32_t f32_2; +} StructA; +typedef struct +{ + uint16_t f16; + StructA a; + uint32_t f32; +} StructB; +typedef struct +{ + uint16_t f16; + StructB b; + uint32_t f32; +} StructC; +typedef struct +{ + uint16_t f16; + StructB b; + uint32_t f32; + uint8_t f8; +} StructD; + +typedef struct +{ + uint16_t f16; + uint32_t f32; +} StructS; +typedef struct +{ + uint16_t f16; + uint32_t f32; +} StructS2; + +uint32_t g(uint32_t s, StructA A, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !5 + s = 1; + A->f32 = 4; + return s; +} + +uint32_t g2(uint32_t s, StructA A, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i16 4, i16 %{{.}}, align 2, !tbaa !5 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// PATH: store i16 
4, i16 %{{.}}, align 2, !tbaa !8 + s = 1; + A->f16 = 4; + return s; +} + +uint32_t g3(StructA A, StructB B, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !5 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !9 + A->f32 = 1; + B->a.f32 = 4; + return A->f32; +} + +uint32_t g4(StructA A, StructB B, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i16 4, i16 %{{.}}, align 2, !tbaa !5 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !5 +// PATH: store i16 4, i16 %{{.}}, align 2, !tbaa !11 + A->f32 = 1; + B->a.f16 = 4; + return A->f32; +} + +uint32_t g5(StructA A, StructB B, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !5 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !12 + A->f32 = 1; + B->f32 = 4; + return A->f32; +} + +uint32_t g6(StructA A, StructB B, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !5 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !13 + A->f32 = 1; + B->a.f32_2 = 4; + return A->f32; +} + +uint32_t g7(StructA A, StructS S, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !5 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !14 + A->f32 = 1; + S->f32 = 4; + return A->f32; +} + +uint32_t g8(StructA A, 
StructS S, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i16 4, i16 %{{.}}, align 2, !tbaa !5 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !5 +// PATH: store i16 4, i16 %{{.}}, align 2, !tbaa !16 + A->f32 = 1; + S->f16 = 4; + return A->f32; +} + +uint32_t g9(StructS S, StructS2 S2, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !14 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !17 + S->f32 = 1; + S2->f32 = 4; + return S->f32; +} + +uint32_t g10(StructS S, StructS2 S2, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i16 4, i16 %{{.}}, align 2, !tbaa !5 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !14 +// PATH: store i16 4, i16 %{{.}}, align 2, !tbaa !19 + S->f32 = 1; + S2->f16 = 4; + return S->f32; +} + +uint32_t g11(StructC C, StructD D, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !20 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !22 + C->b.a.f32 = 1; + D->b.a.f32 = 4; + return C->b.a.f32; +} + +uint32_t g12(StructC C, StructD D, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// TODO: differentiate the two accesses. +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !9 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !9 + StructB b1 = &(C->b); + StructB *b2 = &(D->b); + // b1, b2 have different context. 
+ b1->a.f32 = 1; + b2->a.f32 = 4; + return b1->a.f32; +} + +// CHECK: !1 = metadata !{metadata !"omnipotent char", metadata !2} +// CHECK: !2 = metadata !{metadata !"Simple C/C++ TBAA"} +// CHECK: !4 = metadata !{metadata !"int", metadata !1} +// CHECK: !5 = metadata !{metadata !"short", metadata !1} + +// PATH: !1 = metadata !{metadata !"omnipotent char", metadata !2} +// PATH: !4 = metadata !{metadata !"int", metadata !1} +// PATH: !5 = metadata !{metadata !6, metadata !4, i64 4} +// PATH: !6 = metadata !{metadata !"_ZTS7StructA", i64 0, metadata !7, i64 4, metadata !4} +// PATH: !7 = metadata !{metadata !"short", metadata !1} +// PATH: !8 = metadata !{metadata !6, metadata !7, i64 0} +// PATH: !9 = metadata !{metadata !10, metadata !4, i64 8} +// PATH: !10 = metadata !{metadata !"_ZTS7StructB", i64 0, metadata !7, i64 4, metadata !6, i64 20, metadata !4} +// PATH: !11 = metadata !{metadata !10, metadata !7, i64 4} +// PATH: !12 = metadata !{metadata !10, metadata !4, i64 20} +// PATH: !13 = metadata !{metadata !10, metadata !4, i64 16} +// PATH: !14 = metadata !{metadata !15, metadata !4, i64 4} +// PATH: !15 = metadata !{metadata !"_ZTS7StructS", i64 0, metadata !7, i64 4, metadata !4} +// PATH: !16 = metadata !{metadata !15, metadata !7, i64 0} +// PATH: !17 = metadata !{metadata !18, metadata !4, i64 4} +// PATH: !18 = metadata !{metadata !"_ZTS8StructS2", i64 0, metadata !7, i64 4, metadata !4} +// PATH: !19 = metadata !{metadata !18, metadata !7, i64 0} +// PATH: !20 = metadata !{metadata !21, metadata !4, i64 12} +// PATH: !21 = metadata !{metadata !"_ZTS7StructC", i64 0, metadata !7, i64 4, metadata !10, i64 28, metadata !4} +// PATH: !22 = metadata !{metadata !23, metadata !4, i64 12} +// PATH: !23 = metadata !{metadata !"_ZTS7StructD", i64 0, metadata !7, i64 4, metadata !10, i64 28, metadata !4, i64 32, metadata !1} llvm-svn: 178784	2013-04-04 20:14:17 +00:00
Adrian Prantl	5d5b67c52c	* Attempt to un-break gdb buildbot by emitting a lexical block end only when we actually end a lexical block. * Added new test for line table / block cleanup. * Follow-up to r177819 / rdar://problem/13115369 llvm-svn: 178490	2013-04-01 19:02:06 +00:00
Nadav Rotem	1da30944a6	Make clang mark static stack allocations with lifetime markers to enable a more aggressive stack coloring. Patch by John McCall with help by Shuxin Yang. rdar://13115369 llvm-svn: 177819	2013-03-23 06:43:35 +00:00
John McCall	eff1884274	Under ARC, when we're passing the address of a strong variable to an out-parameter using the indirect-writeback conversion, and we copied the current value of the variable to the temporary, make sure that we register an intrinsic use of that value with the optimizer so that the value won't get released until we have a chance to retain it. rdar://13195034 llvm-svn: 177813	2013-03-23 02:35:54 +00:00
Manman Ren	0175461296	Exploit this-return of a callsite in a this-return function. For constructors/destructors that return 'this', if there exists a callsite that returns 'this' and is immediately before the return instruction, make sure we are using the return value from the callsite. We don't need to keep 'this' alive through the callsite. It also enables optimizations in the backend, such as tail call optimization. Updated from r177211. rdar://12818789 llvm-svn: 177541	2013-03-20 16:59:38 +00:00
Manman Ren	c089074aa5	revert r177211 due to its potential issues llvm-svn: 177222	2013-03-16 04:47:38 +00:00
Manman Ren	58dd990c11	Exploit this-return of a callsite in a this-return function. For constructors/destructors that return 'this', if there exists a callsite that returns 'this' and is immediately before the return instruction, make sure we are using the return value from the callsite. We don't need to keep 'this' alive through the callsite. It also enables optimizations in the backend, such as tail call optimization. rdar://12818789 llvm-svn: 177211	2013-03-16 00:11:09 +00:00
John McCall	cdda29c968	Tighten up the rules for precise lifetime and document the requirements on the ARC optimizer. rdar://13407451 llvm-svn: 176924	2013-03-13 03:10:54 +00:00
Joey Gouly	aba589cceb	Add support for the OpenCL attribute 'vec_type_hint'. Patch by Murat Bolat! llvm-svn: 176686	2013-03-08 09:42:32 +00:00
John McCall	a8ec7eb9cf	Promote atomic type sizes up to a power of two, capped by MaxAtomicPromoteWidth. Fix a ton of terrible bugs with _Atomic types and (non-intrinsic-mediated) loads and stores thereto. llvm-svn: 176658	2013-03-07 21:37:17 +00:00
John McCall	47fb950871	Change hasAggregateLLVMType, which conflates complex and aggregate types in a profoundly wrong way that has to be worked around in every call site, to getEvaluationKind, which classifies and distinguishes between all of these cases. Also, normalize the API for loading and storing complexes. I'm working on a larger patch and wanted to pull these changes out, but it would have been annoying to detangle them from each other. llvm-svn: 176656	2013-03-07 21:37:08 +00:00
John McCall	e739a49325	Restore order to placate test. I had no real reason to switch them. llvm-svn: 176328	2013-03-01 01:38:54 +00:00
John McCall	07e60263dd	Re-use bit from superclass and extract stuff into a local function. Serves a patch we're kicking around out-of-tree. llvm-svn: 176327	2013-03-01 01:24:35 +00:00
John McCall	882987f30c	Use the actual ABI-determined C calling convention for runtime calls and declarations. LLVM has a default CC determined by the target triple. This is not always the actual default CC for the ABI we've been asked to target, and so we sometimes find ourselves annotating all user functions with an explicit calling convention. Since these calling conventions usually agree for the simple set of argument types passed to most runtime functions, using the LLVM-default CC in principle has no effect. However, the LLVM optimizer goes into histrionics if it sees this kind of formal CC mismatch, since it has no concept of CC compatibility. Therefore, if this module happens to define the "runtime" function, or got LTO'ed with such a definition, we can miscompile; so it's quite important to get this right. Defining runtime functions locally is quite common in embedded applications. llvm-svn: 176286	2013-02-28 19:01:20 +00:00
Timur Iskhodzhanov	57cbe5c790	Better support for constructors with -cxx-abi microsoft, partly fixes PR12784 llvm-svn: 176186	2013-02-27 13:46:31 +00:00
Richard Smith	539e4a77bb	ubsan: Emit bounds checks for array indexing, vector indexing, and (in really simple cases) pointer arithmetic. This augments the existing bounds checking with language-level array bounds information. llvm-svn: 175949	2013-02-23 02:53:19 +00:00
Lang Hames	bf122744e5	Re-apply r174919 - smarter copy/move assignment/construction, with fixes for bitfield related issues. The original commit broke Takumi's builder. The bug was caused by bitfield sizes being determined by their underlying type, rather than the field info. A similar issue with bitfield alignments showed up on closer testing. Both have been fixed in this patch. llvm-svn: 175389	2013-02-17 07:22:09 +00:00
Richard Smith	2c5868c334	ubsan: Add checking for invalid downcasts. Per [expr.static.cast]p2 and p11, base-to-derived casts have undefined behavior if the object is not actually an instance of the derived type. llvm-svn: 175078	2013-02-13 21:18:23 +00:00
Timur Iskhodzhanov	ee6bc53365	Emit virtual/deleting destructors properly with -cxx-abi microsoft, PR15058 llvm-svn: 175045	2013-02-13 08:37:51 +00:00
Lang Hames	697b004219	Backing out r174919 while I investigate a self-host bug on Takumi's builder. llvm-svn: 174925	2013-02-12 00:44:43 +00:00
Lang Hames	5824a4f1b0	When generating IR for default copy-constructors, copy-assignment operators, move-constructors and move-assignment operators, use memcpy to copy adjacent POD members. Previously, classes with one or more Non-POD members would fall back on element-wise copies for all members, including POD members. This often generated a lot of IR. Without padding metadata, it wasn't often possible for the LLVM optimizers to turn the element-wise copies into a memcpy. This code hasn't yet received any serious tuning. I didn't see any serious regressions on a self-hosted clang build, or any of the nightly tests, but I think it's important to get this out in the wild to get more testing. Insights, feedback and comments welcome. Many thanks to David Blaikie, Richard Smith, and especially John McCall for their help and feedback on this work. llvm-svn: 174919	2013-02-11 23:44:11 +00:00
Arnaud A. de Grandmaison	49c04467ea	Fix typo in comment llvm-svn: 174359	2013-02-05 09:06:17 +00:00
David Blaikie	357aafb566	Fix exception handling line table problems introduced by r173593 r173593 made us a little too eager to associate all code at the end of a function with the user-written 'return' line. This caused problems with breakpoints as they'd be set in exception handling code preceding the actual non-exception return handling code, leading to the breakpoint never being hit in non-exceptional execution. This change restores the pre-r173593 exception handling line information where the cleanup code is associated with the '}' not the return line. llvm-svn: 174206	2013-02-01 19:09:49 +00:00
John McCall	12cc42aa1b	Destroy arrays and ARC fields when throwing out of ctors. Previously we were only handling non-array fields of class type. Testcases derived from a patch by WenHan Gu. llvm-svn: 174146	2013-02-01 05:11:40 +00:00
Douglas Gregor	6153500517	When we're emitting a constructor or destructor call from a delegating constructor, retrieve our VTT parameter directly. Fixes PR14588 / <rdar://problem/12867962>. llvm-svn: 174042	2013-01-31 05:50:40 +00:00
Chad Rosier	ae229d599b	[ubsan] Implement the -fcatch-undefined-behavior flag using a trapping implementation; this is much more in line with the original implementation (i.e., pre-ubsan) and does not require run-time library support. The trapping implementation can be invoked using either '-fcatch-undefined-behavior' or '-fsanitize=undefined-trap -fsanitize-undefined-trap-on-error', with the latter being preferred. Eventually, the '-fcatch-undefined-behavior' flag will be removed. llvm-svn: 173848	2013-01-29 23:31:22 +00:00
David Blaikie	0a21d0da17	PR14566: Debug Info: avoid top level lexical blocks in functions One of the gotchas (see changes to CodeGenFunction) was due to the fix in r139416 (for PR10829). This only worked previously because the top level lexical block would set the location to the end of the function, the debug location would be updated (as per r139416), the location would be set to the end of the function again (but that would no-op, since it was the same as the previous location), then the return instruction would be emitted using the debug location. Once the top level lexical block was no longer emitted, the end-of-function location change was causing the debug loc to be updated, regressing that bug. llvm-svn: 173593	2013-01-26 22:16:26 +00:00
Fariborz Jahanian	7865220da4	patch for PR9027 and // rdar://11861085 Title: [PR9027] volatile struct bug: member is not loaded at -O; This is caused by last flag passed to @llvm.memcpy being false, not honoring that aggregate has at least one 'volatile' data member (even though aggregate itself has not been qualified as 'volatile'). As a result, optimization optimizes away the memcpy altogether. Patch review by John McCall (I still need to fix up a test though). llvm-svn: 173535	2013-01-25 23:57:05 +00:00
Will Dietz	f54319c891	[ubsan] Add support for -fsanitize-blacklist llvm-svn: 172808	2013-01-18 11:30:38 +00:00
Dmitri Gribenko	f857950d39	Remove useless 'llvm::' qualifier from names like StringRef and others that are brought into 'clang' namespace by clang/Basic/LLVM.h llvm-svn: 172323	2013-01-12 19:30:44 +00:00
Eli Friedman	33accdf602	Don't assert/crash on reference variables in lambdas bound to a static local variable from the parent scope. PR14773. llvm-svn: 171433	2013-01-03 00:39:26 +00:00
Chandler Carruth	3a02247dc9	Sort all of Clang's files under 'lib', and fix up the broken headers uncovered. This required manually correcting all of the incorrect main-module headers I could find, and running the new llvm/utils/sort_includes.py script over the files. I also manually added quite a few missing headers that were uncovered by shuffling the order or moving headers up to be main-module-headers. llvm-svn: 169237	2012-12-04 09:13:33 +00:00
Will Dietz	88e0233ff4	[ubsan] Add flag to enable recovery from checks when possible. llvm-svn: 169114	2012-12-02 19:50:33 +00:00
David Chisnall	9a837be2b9	Fix the Objective-C exception rethrow from cleanups (GNU runtimes). Note that a bug in the inliner still causes the wrong thing to happen at -O2 and above (PR14116). llvm-svn: 167534	2012-11-07 16:50:40 +00:00
Richard Smith	b1b0ab41e7	Use the individual -fsanitize=<...> arguments to control which of the UBSan checks to enable. Remove frontend support for -fcatch-undefined-behavior, -faddress-sanitizer and -fthread-sanitizer now that they don't do anything. llvm-svn: 167413	2012-11-05 22:21:05 +00:00
Richard Smith	de67068fc1	Split emission of -ftrapv checks and -fcatch-undefined-behavior checks into separate functions, since they share essentially no code. llvm-svn: 167259	2012-11-01 22:15:34 +00:00
Richard Smith	4d3110af06	-fcatch-undefined-behavior checking for appropriate vptr value: Clang CodeGen side. llvm-svn: 166661	2012-10-25 02:14:12 +00:00
John McCall	e68b8f4dcc	At -O0, prefer objc_storeStrong with a null new value to the combination of a load+objc_release; this is generally better for tools that try to track why values are retained and released. Also use objc_storeStrong when copying a block (again, only at -O0), which requires us to do a preliminary store of null in order to compensate for objc_storeStrong's assign semantics. llvm-svn: 166085	2012-10-17 02:28:37 +00:00
Alexey Samsonov	38e2496497	Transform pattern: if (CGM.getModuleDebugInfo()) DebugInfo = CGM.getModuleDebugInfo() into a call: maybeInitializeDebugInfo(); This is a simplification for a possible future fix of PR13942. llvm-svn: 166019	2012-10-16 07:22:28 +00:00
Nico Weber	cf4ff586e8	Add codegen support for __uuidof(). llvm-svn: 165710	2012-10-11 10:13:44 +00:00
Richard Smith	e30752c93b	-fcatch-undefined-behavior: emit calls to the runtime library whenever one of the checks fails. llvm-svn: 165536	2012-10-09 19:52:38 +00:00
Benjamin Kramer	1ca66919a5	CodeGen: Copy tail padding when we're not dealing with a trivial copy assign or move assign operator. This fixes a regression from r162254, the optimizer has problems reasoning about the smaller memcpy as it's often not safe to widen a store but making it smaller is. llvm-svn: 164917	2012-09-30 12:43:37 +00:00
Sylvestre Ledru	33b5baf189	Revert 'Fix a typo 'iff' => 'if''. iff is an abbreviation of if and only if. See: http://en.wikipedia.org/wiki/If_and_only_if Commit 164766 llvm-svn: 164769	2012-09-27 10:16:10 +00:00
Sylvestre Ledru	a876013dc9	Fix a typo 'iff' => 'if' llvm-svn: 164766	2012-09-27 09:57:10 +00:00
Dmitri Gribenko	a664e5b88f	Use LLVM_DELETED_FUNCTION in place of 'DO NOT IMPLEMENT' comments. llvm-svn: 163983	2012-09-15 20:20:27 +00:00
Richard Smith	4d1458ed38	-fcatch-undefined-behavior: Factor emission of the creation of, and branch to, the trap BB out of the individual checks and into a common function, to prepare for making this code call into a runtime library. Rename the existing EmitCheck to EmitTypeCheck to clarify it and to move it out of the way of the new EmitCheck. llvm-svn: 163451	2012-09-08 02:08:36 +00:00
Chad Rosier	649dfc317d	[ms-inline asm] Have MSAsmStmts use the generic EmitAsmStmt codegen function. llvm-svn: 162796	2012-08-28 21:11:24 +00:00
Chad Rosier	6051bb94c0	[ms-inline asm] Rename EmitGCCAsmStmt to EmitAsmStmt and have it accept AsmStmts. This function is only used by GCCAsmStmts, however. Constraints need to be properly computed before MSAsmStmts can use EmitAsmStmt. No functional change intended. llvm-svn: 162776	2012-08-28 18:54:39 +00:00
Chad Rosier	de70e0ef45	[ms-inline asm] As part of a larger refactoring, rename AsmStmt to GCCAsmStmt. No functional change intended. llvm-svn: 162632	2012-08-25 00:11:56 +00:00
Richard Smith	69d0d2626a	New -fcatch-undefined-behavior features: * when checking that a pointer or reference refers to appropriate storage for a type, also check the alignment and perform a null check * check that references are bound to appropriate storage * check that 'this' has appropriate storage in member accesses and member function calls llvm-svn: 162523	2012-08-24 00:54:33 +00:00
Chad Rosier	59df25b659	[ms-inline asm] Remove an unused argument. This logic can now be reused by the ms-style inline asms. llvm-svn: 162463	2012-08-23 20:00:18 +00:00
Dmitri Gribenko	adba9be7c5	Fix a bunch of -Wdocumentation warnings. llvm-svn: 162452	2012-08-23 17:58:28 +00:00
Eli Friedman	a5dd5684dc	Use the alignment from lvalue emission to more accurately compute the alignment of a pointer for builtin emission, instead of just depending on the type of the pointee. <rdar://problem/11314941>. llvm-svn: 162425	2012-08-23 03:10:17 +00:00
Eli Friedman	f6d2184c83	Fix an assertion failure with a C++ constructor initializing a member of reference type in an anonymous struct. PR13154. llvm-svn: 161473	2012-08-08 03:51:37 +00:00
Richard Trieu	c320c745cc	Change APInt to APSInt in one instance. Also change a call to operator==() to APSInt::isSameValue() when comparing different sized APSInt's. llvm-svn: 160641	2012-07-23 20:21:35 +00:00
Simon Atanasyan	94a6d863a9	Revert commit r160308. We decide to move builtins selection to the backend. llvm-svn: 160353	2012-07-17 08:15:06 +00:00
Simon Atanasyan	a06d06b660	MIPS: Implement __builtin_mips_shll_qb builtin function overloading. This function has two versions. The first one is used for a register operand. The second one is used for an immediate number. llvm-svn: 160308	2012-07-16 18:52:02 +00:00
Eric Christopher	f8b9809fab	Temporarily revert this to see if it brings the gdb bot back. llvm-svn: 160049	2012-07-11 15:32:13 +00:00
Eric Christopher	2977378974	The end of a block doesn't necessarily need a line table entry unless there's something going on there. Remove the unconditional line entry and only add one if we're emitting cleanups (any other statements would be handled normally). Fixes rdar://9199234 llvm-svn: 160033	2012-07-11 01:49:26 +00:00
Tanya Lattner	bcffcdfd18	Patch by Anton Lokhmotov to add OpenCL work group size attributes. llvm-svn: 159965	2012-07-09 22:06:01 +00:00
John McCall	4e8ca4fa14	Significantly simplify CGExprAgg's logic about ignored results: if we want to ignore a result, the Dest will be null. Otherwise, we must copy into it. This means we need to ensure a slot when loading from a volatile l-value. With all that in place, fix a bug with chained assignments into __block variables of aggregate type where we were losing insight into the actual source of the value during the second assignment. llvm-svn: 159630	2012-07-02 23:58:38 +00:00
Benjamin Kramer	46a72fb741	Dead code eliminate the massive hexagon builtin intrinsic supporting code. The tablegen'd code does the same thing without this egregious duplication. In my limited testing everything seems to work, however there can be differences if the clang and llvm builtin definitions don't match. llvm-svn: 159371	2012-06-28 20:08:55 +00:00
Simon Atanasyan	07ce7d8fb5	Support MIPS DSP Rev1 intrinsics. This patch was reviewed in the llvm-commits list by Jim Grosbach. llvm-svn: 159366	2012-06-28 18:23:16 +00:00
Eli Friedman	c24e2fb1fb	Propagate lvalue alignment into bitfields. Per report on cfe-dev. llvm-svn: 159295	2012-06-27 21:19:48 +00:00
Fariborz Jahanian	6362803cfe	block literal irgen: several improvements on naming block literal helper functions. All helper functions (global and locals) use block_invoke as their prefix. Local literal helper names are prefixed by their enclosing mangled function names. Blocks in non-local initializers (e.g. a global variable or a C++11 field) are prefixed by their mangled variable name. The discriminator number added to end of the name starts off with blank (for first block) and _<N> (for the N+2-th block). llvm-svn: 159206	2012-06-26 16:06:38 +00:00
Chad Rosier	32503020a4	Etch out the code path for MS-style inline assembly. llvm-svn: 158325	2012-06-11 20:47:18 +00:00
Fariborz Jahanian	b5dd2cb13c	objective-c: fix a sema and IRGen crash when property getter result type is safe but does not match with property type resulting in spurious warning followed by crash in IRGen. // rdar://11515196 llvm-svn: 157641	2012-05-29 19:56:01 +00:00
Richard Smith	bb653bd5f9	Implement IRGen for C++11's "T{1, 2, 3}", where T is an aggregate and the expression is treated as an lvalue. llvm-svn: 156781	2012-05-14 21:57:21 +00:00
Nuno Lopes	3d6311d5f7	add -fbounds-checking option. When enabled, clang generates bounds checks for array and pointers dereferences. Work to follow in LLVM's backend. OK'ed by Chad; thanks for the review. llvm-svn: 156431	2012-05-08 22:10:46 +00:00


1196 Commits