1 /* Driver of optimization process
2 Copyright (C) 2003-2021 Free Software Foundation, Inc.
3 Contributed by Jan Hubicka
4
5 This file is part of GCC.
6
7 GCC is free software; you can redistribute it and/or modify it under
8 the terms of the GNU General Public License as published by the Free
9 Software Foundation; either version 3, or (at your option) any later
10 version.
11
12 GCC is distributed in the hope that it will be useful, but WITHOUT ANY
13 WARRANTY; without even the implied warranty of MERCHANTABILITY or
14 FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
15 for more details.
16
17 You should have received a copy of the GNU General Public License
18 along with GCC; see the file COPYING3. If not see
19 <http://www.gnu.org/licenses/>. */
20
21 /* This module implements main driver of compilation process.
22
23 The main scope of this file is to act as an interface in between
24 tree based frontends and the backend.
25
26 The front-end is supposed to use following functionality:
27
28 - finalize_function
29
30 This function is called once front-end has parsed whole body of function
31 and it is certain that the function body nor the declaration will change.
32
33 (There is one exception needed for implementing GCC extern inline
34 function.)
35
36 - varpool_finalize_decl
37
38 This function has same behavior as the above but is used for static
39 variables.
40
41 - add_asm_node
42
43 Insert new toplevel ASM statement
44
45 - finalize_compilation_unit
46
47 This function is called once (source level) compilation unit is finalized
48 and it will no longer change.
49
50 The symbol table is constructed starting from the trivially needed
51 symbols finalized by the frontend. Functions are lowered into
52 GIMPLE representation and callgraph/reference lists are constructed.
53 Those are used to discover other necessary functions and variables.
54
55 At the end the bodies of unreachable functions are removed.
56
57 The function can be called multiple times when multiple source level
58 compilation units are combined.
59
60 - compile
61
62 This passes control to the back-end. Optimizations are performed and
63 final assembler is generated. This is done in the following way. Note
64 that with link time optimization the process is split into three
65 stages (compile time, linktime analysis and parallel linktime as
66 indicated bellow).
67
68 Compile time:
69
70 1) Inter-procedural optimization.
71 (ipa_passes)
72
73 This part is further split into:
74
75 a) early optimizations. These are local passes executed in
76 the topological order on the callgraph.
77
78 The purpose of early optimizations is to optimize away simple
79 things that may otherwise confuse IP analysis. Very simple
80 propagation across the callgraph is done i.e. to discover
81 functions without side effects and simple inlining is performed.
82
83 b) early small interprocedural passes.
84
85 Those are interprocedural passes executed only at compilation
86 time. These include, for example, transactional memory lowering,
87 unreachable code removal and other simple transformations.
88
89 c) IP analysis stage. All interprocedural passes do their
90 analysis.
91
92 Interprocedural passes differ from small interprocedural
93 passes by their ability to operate across whole program
94 at linktime. Their analysis stage is performed early to
95 both reduce linking times and linktime memory usage by
96 not having to represent whole program in memory.
97
98 d) LTO streaming. When doing LTO, everything important gets
99 streamed into the object file.
100
101 Compile time and or linktime analysis stage (WPA):
102
103 At linktime units gets streamed back and symbol table is
104 merged. Function bodies are not streamed in and not
105 available.
106 e) IP propagation stage. All IP passes execute their
107 IP propagation. This is done based on the earlier analysis
108 without having function bodies at hand.
109 f) Ltrans streaming. When doing WHOPR LTO, the program
110 is partitioned and streamed into multiple object files.
111
112 Compile time and/or parallel linktime stage (ltrans)
113
114 Each of the object files is streamed back and compiled
115 separately. Now the function bodies becomes available
116 again.
117
118 2) Virtual clone materialization
119 (cgraph_materialize_clone)
120
121 IP passes can produce copies of existing functions (such
122 as versioned clones or inline clones) without actually
123 manipulating their bodies by creating virtual clones in
124 the callgraph. At this time the virtual clones are
125 turned into real functions
126 3) IP transformation
127
128 All IP passes transform function bodies based on earlier
129 decision of the IP propagation.
130
131 4) late small IP passes
132
133 Simple IP passes working within single program partition.
134
135 5) Expansion
136 (expand_all_functions)
137
138 At this stage functions that needs to be output into
139 assembler are identified and compiled in topological order
140 6) Output of variables and aliases
141 Now it is known what variable references was not optimized
142 out and thus all variables are output to the file.
143
144 Note that with -fno-toplevel-reorder passes 5 and 6
145 are combined together in cgraph_output_in_order.
146
147 Finally there are functions to manipulate the callgraph from
148 backend.
149 - cgraph_add_new_function is used to add backend produced
150 functions introduced after the unit is finalized.
151 The functions are enqueue for later processing and inserted
152 into callgraph with cgraph_process_new_functions.
153
154 - cgraph_function_versioning
155
156 produces a copy of function into new one (a version)
157 and apply simple transformations
158 */
159
160 #include "config.h"
161 #include "system.h"
162 #include "coretypes.h"
163 #include "backend.h"
164 #include "target.h"
165 #include "rtl.h"
166 #include "tree.h"
167 #include "gimple.h"
168 #include "cfghooks.h"
169 #include "regset.h" /* FIXME: For reg_obstack. */
170 #include "alloc-pool.h"
171 #include "tree-pass.h"
172 #include "stringpool.h"
173 #include "gimple-ssa.h"
174 #include "cgraph.h"
175 #include "coverage.h"
176 #include "lto-streamer.h"
177 #include "fold-const.h"
178 #include "varasm.h"
179 #include "stor-layout.h"
180 #include "output.h"
181 #include "cfgcleanup.h"
182 #include "gimple-fold.h"
183 #include "gimplify.h"
184 #include "gimple-iterator.h"
185 #include "gimplify-me.h"
186 #include "tree-cfg.h"
187 #include "tree-into-ssa.h"
188 #include "tree-ssa.h"
189 #include "langhooks.h"
190 #include "toplev.h"
191 #include "debug.h"
192 #include "symbol-summary.h"
193 #include "tree-vrp.h"
194 #include "ipa-prop.h"
195 #include "gimple-pretty-print.h"
196 #include "plugin.h"
197 #include "ipa-fnsummary.h"
198 #include "ipa-utils.h"
199 #include "except.h"
200 #include "cfgloop.h"
201 #include "context.h"
202 #include "pass_manager.h"
203 #include "tree-nested.h"
204 #include "dbgcnt.h"
205 #include "lto-section-names.h"
206 #include "stringpool.h"
207 #include "attribs.h"
208 #include "ipa-inline.h"
209 #include "omp-offload.h"
210 #include "symtab-thunks.h"
211
212 /* Queue of cgraph nodes scheduled to be added into cgraph. This is a
213 secondary queue used during optimization to accommodate passes that
214 may generate new functions that need to be optimized and expanded. */
215 vec<cgraph_node *> cgraph_new_nodes;
216
217 static void expand_all_functions (void);
218 static void mark_functions_to_output (void);
219 static void handle_alias_pairs (void);
220
221 /* Return true if this symbol is a function from the C frontend specified
222 directly in RTL form (with "__RTL"). */
223
224 bool
native_rtl_p()225 symtab_node::native_rtl_p () const
226 {
227 if (TREE_CODE (decl) != FUNCTION_DECL)
228 return false;
229 if (!DECL_STRUCT_FUNCTION (decl))
230 return false;
231 return DECL_STRUCT_FUNCTION (decl)->curr_properties & PROP_rtl;
232 }
233
234 /* Determine if symbol declaration is needed. That is, visible to something
235 either outside this translation unit, something magic in the system
236 configury */
237 bool
needed_p(void)238 symtab_node::needed_p (void)
239 {
240 /* Double check that no one output the function into assembly file
241 early. */
242 if (!native_rtl_p ())
243 gcc_checking_assert
244 (!DECL_ASSEMBLER_NAME_SET_P (decl)
245 || !TREE_SYMBOL_REFERENCED (DECL_ASSEMBLER_NAME (decl)));
246
247 if (!definition)
248 return false;
249
250 if (DECL_EXTERNAL (decl))
251 return false;
252
253 /* If the user told us it is used, then it must be so. */
254 if (force_output)
255 return true;
256
257 /* ABI forced symbols are needed when they are external. */
258 if (forced_by_abi && TREE_PUBLIC (decl))
259 return true;
260
261 /* Keep constructors, destructors and virtual functions. */
262 if (TREE_CODE (decl) == FUNCTION_DECL
263 && (DECL_STATIC_CONSTRUCTOR (decl) || DECL_STATIC_DESTRUCTOR (decl)))
264 return true;
265
266 /* Externally visible variables must be output. The exception is
267 COMDAT variables that must be output only when they are needed. */
268 if (TREE_PUBLIC (decl) && !DECL_COMDAT (decl))
269 return true;
270
271 return false;
272 }
273
274 /* Head and terminator of the queue of nodes to be processed while building
275 callgraph. */
276
277 static symtab_node symtab_terminator (SYMTAB_SYMBOL);
278 static symtab_node *queued_nodes = &symtab_terminator;
279
280 /* Add NODE to queue starting at QUEUED_NODES.
281 The queue is linked via AUX pointers and terminated by pointer to 1. */
282
283 static void
enqueue_node(symtab_node * node)284 enqueue_node (symtab_node *node)
285 {
286 if (node->aux)
287 return;
288 gcc_checking_assert (queued_nodes);
289 node->aux = queued_nodes;
290 queued_nodes = node;
291 }
292
293 /* Process CGRAPH_NEW_FUNCTIONS and perform actions necessary to add these
294 functions into callgraph in a way so they look like ordinary reachable
295 functions inserted into callgraph already at construction time. */
296
297 void
process_new_functions(void)298 symbol_table::process_new_functions (void)
299 {
300 tree fndecl;
301
302 if (!cgraph_new_nodes.exists ())
303 return;
304
305 handle_alias_pairs ();
306 /* Note that this queue may grow as its being processed, as the new
307 functions may generate new ones. */
308 for (unsigned i = 0; i < cgraph_new_nodes.length (); i++)
309 {
310 cgraph_node *node = cgraph_new_nodes[i];
311 fndecl = node->decl;
312 switch (state)
313 {
314 case CONSTRUCTION:
315 /* At construction time we just need to finalize function and move
316 it into reachable functions list. */
317
318 cgraph_node::finalize_function (fndecl, false);
319 call_cgraph_insertion_hooks (node);
320 enqueue_node (node);
321 break;
322
323 case IPA:
324 case IPA_SSA:
325 case IPA_SSA_AFTER_INLINING:
326 /* When IPA optimization already started, do all essential
327 transformations that has been already performed on the whole
328 cgraph but not on this function. */
329
330 gimple_register_cfg_hooks ();
331 if (!node->analyzed)
332 node->analyze ();
333 push_cfun (DECL_STRUCT_FUNCTION (fndecl));
334 if ((state == IPA_SSA || state == IPA_SSA_AFTER_INLINING)
335 && !gimple_in_ssa_p (DECL_STRUCT_FUNCTION (fndecl)))
336 {
337 bool summaried_computed = ipa_fn_summaries != NULL;
338 g->get_passes ()->execute_early_local_passes ();
339 /* Early passes compute inline parameters to do inlining
340 and splitting. This is redundant for functions added late.
341 Just throw away whatever it did. */
342 if (!summaried_computed)
343 {
344 ipa_free_fn_summary ();
345 ipa_free_size_summary ();
346 }
347 }
348 else if (ipa_fn_summaries != NULL)
349 compute_fn_summary (node, true);
350 free_dominance_info (CDI_POST_DOMINATORS);
351 free_dominance_info (CDI_DOMINATORS);
352 pop_cfun ();
353 call_cgraph_insertion_hooks (node);
354 break;
355
356 case EXPANSION:
357 /* Functions created during expansion shall be compiled
358 directly. */
359 node->process = 0;
360 call_cgraph_insertion_hooks (node);
361 node->expand ();
362 break;
363
364 default:
365 gcc_unreachable ();
366 break;
367 }
368 }
369
370 cgraph_new_nodes.release ();
371 }
372
373 /* As an GCC extension we allow redefinition of the function. The
374 semantics when both copies of bodies differ is not well defined.
375 We replace the old body with new body so in unit at a time mode
376 we always use new body, while in normal mode we may end up with
377 old body inlined into some functions and new body expanded and
378 inlined in others.
379
380 ??? It may make more sense to use one body for inlining and other
381 body for expanding the function but this is difficult to do. */
382
383 void
reset(void)384 cgraph_node::reset (void)
385 {
386 /* If process is set, then we have already begun whole-unit analysis.
387 This is *not* testing for whether we've already emitted the function.
388 That case can be sort-of legitimately seen with real function redefinition
389 errors. I would argue that the front end should never present us with
390 such a case, but don't enforce that for now. */
391 gcc_assert (!process);
392
393 /* Reset our data structures so we can analyze the function again. */
394 inlined_to = NULL;
395 memset (&rtl, 0, sizeof (rtl));
396 analyzed = false;
397 definition = false;
398 alias = false;
399 transparent_alias = false;
400 weakref = false;
401 cpp_implicit_alias = false;
402
403 remove_callees ();
404 remove_all_references ();
405 }
406
407 /* Return true when there are references to the node. INCLUDE_SELF is
408 true if a self reference counts as a reference. */
409
410 bool
referred_to_p(bool include_self)411 symtab_node::referred_to_p (bool include_self)
412 {
413 ipa_ref *ref = NULL;
414
415 /* See if there are any references at all. */
416 if (iterate_referring (0, ref))
417 return true;
418 /* For functions check also calls. */
419 cgraph_node *cn = dyn_cast <cgraph_node *> (this);
420 if (cn && cn->callers)
421 {
422 if (include_self)
423 return true;
424 for (cgraph_edge *e = cn->callers; e; e = e->next_caller)
425 if (e->caller != this)
426 return true;
427 }
428 return false;
429 }
430
431 /* DECL has been parsed. Take it, queue it, compile it at the whim of the
432 logic in effect. If NO_COLLECT is true, then our caller cannot stand to have
433 the garbage collector run at the moment. We would need to either create
434 a new GC context, or just not compile right now. */
435
436 void
finalize_function(tree decl,bool no_collect)437 cgraph_node::finalize_function (tree decl, bool no_collect)
438 {
439 cgraph_node *node = cgraph_node::get_create (decl);
440
441 if (node->definition)
442 {
443 /* Nested functions should only be defined once. */
444 gcc_assert (!DECL_CONTEXT (decl)
445 || TREE_CODE (DECL_CONTEXT (decl)) != FUNCTION_DECL);
446 node->reset ();
447 node->redefined_extern_inline = true;
448 }
449
450 /* Set definition first before calling notice_global_symbol so that
451 it is available to notice_global_symbol. */
452 node->definition = true;
453 notice_global_symbol (decl);
454 node->lowered = DECL_STRUCT_FUNCTION (decl)->cfg != NULL;
455 node->semantic_interposition = opt_for_fn (decl, flag_semantic_interposition);
456 if (!flag_toplevel_reorder)
457 node->no_reorder = true;
458
459 /* With -fkeep-inline-functions we are keeping all inline functions except
460 for extern inline ones. */
461 if (flag_keep_inline_functions
462 && DECL_DECLARED_INLINE_P (decl)
463 && !DECL_EXTERNAL (decl)
464 && !DECL_DISREGARD_INLINE_LIMITS (decl))
465 node->force_output = 1;
466
467 /* __RTL functions were already output as soon as they were parsed (due
468 to the large amount of global state in the backend).
469 Mark such functions as "force_output" to reflect the fact that they
470 will be in the asm file when considering the symbols they reference.
471 The attempt to output them later on will bail out immediately. */
472 if (node->native_rtl_p ())
473 node->force_output = 1;
474
475 /* When not optimizing, also output the static functions. (see
476 PR24561), but don't do so for always_inline functions, functions
477 declared inline and nested functions. These were optimized out
478 in the original implementation and it is unclear whether we want
479 to change the behavior here. */
480 if (((!opt_for_fn (decl, optimize) || flag_keep_static_functions
481 || node->no_reorder)
482 && !node->cpp_implicit_alias
483 && !DECL_DISREGARD_INLINE_LIMITS (decl)
484 && !DECL_DECLARED_INLINE_P (decl)
485 && !(DECL_CONTEXT (decl)
486 && TREE_CODE (DECL_CONTEXT (decl)) == FUNCTION_DECL))
487 && !DECL_COMDAT (decl) && !DECL_EXTERNAL (decl))
488 node->force_output = 1;
489
490 /* If we've not yet emitted decl, tell the debug info about it. */
491 if (!TREE_ASM_WRITTEN (decl))
492 (*debug_hooks->deferred_inline_function) (decl);
493
494 if (!no_collect)
495 ggc_collect ();
496
497 if (symtab->state == CONSTRUCTION
498 && (node->needed_p () || node->referred_to_p ()))
499 enqueue_node (node);
500 }
501
502 /* Add the function FNDECL to the call graph.
503 Unlike finalize_function, this function is intended to be used
504 by middle end and allows insertion of new function at arbitrary point
505 of compilation. The function can be either in high, low or SSA form
506 GIMPLE.
507
508 The function is assumed to be reachable and have address taken (so no
509 API breaking optimizations are performed on it).
510
511 Main work done by this function is to enqueue the function for later
512 processing to avoid need the passes to be re-entrant. */
513
514 void
add_new_function(tree fndecl,bool lowered)515 cgraph_node::add_new_function (tree fndecl, bool lowered)
516 {
517 gcc::pass_manager *passes = g->get_passes ();
518 cgraph_node *node;
519
520 if (dump_file)
521 {
522 struct function *fn = DECL_STRUCT_FUNCTION (fndecl);
523 const char *function_type = ((gimple_has_body_p (fndecl))
524 ? (lowered
525 ? (gimple_in_ssa_p (fn)
526 ? "ssa gimple"
527 : "low gimple")
528 : "high gimple")
529 : "to-be-gimplified");
530 fprintf (dump_file,
531 "Added new %s function %s to callgraph\n",
532 function_type,
533 fndecl_name (fndecl));
534 }
535
536 switch (symtab->state)
537 {
538 case PARSING:
539 cgraph_node::finalize_function (fndecl, false);
540 break;
541 case CONSTRUCTION:
542 /* Just enqueue function to be processed at nearest occurrence. */
543 node = cgraph_node::get_create (fndecl);
544 if (lowered)
545 node->lowered = true;
546 cgraph_new_nodes.safe_push (node);
547 break;
548
549 case IPA:
550 case IPA_SSA:
551 case IPA_SSA_AFTER_INLINING:
552 case EXPANSION:
553 /* Bring the function into finalized state and enqueue for later
554 analyzing and compilation. */
555 node = cgraph_node::get_create (fndecl);
556 node->local = false;
557 node->definition = true;
558 node->semantic_interposition = opt_for_fn (fndecl,
559 flag_semantic_interposition);
560 node->force_output = true;
561 if (TREE_PUBLIC (fndecl))
562 node->externally_visible = true;
563 if (!lowered && symtab->state == EXPANSION)
564 {
565 push_cfun (DECL_STRUCT_FUNCTION (fndecl));
566 gimple_register_cfg_hooks ();
567 bitmap_obstack_initialize (NULL);
568 execute_pass_list (cfun, passes->all_lowering_passes);
569 passes->execute_early_local_passes ();
570 bitmap_obstack_release (NULL);
571 pop_cfun ();
572
573 lowered = true;
574 }
575 if (lowered)
576 node->lowered = true;
577 cgraph_new_nodes.safe_push (node);
578 break;
579
580 case FINISHED:
581 /* At the very end of compilation we have to do all the work up
582 to expansion. */
583 node = cgraph_node::create (fndecl);
584 if (lowered)
585 node->lowered = true;
586 node->definition = true;
587 node->semantic_interposition = opt_for_fn (fndecl,
588 flag_semantic_interposition);
589 node->analyze ();
590 push_cfun (DECL_STRUCT_FUNCTION (fndecl));
591 gimple_register_cfg_hooks ();
592 bitmap_obstack_initialize (NULL);
593 if (!gimple_in_ssa_p (DECL_STRUCT_FUNCTION (fndecl)))
594 g->get_passes ()->execute_early_local_passes ();
595 bitmap_obstack_release (NULL);
596 pop_cfun ();
597 node->expand ();
598 break;
599
600 default:
601 gcc_unreachable ();
602 }
603
604 /* Set a personality if required and we already passed EH lowering. */
605 if (lowered
606 && (function_needs_eh_personality (DECL_STRUCT_FUNCTION (fndecl))
607 == eh_personality_lang))
608 DECL_FUNCTION_PERSONALITY (fndecl) = lang_hooks.eh_personality ();
609 }
610
611 /* Analyze the function scheduled to be output. */
612 void
analyze(void)613 cgraph_node::analyze (void)
614 {
615 if (native_rtl_p ())
616 {
617 analyzed = true;
618 return;
619 }
620
621 tree decl = this->decl;
622 location_t saved_loc = input_location;
623 input_location = DECL_SOURCE_LOCATION (decl);
624
625 if (thunk)
626 {
627 thunk_info *info = thunk_info::get (this);
628 cgraph_node *t = cgraph_node::get (info->alias);
629
630 create_edge (t, NULL, t->count);
631 callees->can_throw_external = !TREE_NOTHROW (t->decl);
632 /* Target code in expand_thunk may need the thunk's target
633 to be analyzed, so recurse here. */
634 if (!t->analyzed && t->definition)
635 t->analyze ();
636 if (t->alias)
637 {
638 t = t->get_alias_target ();
639 if (!t->analyzed && t->definition)
640 t->analyze ();
641 }
642 bool ret = expand_thunk (this, false, false);
643 thunk_info::get (this)->alias = NULL;
644 if (!ret)
645 return;
646 }
647 if (alias)
648 resolve_alias (cgraph_node::get (alias_target), transparent_alias);
649 else if (dispatcher_function)
650 {
651 /* Generate the dispatcher body of multi-versioned functions. */
652 cgraph_function_version_info *dispatcher_version_info
653 = function_version ();
654 if (dispatcher_version_info != NULL
655 && (dispatcher_version_info->dispatcher_resolver
656 == NULL_TREE))
657 {
658 tree resolver = NULL_TREE;
659 gcc_assert (targetm.generate_version_dispatcher_body);
660 resolver = targetm.generate_version_dispatcher_body (this);
661 gcc_assert (resolver != NULL_TREE);
662 }
663 }
664 else
665 {
666 push_cfun (DECL_STRUCT_FUNCTION (decl));
667
668 assign_assembler_name_if_needed (decl);
669
670 /* Make sure to gimplify bodies only once. During analyzing a
671 function we lower it, which will require gimplified nested
672 functions, so we can end up here with an already gimplified
673 body. */
674 if (!gimple_has_body_p (decl))
675 gimplify_function_tree (decl);
676
677 /* Lower the function. */
678 if (!lowered)
679 {
680 if (first_nested_function (this))
681 lower_nested_functions (decl);
682
683 gimple_register_cfg_hooks ();
684 bitmap_obstack_initialize (NULL);
685 execute_pass_list (cfun, g->get_passes ()->all_lowering_passes);
686 compact_blocks ();
687 bitmap_obstack_release (NULL);
688 lowered = true;
689 }
690
691 pop_cfun ();
692 }
693 analyzed = true;
694
695 input_location = saved_loc;
696 }
697
698 /* C++ frontend produce same body aliases all over the place, even before PCH
699 gets streamed out. It relies on us linking the aliases with their function
700 in order to do the fixups, but ipa-ref is not PCH safe. Consequently we
701 first produce aliases without links, but once C++ FE is sure he won't stream
702 PCH we build the links via this function. */
703
704 void
process_same_body_aliases(void)705 symbol_table::process_same_body_aliases (void)
706 {
707 symtab_node *node;
708 FOR_EACH_SYMBOL (node)
709 if (node->cpp_implicit_alias && !node->analyzed)
710 node->resolve_alias
711 (VAR_P (node->alias_target)
712 ? (symtab_node *)varpool_node::get_create (node->alias_target)
713 : (symtab_node *)cgraph_node::get_create (node->alias_target));
714 cpp_implicit_aliases_done = true;
715 }
716
717 /* Process a symver attribute. */
718
719 static void
process_symver_attribute(symtab_node * n)720 process_symver_attribute (symtab_node *n)
721 {
722 tree value = lookup_attribute ("symver", DECL_ATTRIBUTES (n->decl));
723
724 for (; value != NULL; value = TREE_CHAIN (value))
725 {
726 /* Starting from bintuils 2.35 gas supports:
727 # Assign foo to bar@V1 and baz@V2.
728 .symver foo, bar@V1
729 .symver foo, baz@V2
730 */
731 const char *purpose = IDENTIFIER_POINTER (TREE_PURPOSE (value));
732 if (strcmp (purpose, "symver") != 0)
733 continue;
734
735 tree symver = get_identifier_with_length
736 (TREE_STRING_POINTER (TREE_VALUE (TREE_VALUE (value))),
737 TREE_STRING_LENGTH (TREE_VALUE (TREE_VALUE (value))));
738 symtab_node *def = symtab_node::get_for_asmname (symver);
739
740 if (def)
741 {
742 error_at (DECL_SOURCE_LOCATION (n->decl),
743 "duplicate definition of a symbol version");
744 inform (DECL_SOURCE_LOCATION (def->decl),
745 "same version was previously defined here");
746 return;
747 }
748 if (!n->definition)
749 {
750 error_at (DECL_SOURCE_LOCATION (n->decl),
751 "symbol needs to be defined to have a version");
752 return;
753 }
754 if (DECL_COMMON (n->decl))
755 {
756 error_at (DECL_SOURCE_LOCATION (n->decl),
757 "common symbol cannot be versioned");
758 return;
759 }
760 if (DECL_COMDAT (n->decl))
761 {
762 error_at (DECL_SOURCE_LOCATION (n->decl),
763 "comdat symbol cannot be versioned");
764 return;
765 }
766 if (n->weakref)
767 {
768 error_at (DECL_SOURCE_LOCATION (n->decl),
769 "%<weakref%> cannot be versioned");
770 return;
771 }
772 if (!TREE_PUBLIC (n->decl))
773 {
774 error_at (DECL_SOURCE_LOCATION (n->decl),
775 "versioned symbol must be public");
776 return;
777 }
778 if (DECL_VISIBILITY (n->decl) != VISIBILITY_DEFAULT)
779 {
780 error_at (DECL_SOURCE_LOCATION (n->decl),
781 "versioned symbol must have default visibility");
782 return;
783 }
784
785 /* Create new symbol table entry representing the version. */
786 tree new_decl = copy_node (n->decl);
787
788 DECL_INITIAL (new_decl) = NULL_TREE;
789 if (TREE_CODE (new_decl) == FUNCTION_DECL)
790 DECL_STRUCT_FUNCTION (new_decl) = NULL;
791 SET_DECL_ASSEMBLER_NAME (new_decl, symver);
792 TREE_PUBLIC (new_decl) = 1;
793 DECL_ATTRIBUTES (new_decl) = NULL;
794
795 symtab_node *symver_node = symtab_node::get_create (new_decl);
796 symver_node->alias = true;
797 symver_node->definition = true;
798 symver_node->symver = true;
799 symver_node->create_reference (n, IPA_REF_ALIAS, NULL);
800 symver_node->analyzed = true;
801 }
802 }
803
804 /* Process attributes common for vars and functions. */
805
806 static void
process_common_attributes(symtab_node * node,tree decl)807 process_common_attributes (symtab_node *node, tree decl)
808 {
809 tree weakref = lookup_attribute ("weakref", DECL_ATTRIBUTES (decl));
810
811 if (weakref && !lookup_attribute ("alias", DECL_ATTRIBUTES (decl)))
812 {
813 warning_at (DECL_SOURCE_LOCATION (decl), OPT_Wattributes,
814 "%<weakref%> attribute should be accompanied with"
815 " an %<alias%> attribute");
816 DECL_WEAK (decl) = 0;
817 DECL_ATTRIBUTES (decl) = remove_attribute ("weakref",
818 DECL_ATTRIBUTES (decl));
819 }
820
821 if (lookup_attribute ("no_reorder", DECL_ATTRIBUTES (decl)))
822 node->no_reorder = 1;
823 process_symver_attribute (node);
824 }
825
826 /* Look for externally_visible and used attributes and mark cgraph nodes
827 accordingly.
828
829 We cannot mark the nodes at the point the attributes are processed (in
830 handle_*_attribute) because the copy of the declarations available at that
831 point may not be canonical. For example, in:
832
833 void f();
834 void f() __attribute__((used));
835
836 the declaration we see in handle_used_attribute will be the second
837 declaration -- but the front end will subsequently merge that declaration
838 with the original declaration and discard the second declaration.
839
840 Furthermore, we can't mark these nodes in finalize_function because:
841
842 void f() {}
843 void f() __attribute__((externally_visible));
844
845 is valid.
846
847 So, we walk the nodes at the end of the translation unit, applying the
848 attributes at that point. */
849
850 static void
process_function_and_variable_attributes(cgraph_node * first,varpool_node * first_var)851 process_function_and_variable_attributes (cgraph_node *first,
852 varpool_node *first_var)
853 {
854 cgraph_node *node;
855 varpool_node *vnode;
856
857 for (node = symtab->first_function (); node != first;
858 node = symtab->next_function (node))
859 {
860 tree decl = node->decl;
861
862 if (node->alias
863 && lookup_attribute ("flatten", DECL_ATTRIBUTES (decl)))
864 {
865 tree tdecl = node->get_alias_target_tree ();
866 if (!tdecl || !DECL_P (tdecl)
867 || !lookup_attribute ("flatten", DECL_ATTRIBUTES (tdecl)))
868 warning_at (DECL_SOURCE_LOCATION (decl), OPT_Wattributes,
869 "%<flatten%> attribute is ignored on aliases");
870 }
871 if (DECL_PRESERVE_P (decl))
872 node->mark_force_output ();
873 else if (lookup_attribute ("externally_visible", DECL_ATTRIBUTES (decl)))
874 {
875 if (! TREE_PUBLIC (node->decl))
876 warning_at (DECL_SOURCE_LOCATION (node->decl), OPT_Wattributes,
877 "%<externally_visible%>"
878 " attribute have effect only on public objects");
879 }
880 if (lookup_attribute ("weakref", DECL_ATTRIBUTES (decl))
881 && node->definition
882 && (!node->alias || DECL_INITIAL (decl) != error_mark_node))
883 {
884 /* NODE->DEFINITION && NODE->ALIAS is nonzero for valid weakref
885 function declarations; DECL_INITIAL is non-null for invalid
886 weakref functions that are also defined. */
887 warning_at (DECL_SOURCE_LOCATION (decl), OPT_Wattributes,
888 "%<weakref%> attribute ignored"
889 " because function is defined");
890 DECL_WEAK (decl) = 0;
891 DECL_ATTRIBUTES (decl) = remove_attribute ("weakref",
892 DECL_ATTRIBUTES (decl));
893 DECL_ATTRIBUTES (decl) = remove_attribute ("alias",
894 DECL_ATTRIBUTES (decl));
895 node->alias = false;
896 node->weakref = false;
897 node->transparent_alias = false;
898 }
899 else if (lookup_attribute ("alias", DECL_ATTRIBUTES (decl))
900 && node->definition
901 && !node->alias)
902 warning_at (DECL_SOURCE_LOCATION (node->decl), OPT_Wattributes,
903 "%<alias%> attribute ignored"
904 " because function is defined");
905
906 if (lookup_attribute ("always_inline", DECL_ATTRIBUTES (decl))
907 && !DECL_DECLARED_INLINE_P (decl)
908 /* redefining extern inline function makes it DECL_UNINLINABLE. */
909 && !DECL_UNINLINABLE (decl))
910 warning_at (DECL_SOURCE_LOCATION (decl), OPT_Wattributes,
911 "%<always_inline%> function might not be inlinable");
912
913 process_common_attributes (node, decl);
914 }
915 for (vnode = symtab->first_variable (); vnode != first_var;
916 vnode = symtab->next_variable (vnode))
917 {
918 tree decl = vnode->decl;
919 if (DECL_EXTERNAL (decl)
920 && DECL_INITIAL (decl))
921 varpool_node::finalize_decl (decl);
922 if (DECL_PRESERVE_P (decl))
923 vnode->force_output = true;
924 else if (lookup_attribute ("externally_visible", DECL_ATTRIBUTES (decl)))
925 {
926 if (! TREE_PUBLIC (vnode->decl))
927 warning_at (DECL_SOURCE_LOCATION (vnode->decl), OPT_Wattributes,
928 "%<externally_visible%>"
929 " attribute have effect only on public objects");
930 }
931 if (lookup_attribute ("weakref", DECL_ATTRIBUTES (decl))
932 && vnode->definition
933 && DECL_INITIAL (decl))
934 {
935 warning_at (DECL_SOURCE_LOCATION (vnode->decl), OPT_Wattributes,
936 "%<weakref%> attribute ignored"
937 " because variable is initialized");
938 DECL_WEAK (decl) = 0;
939 DECL_ATTRIBUTES (decl) = remove_attribute ("weakref",
940 DECL_ATTRIBUTES (decl));
941 }
942 process_common_attributes (vnode, decl);
943 }
944 }
945
946 /* Mark DECL as finalized. By finalizing the declaration, frontend instruct the
947 middle end to output the variable to asm file, if needed or externally
948 visible. */
949
950 void
finalize_decl(tree decl)951 varpool_node::finalize_decl (tree decl)
952 {
953 varpool_node *node = varpool_node::get_create (decl);
954
955 gcc_assert (TREE_STATIC (decl) || DECL_EXTERNAL (decl));
956
957 if (node->definition)
958 return;
959 /* Set definition first before calling notice_global_symbol so that
960 it is available to notice_global_symbol. */
961 node->definition = true;
962 node->semantic_interposition = flag_semantic_interposition;
963 notice_global_symbol (decl);
964 if (!flag_toplevel_reorder)
965 node->no_reorder = true;
966 if (TREE_THIS_VOLATILE (decl) || DECL_PRESERVE_P (decl)
967 /* Traditionally we do not eliminate static variables when not
968 optimizing and when not doing toplevel reorder. */
969 || (node->no_reorder && !DECL_COMDAT (node->decl)
970 && !DECL_ARTIFICIAL (node->decl)))
971 node->force_output = true;
972
973 if (symtab->state == CONSTRUCTION
974 && (node->needed_p () || node->referred_to_p ()))
975 enqueue_node (node);
976 if (symtab->state >= IPA_SSA)
977 node->analyze ();
978 /* Some frontends produce various interface variables after compilation
979 finished. */
980 if (symtab->state == FINISHED
981 || (node->no_reorder
982 && symtab->state == EXPANSION))
983 node->assemble_decl ();
984 }
985
986 /* EDGE is an polymorphic call. Mark all possible targets as reachable
987 and if there is only one target, perform trivial devirtualization.
988 REACHABLE_CALL_TARGETS collects target lists we already walked to
989 avoid duplicate work. */
990
991 static void
walk_polymorphic_call_targets(hash_set<void * > * reachable_call_targets,cgraph_edge * edge)992 walk_polymorphic_call_targets (hash_set<void *> *reachable_call_targets,
993 cgraph_edge *edge)
994 {
995 unsigned int i;
996 void *cache_token;
997 bool final;
998 vec <cgraph_node *>targets
999 = possible_polymorphic_call_targets
1000 (edge, &final, &cache_token);
1001
1002 if (!reachable_call_targets->add (cache_token))
1003 {
1004 if (symtab->dump_file)
1005 dump_possible_polymorphic_call_targets
1006 (symtab->dump_file, edge);
1007
1008 for (i = 0; i < targets.length (); i++)
1009 {
1010 /* Do not bother to mark virtual methods in anonymous namespace;
1011 either we will find use of virtual table defining it, or it is
1012 unused. */
1013 if (targets[i]->definition
1014 && TREE_CODE
1015 (TREE_TYPE (targets[i]->decl))
1016 == METHOD_TYPE
1017 && !type_in_anonymous_namespace_p
1018 (TYPE_METHOD_BASETYPE (TREE_TYPE (targets[i]->decl))))
1019 enqueue_node (targets[i]);
1020 }
1021 }
1022
1023 /* Very trivial devirtualization; when the type is
1024 final or anonymous (so we know all its derivation)
1025 and there is only one possible virtual call target,
1026 make the edge direct. */
1027 if (final)
1028 {
1029 if (targets.length () <= 1 && dbg_cnt (devirt))
1030 {
1031 cgraph_node *target;
1032 if (targets.length () == 1)
1033 target = targets[0];
1034 else
1035 target = cgraph_node::create
1036 (builtin_decl_implicit (BUILT_IN_UNREACHABLE));
1037
1038 if (symtab->dump_file)
1039 {
1040 fprintf (symtab->dump_file,
1041 "Devirtualizing call: ");
1042 print_gimple_stmt (symtab->dump_file,
1043 edge->call_stmt, 0,
1044 TDF_SLIM);
1045 }
1046 if (dump_enabled_p ())
1047 {
1048 dump_printf_loc (MSG_OPTIMIZED_LOCATIONS, edge->call_stmt,
1049 "devirtualizing call in %s to %s\n",
1050 edge->caller->dump_name (),
1051 target->dump_name ());
1052 }
1053
1054 edge = cgraph_edge::make_direct (edge, target);
1055 gimple *new_call = cgraph_edge::redirect_call_stmt_to_callee (edge);
1056
1057 if (symtab->dump_file)
1058 {
1059 fprintf (symtab->dump_file, "Devirtualized as: ");
1060 print_gimple_stmt (symtab->dump_file, new_call, 0, TDF_SLIM);
1061 }
1062 }
1063 }
1064 }
1065
1066 /* Issue appropriate warnings for the global declaration DECL. */
1067
1068 static void
check_global_declaration(symtab_node * snode)1069 check_global_declaration (symtab_node *snode)
1070 {
1071 const char *decl_file;
1072 tree decl = snode->decl;
1073
1074 /* Warn about any function declared static but not defined. We don't
1075 warn about variables, because many programs have static variables
1076 that exist only to get some text into the object file. */
1077 if (TREE_CODE (decl) == FUNCTION_DECL
1078 && DECL_INITIAL (decl) == 0
1079 && DECL_EXTERNAL (decl)
1080 && ! DECL_ARTIFICIAL (decl)
1081 && ! TREE_PUBLIC (decl))
1082 {
1083 if (warning_suppressed_p (decl, OPT_Wunused))
1084 ;
1085 else if (snode->referred_to_p (/*include_self=*/false))
1086 pedwarn (input_location, 0, "%q+F used but never defined", decl);
1087 else
1088 warning (OPT_Wunused_function, "%q+F declared %<static%> but never "
1089 "defined", decl);
1090 /* This symbol is effectively an "extern" declaration now. */
1091 TREE_PUBLIC (decl) = 1;
1092 }
1093
1094 /* Warn about static fns or vars defined but not used. */
1095 if (((warn_unused_function && TREE_CODE (decl) == FUNCTION_DECL)
1096 || (((warn_unused_variable && ! TREE_READONLY (decl))
1097 || (warn_unused_const_variable > 0 && TREE_READONLY (decl)
1098 && (warn_unused_const_variable == 2
1099 || (main_input_filename != NULL
1100 && (decl_file = DECL_SOURCE_FILE (decl)) != NULL
1101 && filename_cmp (main_input_filename,
1102 decl_file) == 0))))
1103 && VAR_P (decl)))
1104 && ! DECL_IN_SYSTEM_HEADER (decl)
1105 && ! snode->referred_to_p (/*include_self=*/false)
1106 /* This TREE_USED check is needed in addition to referred_to_p
1107 above, because the `__unused__' attribute is not being
1108 considered for referred_to_p. */
1109 && ! TREE_USED (decl)
1110 /* The TREE_USED bit for file-scope decls is kept in the identifier,
1111 to handle multiple external decls in different scopes. */
1112 && ! (DECL_NAME (decl) && TREE_USED (DECL_NAME (decl)))
1113 && ! DECL_EXTERNAL (decl)
1114 && ! DECL_ARTIFICIAL (decl)
1115 && ! DECL_ABSTRACT_ORIGIN (decl)
1116 && ! TREE_PUBLIC (decl)
1117 /* A volatile variable might be used in some non-obvious way. */
1118 && (! VAR_P (decl) || ! TREE_THIS_VOLATILE (decl))
1119 /* Global register variables must be declared to reserve them. */
1120 && ! (VAR_P (decl) && DECL_REGISTER (decl))
1121 /* Global ctors and dtors are called by the runtime. */
1122 && (TREE_CODE (decl) != FUNCTION_DECL
1123 || (!DECL_STATIC_CONSTRUCTOR (decl)
1124 && !DECL_STATIC_DESTRUCTOR (decl)))
1125 /* Otherwise, ask the language. */
1126 && lang_hooks.decls.warn_unused_global (decl))
1127 warning_at (DECL_SOURCE_LOCATION (decl),
1128 (TREE_CODE (decl) == FUNCTION_DECL)
1129 ? OPT_Wunused_function
1130 : (TREE_READONLY (decl)
1131 ? OPT_Wunused_const_variable_
1132 : OPT_Wunused_variable),
1133 "%qD defined but not used", decl);
1134 }
1135
1136 /* Discover all functions and variables that are trivially needed, analyze
1137 them as well as all functions and variables referred by them */
1138 static cgraph_node *first_analyzed;
1139 static varpool_node *first_analyzed_var;
1140
1141 /* FIRST_TIME is set to TRUE for the first time we are called for a
1142 translation unit from finalize_compilation_unit() or false
1143 otherwise. */
1144
1145 static void
analyze_functions(bool first_time)1146 analyze_functions (bool first_time)
1147 {
1148 /* Keep track of already processed nodes when called multiple times for
1149 intermodule optimization. */
1150 cgraph_node *first_handled = first_analyzed;
1151 varpool_node *first_handled_var = first_analyzed_var;
1152 hash_set<void *> reachable_call_targets;
1153
1154 symtab_node *node;
1155 symtab_node *next;
1156 int i;
1157 ipa_ref *ref;
1158 bool changed = true;
1159 location_t saved_loc = input_location;
1160
1161 bitmap_obstack_initialize (NULL);
1162 symtab->state = CONSTRUCTION;
1163 input_location = UNKNOWN_LOCATION;
1164
1165 thunk_info::process_early_thunks ();
1166
1167 /* Ugly, but the fixup cannot happen at a time same body alias is created;
1168 C++ FE is confused about the COMDAT groups being right. */
1169 if (symtab->cpp_implicit_aliases_done)
1170 FOR_EACH_SYMBOL (node)
1171 if (node->cpp_implicit_alias)
1172 node->fixup_same_cpp_alias_visibility (node->get_alias_target ());
1173 build_type_inheritance_graph ();
1174
1175 if (flag_openmp && first_time)
1176 omp_discover_implicit_declare_target ();
1177
1178 /* Analysis adds static variables that in turn adds references to new functions.
1179 So we need to iterate the process until it stabilize. */
1180 while (changed)
1181 {
1182 changed = false;
1183 process_function_and_variable_attributes (first_analyzed,
1184 first_analyzed_var);
1185
1186 /* First identify the trivially needed symbols. */
1187 for (node = symtab->first_symbol ();
1188 node != first_analyzed
1189 && node != first_analyzed_var; node = node->next)
1190 {
1191 /* Convert COMDAT group designators to IDENTIFIER_NODEs. */
1192 node->get_comdat_group_id ();
1193 if (node->needed_p ())
1194 {
1195 enqueue_node (node);
1196 if (!changed && symtab->dump_file)
1197 fprintf (symtab->dump_file, "Trivially needed symbols:");
1198 changed = true;
1199 if (symtab->dump_file)
1200 fprintf (symtab->dump_file, " %s", node->dump_asm_name ());
1201 }
1202 if (node == first_analyzed
1203 || node == first_analyzed_var)
1204 break;
1205 }
1206 symtab->process_new_functions ();
1207 first_analyzed_var = symtab->first_variable ();
1208 first_analyzed = symtab->first_function ();
1209
1210 if (changed && symtab->dump_file)
1211 fprintf (symtab->dump_file, "\n");
1212
1213 /* Lower representation, build callgraph edges and references for all trivially
1214 needed symbols and all symbols referred by them. */
1215 while (queued_nodes != &symtab_terminator)
1216 {
1217 changed = true;
1218 node = queued_nodes;
1219 queued_nodes = (symtab_node *)queued_nodes->aux;
1220 cgraph_node *cnode = dyn_cast <cgraph_node *> (node);
1221 if (cnode && cnode->definition)
1222 {
1223 cgraph_edge *edge;
1224 tree decl = cnode->decl;
1225
1226 /* ??? It is possible to create extern inline function
1227 and later using weak alias attribute to kill its body.
1228 See gcc.c-torture/compile/20011119-1.c */
1229 if (!DECL_STRUCT_FUNCTION (decl)
1230 && !cnode->alias
1231 && !cnode->thunk
1232 && !cnode->dispatcher_function)
1233 {
1234 cnode->reset ();
1235 cnode->redefined_extern_inline = true;
1236 continue;
1237 }
1238
1239 if (!cnode->analyzed)
1240 cnode->analyze ();
1241
1242 for (edge = cnode->callees; edge; edge = edge->next_callee)
1243 if (edge->callee->definition
1244 && (!DECL_EXTERNAL (edge->callee->decl)
1245 /* When not optimizing, do not try to analyze extern
1246 inline functions. Doing so is pointless. */
1247 || opt_for_fn (edge->callee->decl, optimize)
1248 /* Weakrefs needs to be preserved. */
1249 || edge->callee->alias
1250 /* always_inline functions are inlined even at -O0. */
1251 || lookup_attribute
1252 ("always_inline",
1253 DECL_ATTRIBUTES (edge->callee->decl))
1254 /* Multiversioned functions needs the dispatcher to
1255 be produced locally even for extern functions. */
1256 || edge->callee->function_version ()))
1257 enqueue_node (edge->callee);
1258 if (opt_for_fn (cnode->decl, optimize)
1259 && opt_for_fn (cnode->decl, flag_devirtualize))
1260 {
1261 cgraph_edge *next;
1262
1263 for (edge = cnode->indirect_calls; edge; edge = next)
1264 {
1265 next = edge->next_callee;
1266 if (edge->indirect_info->polymorphic)
1267 walk_polymorphic_call_targets (&reachable_call_targets,
1268 edge);
1269 }
1270 }
1271
1272 /* If decl is a clone of an abstract function,
1273 mark that abstract function so that we don't release its body.
1274 The DECL_INITIAL() of that abstract function declaration
1275 will be later needed to output debug info. */
1276 if (DECL_ABSTRACT_ORIGIN (decl))
1277 {
1278 cgraph_node *origin_node
1279 = cgraph_node::get_create (DECL_ABSTRACT_ORIGIN (decl));
1280 origin_node->used_as_abstract_origin = true;
1281 }
1282 /* Preserve a functions function context node. It will
1283 later be needed to output debug info. */
1284 if (tree fn = decl_function_context (decl))
1285 {
1286 cgraph_node *origin_node = cgraph_node::get_create (fn);
1287 enqueue_node (origin_node);
1288 }
1289 }
1290 else
1291 {
1292 varpool_node *vnode = dyn_cast <varpool_node *> (node);
1293 if (vnode && vnode->definition && !vnode->analyzed)
1294 vnode->analyze ();
1295 }
1296
1297 if (node->same_comdat_group)
1298 {
1299 symtab_node *next;
1300 for (next = node->same_comdat_group;
1301 next != node;
1302 next = next->same_comdat_group)
1303 if (!next->comdat_local_p ())
1304 enqueue_node (next);
1305 }
1306 for (i = 0; node->iterate_reference (i, ref); i++)
1307 if (ref->referred->definition
1308 && (!DECL_EXTERNAL (ref->referred->decl)
1309 || ((TREE_CODE (ref->referred->decl) != FUNCTION_DECL
1310 && optimize)
1311 || (TREE_CODE (ref->referred->decl) == FUNCTION_DECL
1312 && opt_for_fn (ref->referred->decl, optimize))
1313 || node->alias
1314 || ref->referred->alias)))
1315 enqueue_node (ref->referred);
1316 symtab->process_new_functions ();
1317 }
1318 }
1319 update_type_inheritance_graph ();
1320
1321 /* Collect entry points to the unit. */
1322 if (symtab->dump_file)
1323 {
1324 fprintf (symtab->dump_file, "\n\nInitial ");
1325 symtab->dump (symtab->dump_file);
1326 }
1327
1328 if (first_time)
1329 {
1330 symtab_node *snode;
1331 FOR_EACH_SYMBOL (snode)
1332 check_global_declaration (snode);
1333 }
1334
1335 if (symtab->dump_file)
1336 fprintf (symtab->dump_file, "\nRemoving unused symbols:");
1337
1338 for (node = symtab->first_symbol ();
1339 node != first_handled
1340 && node != first_handled_var; node = next)
1341 {
1342 next = node->next;
1343 /* For symbols declared locally we clear TREE_READONLY when emitting
1344 the constructor (if one is needed). For external declarations we can
1345 not safely assume that the type is readonly because we may be called
1346 during its construction. */
1347 if (TREE_CODE (node->decl) == VAR_DECL
1348 && TYPE_P (TREE_TYPE (node->decl))
1349 && TYPE_NEEDS_CONSTRUCTING (TREE_TYPE (node->decl))
1350 && DECL_EXTERNAL (node->decl))
1351 TREE_READONLY (node->decl) = 0;
1352 if (!node->aux && !node->referred_to_p ())
1353 {
1354 if (symtab->dump_file)
1355 fprintf (symtab->dump_file, " %s", node->dump_name ());
1356
1357 /* See if the debugger can use anything before the DECL
1358 passes away. Perhaps it can notice a DECL that is now a
1359 constant and can tag the early DIE with an appropriate
1360 attribute.
1361
1362 Otherwise, this is the last chance the debug_hooks have
1363 at looking at optimized away DECLs, since
1364 late_global_decl will subsequently be called from the
1365 contents of the now pruned symbol table. */
1366 if (VAR_P (node->decl)
1367 && !decl_function_context (node->decl))
1368 {
1369 /* We are reclaiming totally unreachable code and variables
1370 so they effectively appear as readonly. Show that to
1371 the debug machinery. */
1372 TREE_READONLY (node->decl) = 1;
1373 node->definition = false;
1374 (*debug_hooks->late_global_decl) (node->decl);
1375 }
1376
1377 node->remove ();
1378 continue;
1379 }
1380 if (cgraph_node *cnode = dyn_cast <cgraph_node *> (node))
1381 {
1382 tree decl = node->decl;
1383
1384 if (cnode->definition && !gimple_has_body_p (decl)
1385 && !cnode->alias
1386 && !cnode->thunk)
1387 cnode->reset ();
1388
1389 gcc_assert (!cnode->definition || cnode->thunk
1390 || cnode->alias
1391 || gimple_has_body_p (decl)
1392 || cnode->native_rtl_p ());
1393 gcc_assert (cnode->analyzed == cnode->definition);
1394 }
1395 node->aux = NULL;
1396 }
1397 for (;node; node = node->next)
1398 node->aux = NULL;
1399 first_analyzed = symtab->first_function ();
1400 first_analyzed_var = symtab->first_variable ();
1401 if (symtab->dump_file)
1402 {
1403 fprintf (symtab->dump_file, "\n\nReclaimed ");
1404 symtab->dump (symtab->dump_file);
1405 }
1406 bitmap_obstack_release (NULL);
1407 ggc_collect ();
1408 /* Initialize assembler name hash, in particular we want to trigger C++
1409 mangling and same body alias creation before we free DECL_ARGUMENTS
1410 used by it. */
1411 if (!seen_error ())
1412 symtab->symtab_initialize_asm_name_hash ();
1413
1414 input_location = saved_loc;
1415 }
1416
1417 /* Check declaration of the type of ALIAS for compatibility with its TARGET
1418 (which may be an ifunc resolver) and issue a diagnostic when they are
1419 not compatible according to language rules (plus a C++ extension for
1420 non-static member functions). */
1421
1422 static void
maybe_diag_incompatible_alias(tree alias,tree target)1423 maybe_diag_incompatible_alias (tree alias, tree target)
1424 {
1425 tree altype = TREE_TYPE (alias);
1426 tree targtype = TREE_TYPE (target);
1427
1428 bool ifunc = cgraph_node::get (alias)->ifunc_resolver;
1429 tree funcptr = altype;
1430
1431 if (ifunc)
1432 {
1433 /* Handle attribute ifunc first. */
1434 if (TREE_CODE (altype) == METHOD_TYPE)
1435 {
1436 /* Set FUNCPTR to the type of the alias target. If the type
1437 is a non-static member function of class C, construct a type
1438 of an ordinary function taking C* as the first argument,
1439 followed by the member function argument list, and use it
1440 instead to check for incompatibility. This conversion is
1441 not defined by the language but an extension provided by
1442 G++. */
1443
1444 tree rettype = TREE_TYPE (altype);
1445 tree args = TYPE_ARG_TYPES (altype);
1446 altype = build_function_type (rettype, args);
1447 funcptr = altype;
1448 }
1449
1450 targtype = TREE_TYPE (targtype);
1451
1452 if (POINTER_TYPE_P (targtype))
1453 {
1454 targtype = TREE_TYPE (targtype);
1455
1456 /* Only issue Wattribute-alias for conversions to void* with
1457 -Wextra. */
1458 if (VOID_TYPE_P (targtype) && !extra_warnings)
1459 return;
1460
1461 /* Proceed to handle incompatible ifunc resolvers below. */
1462 }
1463 else
1464 {
1465 funcptr = build_pointer_type (funcptr);
1466
1467 error_at (DECL_SOURCE_LOCATION (target),
1468 "%<ifunc%> resolver for %qD must return %qT",
1469 alias, funcptr);
1470 inform (DECL_SOURCE_LOCATION (alias),
1471 "resolver indirect function declared here");
1472 return;
1473 }
1474 }
1475
1476 if ((!FUNC_OR_METHOD_TYPE_P (targtype)
1477 || (prototype_p (altype)
1478 && prototype_p (targtype)
1479 && !types_compatible_p (altype, targtype))))
1480 {
1481 /* Warn for incompatibilities. Avoid warning for functions
1482 without a prototype to make it possible to declare aliases
1483 without knowing the exact type, as libstdc++ does. */
1484 if (ifunc)
1485 {
1486 funcptr = build_pointer_type (funcptr);
1487
1488 auto_diagnostic_group d;
1489 if (warning_at (DECL_SOURCE_LOCATION (target),
1490 OPT_Wattribute_alias_,
1491 "%<ifunc%> resolver for %qD should return %qT",
1492 alias, funcptr))
1493 inform (DECL_SOURCE_LOCATION (alias),
1494 "resolver indirect function declared here");
1495 }
1496 else
1497 {
1498 auto_diagnostic_group d;
1499 if (warning_at (DECL_SOURCE_LOCATION (alias),
1500 OPT_Wattribute_alias_,
1501 "%qD alias between functions of incompatible "
1502 "types %qT and %qT", alias, altype, targtype))
1503 inform (DECL_SOURCE_LOCATION (target),
1504 "aliased declaration here");
1505 }
1506 }
1507 }
1508
1509 /* Translate the ugly representation of aliases as alias pairs into nice
1510 representation in callgraph. We don't handle all cases yet,
1511 unfortunately. */
1512
1513 static void
handle_alias_pairs(void)1514 handle_alias_pairs (void)
1515 {
1516 alias_pair *p;
1517 unsigned i;
1518
1519 for (i = 0; alias_pairs && alias_pairs->iterate (i, &p);)
1520 {
1521 symtab_node *target_node = symtab_node::get_for_asmname (p->target);
1522
1523 /* Weakrefs with target not defined in current unit are easy to handle:
1524 they behave just as external variables except we need to note the
1525 alias flag to later output the weakref pseudo op into asm file. */
1526 if (!target_node
1527 && lookup_attribute ("weakref", DECL_ATTRIBUTES (p->decl)) != NULL)
1528 {
1529 symtab_node *node = symtab_node::get (p->decl);
1530 if (node)
1531 {
1532 node->alias_target = p->target;
1533 node->weakref = true;
1534 node->alias = true;
1535 node->transparent_alias = true;
1536 }
1537 alias_pairs->unordered_remove (i);
1538 continue;
1539 }
1540 else if (!target_node)
1541 {
1542 error ("%q+D aliased to undefined symbol %qE", p->decl, p->target);
1543 symtab_node *node = symtab_node::get (p->decl);
1544 if (node)
1545 node->alias = false;
1546 alias_pairs->unordered_remove (i);
1547 continue;
1548 }
1549
1550 if (DECL_EXTERNAL (target_node->decl)
1551 /* We use local aliases for C++ thunks to force the tailcall
1552 to bind locally. This is a hack - to keep it working do
1553 the following (which is not strictly correct). */
1554 && (TREE_CODE (target_node->decl) != FUNCTION_DECL
1555 || ! DECL_VIRTUAL_P (target_node->decl))
1556 && ! lookup_attribute ("weakref", DECL_ATTRIBUTES (p->decl)))
1557 {
1558 error ("%q+D aliased to external symbol %qE",
1559 p->decl, p->target);
1560 }
1561
1562 if (TREE_CODE (p->decl) == FUNCTION_DECL
1563 && target_node && is_a <cgraph_node *> (target_node))
1564 {
1565 maybe_diag_incompatible_alias (p->decl, target_node->decl);
1566
1567 maybe_diag_alias_attributes (p->decl, target_node->decl);
1568
1569 cgraph_node *src_node = cgraph_node::get (p->decl);
1570 if (src_node && src_node->definition)
1571 src_node->reset ();
1572 cgraph_node::create_alias (p->decl, target_node->decl);
1573 alias_pairs->unordered_remove (i);
1574 }
1575 else if (VAR_P (p->decl)
1576 && target_node && is_a <varpool_node *> (target_node))
1577 {
1578 varpool_node::create_alias (p->decl, target_node->decl);
1579 alias_pairs->unordered_remove (i);
1580 }
1581 else
1582 {
1583 error ("%q+D alias between function and variable is not supported",
1584 p->decl);
1585 inform (DECL_SOURCE_LOCATION (target_node->decl),
1586 "aliased declaration here");
1587
1588 alias_pairs->unordered_remove (i);
1589 }
1590 }
1591 vec_free (alias_pairs);
1592 }
1593
1594
1595 /* Figure out what functions we want to assemble. */
1596
1597 static void
mark_functions_to_output(void)1598 mark_functions_to_output (void)
1599 {
1600 bool check_same_comdat_groups = false;
1601 cgraph_node *node;
1602
1603 if (flag_checking)
1604 FOR_EACH_FUNCTION (node)
1605 gcc_assert (!node->process);
1606
1607 FOR_EACH_FUNCTION (node)
1608 {
1609 tree decl = node->decl;
1610
1611 gcc_assert (!node->process || node->same_comdat_group);
1612 if (node->process)
1613 continue;
1614
1615 /* We need to output all local functions that are used and not
1616 always inlined, as well as those that are reachable from
1617 outside the current compilation unit. */
1618 if (node->analyzed
1619 && !node->thunk
1620 && !node->alias
1621 && !node->inlined_to
1622 && !TREE_ASM_WRITTEN (decl)
1623 && !DECL_EXTERNAL (decl))
1624 {
1625 node->process = 1;
1626 if (node->same_comdat_group)
1627 {
1628 cgraph_node *next;
1629 for (next = dyn_cast<cgraph_node *> (node->same_comdat_group);
1630 next != node;
1631 next = dyn_cast<cgraph_node *> (next->same_comdat_group))
1632 if (!next->thunk && !next->alias
1633 && !next->comdat_local_p ())
1634 next->process = 1;
1635 }
1636 }
1637 else if (node->same_comdat_group)
1638 {
1639 if (flag_checking)
1640 check_same_comdat_groups = true;
1641 }
1642 else
1643 {
1644 /* We should've reclaimed all functions that are not needed. */
1645 if (flag_checking
1646 && !node->inlined_to
1647 && gimple_has_body_p (decl)
1648 /* FIXME: in ltrans unit when offline copy is outside partition but inline copies
1649 are inside partition, we can end up not removing the body since we no longer
1650 have analyzed node pointing to it. */
1651 && !node->in_other_partition
1652 && !node->alias
1653 && !node->clones
1654 && !DECL_EXTERNAL (decl))
1655 {
1656 node->debug ();
1657 internal_error ("failed to reclaim unneeded function");
1658 }
1659 gcc_assert (node->inlined_to
1660 || !gimple_has_body_p (decl)
1661 || node->in_other_partition
1662 || node->clones
1663 || DECL_ARTIFICIAL (decl)
1664 || DECL_EXTERNAL (decl));
1665
1666 }
1667
1668 }
1669 if (flag_checking && check_same_comdat_groups)
1670 FOR_EACH_FUNCTION (node)
1671 if (node->same_comdat_group && !node->process)
1672 {
1673 tree decl = node->decl;
1674 if (!node->inlined_to
1675 && gimple_has_body_p (decl)
1676 /* FIXME: in an ltrans unit when the offline copy is outside a
1677 partition but inline copies are inside a partition, we can
1678 end up not removing the body since we no longer have an
1679 analyzed node pointing to it. */
1680 && !node->in_other_partition
1681 && !node->clones
1682 && !DECL_EXTERNAL (decl))
1683 {
1684 node->debug ();
1685 internal_error ("failed to reclaim unneeded function in same "
1686 "comdat group");
1687 }
1688 }
1689 }
1690
1691 /* DECL is FUNCTION_DECL. Initialize datastructures so DECL is a function
1692 in lowered gimple form. IN_SSA is true if the gimple is in SSA.
1693
1694 Set current_function_decl and cfun to newly constructed empty function body.
1695 return basic block in the function body. */
1696
1697 basic_block
init_lowered_empty_function(tree decl,bool in_ssa,profile_count count)1698 init_lowered_empty_function (tree decl, bool in_ssa, profile_count count)
1699 {
1700 basic_block bb;
1701 edge e;
1702
1703 current_function_decl = decl;
1704 allocate_struct_function (decl, false);
1705 gimple_register_cfg_hooks ();
1706 init_empty_tree_cfg ();
1707 init_tree_ssa (cfun);
1708
1709 if (in_ssa)
1710 {
1711 init_ssa_operands (cfun);
1712 cfun->gimple_df->in_ssa_p = true;
1713 cfun->curr_properties |= PROP_ssa;
1714 }
1715
1716 DECL_INITIAL (decl) = make_node (BLOCK);
1717 BLOCK_SUPERCONTEXT (DECL_INITIAL (decl)) = decl;
1718
1719 DECL_SAVED_TREE (decl) = error_mark_node;
1720 cfun->curr_properties |= (PROP_gimple_lcf | PROP_gimple_leh | PROP_gimple_any
1721 | PROP_cfg | PROP_loops);
1722
1723 set_loops_for_fn (cfun, ggc_cleared_alloc<loops> ());
1724 init_loops_structure (cfun, loops_for_fn (cfun), 1);
1725 loops_for_fn (cfun)->state |= LOOPS_MAY_HAVE_MULTIPLE_LATCHES;
1726
1727 /* Create BB for body of the function and connect it properly. */
1728 ENTRY_BLOCK_PTR_FOR_FN (cfun)->count = count;
1729 EXIT_BLOCK_PTR_FOR_FN (cfun)->count = count;
1730 bb = create_basic_block (NULL, ENTRY_BLOCK_PTR_FOR_FN (cfun));
1731 bb->count = count;
1732 e = make_edge (ENTRY_BLOCK_PTR_FOR_FN (cfun), bb, EDGE_FALLTHRU);
1733 e->probability = profile_probability::always ();
1734 e = make_edge (bb, EXIT_BLOCK_PTR_FOR_FN (cfun), 0);
1735 e->probability = profile_probability::always ();
1736 add_bb_to_loop (bb, ENTRY_BLOCK_PTR_FOR_FN (cfun)->loop_father);
1737
1738 return bb;
1739 }
1740
1741 /* Assemble thunks and aliases associated to node. */
1742
1743 void
assemble_thunks_and_aliases(void)1744 cgraph_node::assemble_thunks_and_aliases (void)
1745 {
1746 cgraph_edge *e;
1747 ipa_ref *ref;
1748
1749 for (e = callers; e;)
1750 if (e->caller->thunk
1751 && !e->caller->inlined_to)
1752 {
1753 cgraph_node *thunk = e->caller;
1754
1755 e = e->next_caller;
1756 expand_thunk (thunk, true, false);
1757 thunk->assemble_thunks_and_aliases ();
1758 }
1759 else
1760 e = e->next_caller;
1761
1762 FOR_EACH_ALIAS (this, ref)
1763 {
1764 cgraph_node *alias = dyn_cast <cgraph_node *> (ref->referring);
1765 if (!alias->transparent_alias)
1766 {
1767 bool saved_written = TREE_ASM_WRITTEN (decl);
1768
1769 /* Force assemble_alias to really output the alias this time instead
1770 of buffering it in same alias pairs. */
1771 TREE_ASM_WRITTEN (decl) = 1;
1772 if (alias->symver)
1773 do_assemble_symver (alias->decl,
1774 DECL_ASSEMBLER_NAME (decl));
1775 else
1776 do_assemble_alias (alias->decl,
1777 DECL_ASSEMBLER_NAME (decl));
1778 alias->assemble_thunks_and_aliases ();
1779 TREE_ASM_WRITTEN (decl) = saved_written;
1780 }
1781 }
1782 }
1783
1784 /* Expand function specified by node. */
1785
1786 void
expand(void)1787 cgraph_node::expand (void)
1788 {
1789 location_t saved_loc;
1790
1791 /* We ought to not compile any inline clones. */
1792 gcc_assert (!inlined_to);
1793
1794 /* __RTL functions are compiled as soon as they are parsed, so don't
1795 do it again. */
1796 if (native_rtl_p ())
1797 return;
1798
1799 announce_function (decl);
1800 process = 0;
1801 gcc_assert (lowered);
1802
1803 /* Initialize the default bitmap obstack. */
1804 bitmap_obstack_initialize (NULL);
1805 get_untransformed_body ();
1806
1807 /* Generate RTL for the body of DECL. */
1808
1809 timevar_push (TV_REST_OF_COMPILATION);
1810
1811 gcc_assert (symtab->global_info_ready);
1812
1813 /* Initialize the RTL code for the function. */
1814 saved_loc = input_location;
1815 input_location = DECL_SOURCE_LOCATION (decl);
1816
1817 gcc_assert (DECL_STRUCT_FUNCTION (decl));
1818 push_cfun (DECL_STRUCT_FUNCTION (decl));
1819 init_function_start (decl);
1820
1821 gimple_register_cfg_hooks ();
1822
1823 bitmap_obstack_initialize (®_obstack); /* FIXME, only at RTL generation*/
1824
1825 update_ssa (TODO_update_ssa_only_virtuals);
1826 if (ipa_transforms_to_apply.exists ())
1827 execute_all_ipa_transforms (false);
1828
1829 /* Perform all tree transforms and optimizations. */
1830
1831 /* Signal the start of passes. */
1832 invoke_plugin_callbacks (PLUGIN_ALL_PASSES_START, NULL);
1833
1834 execute_pass_list (cfun, g->get_passes ()->all_passes);
1835
1836 /* Signal the end of passes. */
1837 invoke_plugin_callbacks (PLUGIN_ALL_PASSES_END, NULL);
1838
1839 bitmap_obstack_release (®_obstack);
1840
1841 /* Release the default bitmap obstack. */
1842 bitmap_obstack_release (NULL);
1843
1844 /* If requested, warn about function definitions where the function will
1845 return a value (usually of some struct or union type) which itself will
1846 take up a lot of stack space. */
1847 if (!DECL_EXTERNAL (decl) && TREE_TYPE (decl))
1848 {
1849 tree ret_type = TREE_TYPE (TREE_TYPE (decl));
1850
1851 if (ret_type && TYPE_SIZE_UNIT (ret_type)
1852 && TREE_CODE (TYPE_SIZE_UNIT (ret_type)) == INTEGER_CST
1853 && compare_tree_int (TYPE_SIZE_UNIT (ret_type),
1854 warn_larger_than_size) > 0)
1855 {
1856 unsigned int size_as_int
1857 = TREE_INT_CST_LOW (TYPE_SIZE_UNIT (ret_type));
1858
1859 if (compare_tree_int (TYPE_SIZE_UNIT (ret_type), size_as_int) == 0)
1860 warning (OPT_Wlarger_than_,
1861 "size of return value of %q+D is %u bytes",
1862 decl, size_as_int);
1863 else
1864 warning (OPT_Wlarger_than_,
1865 "size of return value of %q+D is larger than %wu bytes",
1866 decl, warn_larger_than_size);
1867 }
1868 }
1869
1870 gimple_set_body (decl, NULL);
1871 if (DECL_STRUCT_FUNCTION (decl) == 0)
1872 {
1873 /* Stop pointing to the local nodes about to be freed.
1874 But DECL_INITIAL must remain nonzero so we know this
1875 was an actual function definition. */
1876 if (DECL_INITIAL (decl) != 0)
1877 DECL_INITIAL (decl) = error_mark_node;
1878 }
1879
1880 input_location = saved_loc;
1881
1882 ggc_collect ();
1883 timevar_pop (TV_REST_OF_COMPILATION);
1884
1885 /* Make sure that BE didn't give up on compiling. */
1886 gcc_assert (TREE_ASM_WRITTEN (decl));
1887 if (cfun)
1888 pop_cfun ();
1889
1890 /* It would make a lot more sense to output thunks before function body to
1891 get more forward and fewer backward jumps. This however would need
1892 solving problem with comdats. See PR48668. Also aliases must come after
1893 function itself to make one pass assemblers, like one on AIX, happy.
1894 See PR 50689.
1895 FIXME: Perhaps thunks should be move before function IFF they are not in
1896 comdat groups. */
1897 assemble_thunks_and_aliases ();
1898 release_body ();
1899 }
1900
1901 /* Node comparator that is responsible for the order that corresponds
1902 to time when a function was launched for the first time. */
1903
1904 int
tp_first_run_node_cmp(const void * pa,const void * pb)1905 tp_first_run_node_cmp (const void *pa, const void *pb)
1906 {
1907 const cgraph_node *a = *(const cgraph_node * const *) pa;
1908 const cgraph_node *b = *(const cgraph_node * const *) pb;
1909 unsigned int tp_first_run_a = a->tp_first_run;
1910 unsigned int tp_first_run_b = b->tp_first_run;
1911
1912 if (!opt_for_fn (a->decl, flag_profile_reorder_functions)
1913 || a->no_reorder)
1914 tp_first_run_a = 0;
1915 if (!opt_for_fn (b->decl, flag_profile_reorder_functions)
1916 || b->no_reorder)
1917 tp_first_run_b = 0;
1918
1919 if (tp_first_run_a == tp_first_run_b)
1920 return a->order - b->order;
1921
1922 /* Functions with time profile must be before these without profile. */
1923 tp_first_run_a = (tp_first_run_a - 1) & INT_MAX;
1924 tp_first_run_b = (tp_first_run_b - 1) & INT_MAX;
1925
1926 return tp_first_run_a - tp_first_run_b;
1927 }
1928
1929 /* Expand all functions that must be output.
1930
1931 Attempt to topologically sort the nodes so function is output when
1932 all called functions are already assembled to allow data to be
1933 propagated across the callgraph. Use a stack to get smaller distance
1934 between a function and its callees (later we may choose to use a more
1935 sophisticated algorithm for function reordering; we will likely want
1936 to use subsections to make the output functions appear in top-down
1937 order). */
1938
1939 static void
expand_all_functions(void)1940 expand_all_functions (void)
1941 {
1942 cgraph_node *node;
1943 cgraph_node **order = XCNEWVEC (cgraph_node *,
1944 symtab->cgraph_count);
1945 cgraph_node **tp_first_run_order = XCNEWVEC (cgraph_node *,
1946 symtab->cgraph_count);
1947 unsigned int expanded_func_count = 0, profiled_func_count = 0;
1948 int order_pos, tp_first_run_order_pos = 0, new_order_pos = 0;
1949 int i;
1950
1951 order_pos = ipa_reverse_postorder (order);
1952 gcc_assert (order_pos == symtab->cgraph_count);
1953
1954 /* Garbage collector may remove inline clones we eliminate during
1955 optimization. So we must be sure to not reference them. */
1956 for (i = 0; i < order_pos; i++)
1957 if (order[i]->process)
1958 {
1959 if (order[i]->tp_first_run
1960 && opt_for_fn (order[i]->decl, flag_profile_reorder_functions))
1961 tp_first_run_order[tp_first_run_order_pos++] = order[i];
1962 else
1963 order[new_order_pos++] = order[i];
1964 }
1965
1966 /* First output functions with time profile in specified order. */
1967 qsort (tp_first_run_order, tp_first_run_order_pos,
1968 sizeof (cgraph_node *), tp_first_run_node_cmp);
1969 for (i = 0; i < tp_first_run_order_pos; i++)
1970 {
1971 node = tp_first_run_order[i];
1972
1973 if (node->process)
1974 {
1975 expanded_func_count++;
1976 profiled_func_count++;
1977
1978 if (symtab->dump_file)
1979 fprintf (symtab->dump_file,
1980 "Time profile order in expand_all_functions:%s:%d\n",
1981 node->dump_asm_name (), node->tp_first_run);
1982 node->process = 0;
1983 node->expand ();
1984 }
1985 }
1986
1987 /* Output functions in RPO so callees get optimized before callers. This
1988 makes ipa-ra and other propagators to work.
1989 FIXME: This is far from optimal code layout. */
1990 for (i = new_order_pos - 1; i >= 0; i--)
1991 {
1992 node = order[i];
1993
1994 if (node->process)
1995 {
1996 expanded_func_count++;
1997 node->process = 0;
1998 node->expand ();
1999 }
2000 }
2001
2002 if (dump_file)
2003 fprintf (dump_file, "Expanded functions with time profile (%s):%u/%u\n",
2004 main_input_filename, profiled_func_count, expanded_func_count);
2005
2006 if (symtab->dump_file && tp_first_run_order_pos)
2007 fprintf (symtab->dump_file, "Expanded functions with time profile:%u/%u\n",
2008 profiled_func_count, expanded_func_count);
2009
2010 symtab->process_new_functions ();
2011 free_gimplify_stack ();
2012 delete ipa_saved_clone_sources;
2013 ipa_saved_clone_sources = NULL;
2014 free (order);
2015 free (tp_first_run_order);
2016 }
2017
2018 /* This is used to sort the node types by the cgraph order number. */
2019
2020 enum cgraph_order_sort_kind
2021 {
2022 ORDER_FUNCTION,
2023 ORDER_VAR,
2024 ORDER_VAR_UNDEF,
2025 ORDER_ASM
2026 };
2027
2028 struct cgraph_order_sort
2029 {
2030 /* Construct from a cgraph_node. */
cgraph_order_sortcgraph_order_sort2031 cgraph_order_sort (cgraph_node *node)
2032 : kind (ORDER_FUNCTION), order (node->order)
2033 {
2034 u.f = node;
2035 }
2036
2037 /* Construct from a varpool_node. */
cgraph_order_sortcgraph_order_sort2038 cgraph_order_sort (varpool_node *node)
2039 : kind (node->definition ? ORDER_VAR : ORDER_VAR_UNDEF), order (node->order)
2040 {
2041 u.v = node;
2042 }
2043
2044 /* Construct from a asm_node. */
cgraph_order_sortcgraph_order_sort2045 cgraph_order_sort (asm_node *node)
2046 : kind (ORDER_ASM), order (node->order)
2047 {
2048 u.a = node;
2049 }
2050
2051 /* Assembly cgraph_order_sort based on its type. */
2052 void process ();
2053
2054 enum cgraph_order_sort_kind kind;
2055 union
2056 {
2057 cgraph_node *f;
2058 varpool_node *v;
2059 asm_node *a;
2060 } u;
2061 int order;
2062 };
2063
2064 /* Assembly cgraph_order_sort based on its type. */
2065
2066 void
process()2067 cgraph_order_sort::process ()
2068 {
2069 switch (kind)
2070 {
2071 case ORDER_FUNCTION:
2072 u.f->process = 0;
2073 u.f->expand ();
2074 break;
2075 case ORDER_VAR:
2076 u.v->assemble_decl ();
2077 break;
2078 case ORDER_VAR_UNDEF:
2079 assemble_undefined_decl (u.v->decl);
2080 break;
2081 case ORDER_ASM:
2082 assemble_asm (u.a->asm_str);
2083 break;
2084 default:
2085 gcc_unreachable ();
2086 }
2087 }
2088
2089 /* Compare cgraph_order_sort by order. */
2090
2091 static int
cgraph_order_cmp(const void * a_p,const void * b_p)2092 cgraph_order_cmp (const void *a_p, const void *b_p)
2093 {
2094 const cgraph_order_sort *nodea = (const cgraph_order_sort *)a_p;
2095 const cgraph_order_sort *nodeb = (const cgraph_order_sort *)b_p;
2096
2097 return nodea->order - nodeb->order;
2098 }
2099
2100 /* Output all functions, variables, and asm statements in the order
2101 according to their order fields, which is the order in which they
2102 appeared in the file. This implements -fno-toplevel-reorder. In
2103 this mode we may output functions and variables which don't really
2104 need to be output. */
2105
2106 static void
output_in_order(void)2107 output_in_order (void)
2108 {
2109 int i;
2110 cgraph_node *cnode;
2111 varpool_node *vnode;
2112 asm_node *anode;
2113 auto_vec<cgraph_order_sort> nodes;
2114 cgraph_order_sort *node;
2115
2116 FOR_EACH_DEFINED_FUNCTION (cnode)
2117 if (cnode->process && !cnode->thunk
2118 && !cnode->alias && cnode->no_reorder)
2119 nodes.safe_push (cgraph_order_sort (cnode));
2120
2121 /* There is a similar loop in symbol_table::output_variables.
2122 Please keep them in sync. */
2123 FOR_EACH_VARIABLE (vnode)
2124 if (vnode->no_reorder
2125 && !DECL_HARD_REGISTER (vnode->decl)
2126 && !DECL_HAS_VALUE_EXPR_P (vnode->decl))
2127 nodes.safe_push (cgraph_order_sort (vnode));
2128
2129 for (anode = symtab->first_asm_symbol (); anode; anode = anode->next)
2130 nodes.safe_push (cgraph_order_sort (anode));
2131
2132 /* Sort nodes by order. */
2133 nodes.qsort (cgraph_order_cmp);
2134
2135 /* In toplevel reorder mode we output all statics; mark them as needed. */
2136 FOR_EACH_VEC_ELT (nodes, i, node)
2137 if (node->kind == ORDER_VAR)
2138 node->u.v->finalize_named_section_flags ();
2139
2140 FOR_EACH_VEC_ELT (nodes, i, node)
2141 node->process ();
2142
2143 symtab->clear_asm_symbols ();
2144 }
2145
2146 static void
ipa_passes(void)2147 ipa_passes (void)
2148 {
2149 gcc::pass_manager *passes = g->get_passes ();
2150
2151 set_cfun (NULL);
2152 current_function_decl = NULL;
2153 gimple_register_cfg_hooks ();
2154 bitmap_obstack_initialize (NULL);
2155
2156 invoke_plugin_callbacks (PLUGIN_ALL_IPA_PASSES_START, NULL);
2157
2158 if (!in_lto_p)
2159 {
2160 execute_ipa_pass_list (passes->all_small_ipa_passes);
2161 if (seen_error ())
2162 return;
2163 }
2164
2165 /* This extra symtab_remove_unreachable_nodes pass tends to catch some
2166 devirtualization and other changes where removal iterate. */
2167 symtab->remove_unreachable_nodes (symtab->dump_file);
2168
2169 /* If pass_all_early_optimizations was not scheduled, the state of
2170 the cgraph will not be properly updated. Update it now. */
2171 if (symtab->state < IPA_SSA)
2172 symtab->state = IPA_SSA;
2173
2174 if (!in_lto_p)
2175 {
2176 /* Generate coverage variables and constructors. */
2177 coverage_finish ();
2178
2179 /* Process new functions added. */
2180 set_cfun (NULL);
2181 current_function_decl = NULL;
2182 symtab->process_new_functions ();
2183
2184 execute_ipa_summary_passes
2185 ((ipa_opt_pass_d *) passes->all_regular_ipa_passes);
2186 }
2187
2188 /* Some targets need to handle LTO assembler output specially. */
2189 if (flag_generate_lto || flag_generate_offload)
2190 targetm.asm_out.lto_start ();
2191
2192 if (!in_lto_p
2193 || flag_incremental_link == INCREMENTAL_LINK_LTO)
2194 {
2195 if (!quiet_flag)
2196 fprintf (stderr, "Streaming LTO\n");
2197 if (g->have_offload)
2198 {
2199 section_name_prefix = OFFLOAD_SECTION_NAME_PREFIX;
2200 lto_stream_offload_p = true;
2201 ipa_write_summaries ();
2202 lto_stream_offload_p = false;
2203 }
2204 if (flag_lto)
2205 {
2206 section_name_prefix = LTO_SECTION_NAME_PREFIX;
2207 lto_stream_offload_p = false;
2208 ipa_write_summaries ();
2209 }
2210 }
2211
2212 if (flag_generate_lto || flag_generate_offload)
2213 targetm.asm_out.lto_end ();
2214
2215 if (!flag_ltrans
2216 && ((in_lto_p && flag_incremental_link != INCREMENTAL_LINK_LTO)
2217 || !flag_lto || flag_fat_lto_objects))
2218 execute_ipa_pass_list (passes->all_regular_ipa_passes);
2219 invoke_plugin_callbacks (PLUGIN_ALL_IPA_PASSES_END, NULL);
2220
2221 bitmap_obstack_release (NULL);
2222 }
2223
2224
2225 /* Weakrefs may be associated to external decls and thus not output
2226 at expansion time. Emit all necessary aliases. */
2227
2228 void
output_weakrefs(void)2229 symbol_table::output_weakrefs (void)
2230 {
2231 symtab_node *node;
2232 FOR_EACH_SYMBOL (node)
2233 if (node->alias
2234 && !TREE_ASM_WRITTEN (node->decl)
2235 && node->weakref)
2236 {
2237 tree target;
2238
2239 /* Weakrefs are special by not requiring target definition in current
2240 compilation unit. It is thus bit hard to work out what we want to
2241 alias.
2242 When alias target is defined, we need to fetch it from symtab reference,
2243 otherwise it is pointed to by alias_target. */
2244 if (node->alias_target)
2245 target = (DECL_P (node->alias_target)
2246 ? DECL_ASSEMBLER_NAME (node->alias_target)
2247 : node->alias_target);
2248 else if (node->analyzed)
2249 target = DECL_ASSEMBLER_NAME (node->get_alias_target ()->decl);
2250 else
2251 gcc_unreachable ();
2252 do_assemble_alias (node->decl, target);
2253 }
2254 }
2255
2256 /* Perform simple optimizations based on callgraph. */
2257
2258 void
compile(void)2259 symbol_table::compile (void)
2260 {
2261 if (seen_error ())
2262 return;
2263
2264 symtab_node::checking_verify_symtab_nodes ();
2265
2266 timevar_push (TV_CGRAPHOPT);
2267 if (pre_ipa_mem_report)
2268 dump_memory_report ("Memory consumption before IPA");
2269 if (!quiet_flag)
2270 fprintf (stderr, "Performing interprocedural optimizations\n");
2271 state = IPA;
2272
2273 /* If LTO is enabled, initialize the streamer hooks needed by GIMPLE. */
2274 if (flag_generate_lto || flag_generate_offload)
2275 lto_streamer_hooks_init ();
2276
2277 /* Don't run the IPA passes if there was any error or sorry messages. */
2278 if (!seen_error ())
2279 {
2280 timevar_start (TV_CGRAPH_IPA_PASSES);
2281 ipa_passes ();
2282 timevar_stop (TV_CGRAPH_IPA_PASSES);
2283 }
2284 /* Do nothing else if any IPA pass found errors or if we are just streaming LTO. */
2285 if (seen_error ()
2286 || ((!in_lto_p || flag_incremental_link == INCREMENTAL_LINK_LTO)
2287 && flag_lto && !flag_fat_lto_objects))
2288 {
2289 timevar_pop (TV_CGRAPHOPT);
2290 return;
2291 }
2292
2293 global_info_ready = true;
2294 if (dump_file)
2295 {
2296 fprintf (dump_file, "Optimized ");
2297 symtab->dump (dump_file);
2298 }
2299 if (post_ipa_mem_report)
2300 dump_memory_report ("Memory consumption after IPA");
2301 timevar_pop (TV_CGRAPHOPT);
2302
2303 /* Output everything. */
2304 switch_to_section (text_section);
2305 (*debug_hooks->assembly_start) ();
2306 if (!quiet_flag)
2307 fprintf (stderr, "Assembling functions:\n");
2308 symtab_node::checking_verify_symtab_nodes ();
2309
2310 bitmap_obstack_initialize (NULL);
2311 execute_ipa_pass_list (g->get_passes ()->all_late_ipa_passes);
2312 bitmap_obstack_release (NULL);
2313 mark_functions_to_output ();
2314
2315 /* When weakref support is missing, we automatically translate all
2316 references to NODE to references to its ultimate alias target.
2317 The renaming mechanism uses flag IDENTIFIER_TRANSPARENT_ALIAS and
2318 TREE_CHAIN.
2319
2320 Set up this mapping before we output any assembler but once we are sure
2321 that all symbol renaming is done.
2322
2323 FIXME: All this ugliness can go away if we just do renaming at gimple
2324 level by physically rewriting the IL. At the moment we can only redirect
2325 calls, so we need infrastructure for renaming references as well. */
2326 #ifndef ASM_OUTPUT_WEAKREF
2327 symtab_node *node;
2328
2329 FOR_EACH_SYMBOL (node)
2330 if (node->alias
2331 && lookup_attribute ("weakref", DECL_ATTRIBUTES (node->decl)))
2332 {
2333 IDENTIFIER_TRANSPARENT_ALIAS
2334 (DECL_ASSEMBLER_NAME (node->decl)) = 1;
2335 TREE_CHAIN (DECL_ASSEMBLER_NAME (node->decl))
2336 = (node->alias_target ? node->alias_target
2337 : DECL_ASSEMBLER_NAME (node->get_alias_target ()->decl));
2338 }
2339 #endif
2340
2341 state = EXPANSION;
2342
2343 /* Output first asm statements and anything ordered. The process
2344 flag is cleared for these nodes, so we skip them later. */
2345 output_in_order ();
2346
2347 timevar_start (TV_CGRAPH_FUNC_EXPANSION);
2348 expand_all_functions ();
2349 timevar_stop (TV_CGRAPH_FUNC_EXPANSION);
2350
2351 output_variables ();
2352
2353 process_new_functions ();
2354 state = FINISHED;
2355 output_weakrefs ();
2356
2357 if (dump_file)
2358 {
2359 fprintf (dump_file, "\nFinal ");
2360 symtab->dump (dump_file);
2361 }
2362 if (!flag_checking)
2363 return;
2364 symtab_node::verify_symtab_nodes ();
2365 /* Double check that all inline clones are gone and that all
2366 function bodies have been released from memory. */
2367 if (!seen_error ())
2368 {
2369 cgraph_node *node;
2370 bool error_found = false;
2371
2372 FOR_EACH_DEFINED_FUNCTION (node)
2373 if (node->inlined_to
2374 || gimple_has_body_p (node->decl))
2375 {
2376 error_found = true;
2377 node->debug ();
2378 }
2379 if (error_found)
2380 internal_error ("nodes with unreleased memory found");
2381 }
2382 }
2383
2384 /* Earlydebug dump file, flags, and number. */
2385
2386 static int debuginfo_early_dump_nr;
2387 static FILE *debuginfo_early_dump_file;
2388 static dump_flags_t debuginfo_early_dump_flags;
2389
2390 /* Debug dump file, flags, and number. */
2391
2392 static int debuginfo_dump_nr;
2393 static FILE *debuginfo_dump_file;
2394 static dump_flags_t debuginfo_dump_flags;
2395
2396 /* Register the debug and earlydebug dump files. */
2397
2398 void
debuginfo_early_init(void)2399 debuginfo_early_init (void)
2400 {
2401 gcc::dump_manager *dumps = g->get_dumps ();
2402 debuginfo_early_dump_nr = dumps->dump_register (".earlydebug", "earlydebug",
2403 "earlydebug", DK_tree,
2404 OPTGROUP_NONE,
2405 false);
2406 debuginfo_dump_nr = dumps->dump_register (".debug", "debug",
2407 "debug", DK_tree,
2408 OPTGROUP_NONE,
2409 false);
2410 }
2411
2412 /* Initialize the debug and earlydebug dump files. */
2413
2414 void
debuginfo_init(void)2415 debuginfo_init (void)
2416 {
2417 gcc::dump_manager *dumps = g->get_dumps ();
2418 debuginfo_dump_file = dump_begin (debuginfo_dump_nr, NULL);
2419 debuginfo_dump_flags = dumps->get_dump_file_info (debuginfo_dump_nr)->pflags;
2420 debuginfo_early_dump_file = dump_begin (debuginfo_early_dump_nr, NULL);
2421 debuginfo_early_dump_flags
2422 = dumps->get_dump_file_info (debuginfo_early_dump_nr)->pflags;
2423 }
2424
2425 /* Finalize the debug and earlydebug dump files. */
2426
2427 void
debuginfo_fini(void)2428 debuginfo_fini (void)
2429 {
2430 if (debuginfo_dump_file)
2431 dump_end (debuginfo_dump_nr, debuginfo_dump_file);
2432 if (debuginfo_early_dump_file)
2433 dump_end (debuginfo_early_dump_nr, debuginfo_early_dump_file);
2434 }
2435
2436 /* Set dump_file to the debug dump file. */
2437
2438 void
debuginfo_start(void)2439 debuginfo_start (void)
2440 {
2441 set_dump_file (debuginfo_dump_file);
2442 }
2443
2444 /* Undo setting dump_file to the debug dump file. */
2445
2446 void
debuginfo_stop(void)2447 debuginfo_stop (void)
2448 {
2449 set_dump_file (NULL);
2450 }
2451
2452 /* Set dump_file to the earlydebug dump file. */
2453
2454 void
debuginfo_early_start(void)2455 debuginfo_early_start (void)
2456 {
2457 set_dump_file (debuginfo_early_dump_file);
2458 }
2459
2460 /* Undo setting dump_file to the earlydebug dump file. */
2461
2462 void
debuginfo_early_stop(void)2463 debuginfo_early_stop (void)
2464 {
2465 set_dump_file (NULL);
2466 }
2467
2468 /* Analyze the whole compilation unit once it is parsed completely. */
2469
2470 void
finalize_compilation_unit(void)2471 symbol_table::finalize_compilation_unit (void)
2472 {
2473 timevar_push (TV_CGRAPH);
2474
2475 /* If we're here there's no current function anymore. Some frontends
2476 are lazy in clearing these. */
2477 current_function_decl = NULL;
2478 set_cfun (NULL);
2479
2480 /* Do not skip analyzing the functions if there were errors, we
2481 miss diagnostics for following functions otherwise. */
2482
2483 /* Emit size functions we didn't inline. */
2484 finalize_size_functions ();
2485
2486 /* Mark alias targets necessary and emit diagnostics. */
2487 handle_alias_pairs ();
2488
2489 if (!quiet_flag)
2490 {
2491 fprintf (stderr, "\nAnalyzing compilation unit\n");
2492 fflush (stderr);
2493 }
2494
2495 if (flag_dump_passes)
2496 dump_passes ();
2497
2498 /* Gimplify and lower all functions, compute reachability and
2499 remove unreachable nodes. */
2500 analyze_functions (/*first_time=*/true);
2501
2502 /* Mark alias targets necessary and emit diagnostics. */
2503 handle_alias_pairs ();
2504
2505 /* Gimplify and lower thunks. */
2506 analyze_functions (/*first_time=*/false);
2507
2508 /* All nested functions should be lowered now. */
2509 nested_function_info::release ();
2510
2511 /* Offloading requires LTO infrastructure. */
2512 if (!in_lto_p && g->have_offload)
2513 flag_generate_offload = 1;
2514
2515 if (!seen_error ())
2516 {
2517 /* Give the frontends the chance to emit early debug based on
2518 what is still reachable in the TU. */
2519 (*lang_hooks.finalize_early_debug) ();
2520
2521 /* Clean up anything that needs cleaning up after initial debug
2522 generation. */
2523 debuginfo_early_start ();
2524 (*debug_hooks->early_finish) (main_input_filename);
2525 debuginfo_early_stop ();
2526 }
2527
2528 /* Finally drive the pass manager. */
2529 compile ();
2530
2531 timevar_pop (TV_CGRAPH);
2532 }
2533
2534 /* Reset all state within cgraphunit.c so that we can rerun the compiler
2535 within the same process. For use by toplev::finalize. */
2536
2537 void
cgraphunit_c_finalize(void)2538 cgraphunit_c_finalize (void)
2539 {
2540 gcc_assert (cgraph_new_nodes.length () == 0);
2541 cgraph_new_nodes.truncate (0);
2542
2543 queued_nodes = &symtab_terminator;
2544
2545 first_analyzed = NULL;
2546 first_analyzed_var = NULL;
2547 }
2548
2549 /* Creates a wrapper from cgraph_node to TARGET node. Thunk is used for this
2550 kind of wrapper method. */
2551
2552 void
create_wrapper(cgraph_node * target)2553 cgraph_node::create_wrapper (cgraph_node *target)
2554 {
2555 /* Preserve DECL_RESULT so we get right by reference flag. */
2556 tree decl_result = DECL_RESULT (decl);
2557
2558 /* Remove the function's body but keep arguments to be reused
2559 for thunk. */
2560 release_body (true);
2561 reset ();
2562
2563 DECL_UNINLINABLE (decl) = false;
2564 DECL_RESULT (decl) = decl_result;
2565 DECL_INITIAL (decl) = NULL;
2566 allocate_struct_function (decl, false);
2567 set_cfun (NULL);
2568
2569 /* Turn alias into thunk and expand it into GIMPLE representation. */
2570 definition = true;
2571 semantic_interposition = opt_for_fn (decl, flag_semantic_interposition);
2572
2573 /* Create empty thunk, but be sure we did not keep former thunk around.
2574 In that case we would need to preserve the info. */
2575 gcc_checking_assert (!thunk_info::get (this));
2576 thunk_info::get_create (this);
2577 thunk = true;
2578 create_edge (target, NULL, count);
2579 callees->can_throw_external = !TREE_NOTHROW (target->decl);
2580
2581 tree arguments = DECL_ARGUMENTS (decl);
2582
2583 while (arguments)
2584 {
2585 TREE_ADDRESSABLE (arguments) = false;
2586 arguments = TREE_CHAIN (arguments);
2587 }
2588
2589 expand_thunk (this, false, true);
2590 thunk_info::remove (this);
2591
2592 /* Inline summary set-up. */
2593 analyze ();
2594 inline_analyze_function (this);
2595 }
2596