xref: /freebsd/usr.bin/gprof/gprof.1 (revision 076ad2f8)
1.\" Copyright (c) 1983, 1990, 1993
2.\"	The Regents of the University of California.  All rights reserved.
3.\"
4.\" Redistribution and use in source and binary forms, with or without
5.\" modification, are permitted provided that the following conditions
6.\" are met:
7.\" 1. Redistributions of source code must retain the above copyright
8.\"    notice, this list of conditions and the following disclaimer.
9.\" 2. Redistributions in binary form must reproduce the above copyright
10.\"    notice, this list of conditions and the following disclaimer in the
11.\"    documentation and/or other materials provided with the distribution.
12.\" 3. Neither the name of the University nor the names of its contributors
13.\"    may be used to endorse or promote products derived from this software
14.\"    without specific prior written permission.
15.\"
16.\" THIS SOFTWARE IS PROVIDED BY THE REGENTS AND CONTRIBUTORS ``AS IS'' AND
17.\" ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
18.\" IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
19.\" ARE DISCLAIMED.  IN NO EVENT SHALL THE REGENTS OR CONTRIBUTORS BE LIABLE
20.\" FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
21.\" DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
22.\" OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
23.\" HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
24.\" LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
25.\" OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
26.\" SUCH DAMAGE.
27.\"
28.\"	@(#)gprof.1	8.1 (Berkeley) 6/6/93
29.\" $FreeBSD$
30.\"
31.Dd December 25, 2008
32.Dt GPROF 1
33.Os
34.Sh NAME
35.Nm gprof
36.Nd display call graph profile data
37.Sh SYNOPSIS
38.Nm
39.Op Fl abKlLsuz
40.Op Fl C Ar count
41.Op Fl e Ar name
42.Op Fl E Ar name
43.Op Fl f Ar name
44.Op Fl F Ar name
45.Op Fl k Ar fromname toname
46.Op Ar a.out Op Ar a.out.gmon ...
47.Sh DESCRIPTION
48The
49.Nm
50utility produces an execution profile of C, Pascal, or Fortran77 programs.
51The effect of called routines is incorporated in the profile of each caller.
52The profile data is taken from the call graph profile file
53which is created by programs that are compiled with the
54.Fl pg
55option of
56.Xr cc 1 ,
57.Xr pc 1 ,
58and
59.Xr f77 1 .
60The
61.Fl pg
62option also links in versions of the library routines
63that are compiled for profiling.
64By convention these libraries have their name suffixed with
65.Pa _p ,
66i.e., the profiled version of
67.Pa libc.a
68is
69.Pa libc_p.a
70and if you specify libraries directly to the
71compiler or linker you can use
72.Fl l Ns Ar c_p
73instead of
74.Fl l Ns Ar c .
75Read the given object file (the default is
76.Pa a.out )
77and establishes the relation between its symbol table
78and the call graph profile.
79The default graph profile file name is the name
80of the executable with the suffix
81.Pa .gmon
82appended.
83If more than one profile file is specified,
84the
85.Nm
86output shows the sum of the profile information in the given profile files.
87.Pp
88The
89.Nm
90utility calculates the amount of time spent in each routine.
91Next, these times are propagated along the edges of the call graph.
92Cycles are discovered, and calls into a cycle are made to share the time
93of the cycle.
94The first listing shows the functions
95sorted according to the time they represent
96including the time of their call graph descendants.
97Below each function entry is shown its (direct) call graph children,
98and how their times are propagated to this function.
99A similar display above the function shows how this function's time and the
100time of its descendants is propagated to its (direct) call graph parents.
101.Pp
102Cycles are also shown, with an entry for the cycle as a whole and
103a listing of the members of the cycle and their contributions to the
104time and call counts of the cycle.
105.Pp
106Second, a flat profile is given,
107similar to that provided by
108.Xr prof 1 .
109This listing gives the total execution times, the call counts,
110the time that the call spent in the routine itself, and
111the time that the call spent in the routine itself including
112its descendants.
113The units for the per-call times are normally milliseconds,
114but they are nanoseconds if the profiling clock frequency
115is 10 million or larger,
116and if a function appears to be never called then its total self time
117is printed as a percentage in the self time per call column.
118The very high profiling clock frequencies needed to get sufficient
119accuracy in the per-call times for short-lived programs are only
120implemented for
121.Dq high resolution
122(non-statistical) kernel profiling.
123.Pp
124Finally, an index of the function names is provided.
125.Pp
126The following options are available:
127.Bl -tag -width indent
128.It Fl a
129Suppress the printing of statically declared functions.
130If this option is given, all relevant information about the static function
131(e.g., time samples, calls to other functions, calls from other functions)
132belongs to the function loaded just before the static function in the
133.Pa a.out
134file.
135.It Fl b
136Suppress the printing of a description of each field in the profile.
137.It Fl C Ar count
138Find a minimal set of arcs that can be broken to eliminate all cycles with
139.Ar count
140or more members.
141Caution: the algorithm used to break cycles is exponential,
142so using this option may cause
143.Nm
144to run for a very long time.
145.It Fl e Ar name
146Suppress the printing of the graph profile entry for routine
147.Ar name
148and all its descendants
149(unless they have other ancestors that are not suppressed).
150More than one
151.Fl e
152option may be given.
153Only one
154.Ar name
155may be given with each
156.Fl e
157option.
158.It Fl E Ar name
159Suppress the printing of the graph profile entry for routine
160.Ar name
161(and its descendants) as
162.Fl e ,
163above, and also excludes the time spent in
164.Ar name
165(and its descendants) from the total and percentage time computations.
166(For example,
167.Fl E
168.Ar mcount
169.Fl E
170.Ar mcleanup
171is the default.)
172.It Fl f Ar name
173Print the graph profile entry of only the specified routine
174.Ar name
175and its descendants.
176More than one
177.Fl f
178option may be given.
179Only one
180.Ar name
181may be given with each
182.Fl f
183option.
184.It Fl F Ar name
185Print the graph profile entry of only the routine
186.Ar name
187and its descendants (as
188.Fl f ,
189above) and also uses only the times of the printed routines
190in total time and percentage computations.
191More than one
192.Fl F
193option may be given.
194Only one
195.Ar name
196may be given with each
197.Fl F
198option.
199The
200.Fl F
201option
202overrides
203the
204.Fl E
205option.
206.It Fl k Ar fromname Ar toname
207Will delete any arcs from routine
208.Ar fromname
209to routine
210.Ar toname .
211This can be used to break undesired cycles.
212More than one
213.Fl k
214option may be given.
215Only one pair of routine names may be given with each
216.Fl k
217option.
218.It Fl K
219Gather information about symbols from the currently-running kernel using the
220.Xr sysctl 3
221and
222.Xr kldsym 2
223interfaces.
224This forces the
225.Pa a.out
226argument to be ignored, and allows for symbols in
227.Xr kld 4
228modules to be used.
229.It Fl l
230Suppress the printing of the call-graph profile.
231.It Fl L
232Suppress the printing of the flat profile.
233.It Fl s
234A profile file
235.Pa gmon.sum
236is produced that represents
237the sum of the profile information in all the specified profile files.
238This summary profile file may be given to later
239executions of gprof (probably also with a
240.Fl s )
241to accumulate profile data across several runs of an
242.Pa a.out
243file.
244.It Fl u
245Suppress the printing of functions whose names are not visible to
246C programs.
247For the ELF object format, this means names that
248contain the
249.Ql .\&
250character.
251For the a.out object format, it means names that do not
252begin with a
253.Ql _
254character.
255All relevant information about such functions belongs to the
256(non-suppressed) function with the next lowest address.
257This is useful for eliminating "functions" that are just labels
258inside other functions.
259.It Fl z
260Display routines that have zero usage (as shown by call counts
261and accumulated time).
262.El
263.Sh FILES
264.Bl -tag -width a.out.gmon -compact
265.It Pa a.out
266The namelist and text space.
267.It Pa a.out.gmon
268Dynamic call graph and profile.
269.It Pa gmon.sum
270Summarized dynamic call graph and profile.
271.El
272.Sh SEE ALSO
273.Xr cc 1 ,
274.Xr profil 2 ,
275.Xr clocks 7
276.\" .Xr monitor 3 ,
277.\" .Xr prof 1
278.Rs
279.%T "An Execution Profiler for Modular Programs"
280.%A S. Graham
281.%A P. Kessler
282.%A M. McKusick
283.%J "Software - Practice and Experience"
284.%V 13
285.%P pp. 671-685
286.%D 1983
287.Re
288.Rs
289.%T "gprof: A Call Graph Execution Profiler"
290.%A S. Graham
291.%A P. Kessler
292.%A M. McKusick
293.%J "Proceedings of the SIGPLAN '82 Symposium on Compiler Construction, SIGPLAN Notices"
294.%V 17
295.%N 6
296.%P pp. 120-126
297.%D June 1982
298.Re
299.Sh HISTORY
300The
301.Nm
302profiler
303appeared in
304.Bx 4.2 .
305.Sh BUGS
306The granularity of the sampling is shown, but remains
307statistical at best.
308We assume that the time for each execution of a function
309can be expressed by the total time for the function divided
310by the number of times the function is called.
311Thus the time propagated along the call graph arcs to the function's
312parents is directly proportional to the number of times that
313arc is traversed.
314.Pp
315Parents that are not themselves profiled will have the time of
316their profiled children propagated to them, but they will appear
317to be spontaneously invoked in the call graph listing, and will
318not have their time propagated further.
319Similarly, signal catchers, even though profiled, will appear
320to be spontaneous (although for more obscure reasons).
321Any profiled children of signal catchers should have their times
322propagated properly, unless the signal catcher was invoked during
323the execution of the profiling routine, in which case all is lost.
324.Pp
325The profiled program must call
326.Xr exit 3
327or return normally for the profiling information to be saved
328in the graph profile file.
329