1.. SPDX-License-Identifier: GPL-2.0
2
3======================
4Histogram Design Notes
5======================
6
7:Author: Tom Zanussi <zanussi@kernel.org>
8
9This document attempts to provide a description of how the ftrace
10histograms work and how the individual pieces map to the data
11structures used to implement them in trace_events_hist.c and
12tracing_map.c.
13
14Note: All the ftrace histogram command examples assume the working
15      directory is the ftrace /tracing directory. For example::
16
17	# cd /sys/kernel/debug/tracing
18
Also, the histogram output displayed for those commands will
generally be truncated - only enough to make the point is displayed.
21
22'hist_debug' trace event files
23==============================
24
25If the kernel is compiled with CONFIG_HIST_TRIGGERS_DEBUG set, an
26event file named 'hist_debug' will appear in each event's
27subdirectory.  This file can be read at any time and will display some
28of the hist trigger internals described in this document. Specific
29examples and output will be described in test cases below.
30
31Basic histograms
32================
33
34First, basic histograms.  Below is pretty much the simplest thing you
35can do with histograms - create one with a single key on a single
36event and cat the output::
37
38  # echo 'hist:keys=pid' >> events/sched/sched_waking/trigger
39
40  # cat events/sched/sched_waking/hist
41
42  { pid:      18249 } hitcount:          1
43  { pid:      13399 } hitcount:          1
44  { pid:      17973 } hitcount:          1
45  { pid:      12572 } hitcount:          1
46  ...
47  { pid:         10 } hitcount:        921
48  { pid:      18255 } hitcount:       1444
49  { pid:      25526 } hitcount:       2055
50  { pid:       5257 } hitcount:       2055
51  { pid:      27367 } hitcount:       2055
52  { pid:       1728 } hitcount:       2161
53
54  Totals:
55    Hits: 21305
56    Entries: 183
57    Dropped: 0
58
What this does is create a histogram on the sched_waking event using
pid as a key and with a single value, hitcount, which exists for every
histogram even when it is not explicitly specified.
62
63The hitcount value is a per-bucket value that's automatically
64incremented on every hit for the given key, which in this case is the
65pid.
66
67So in this histogram, there's a separate bucket for each pid, and each
68bucket contains a value for that bucket, counting the number of times
69sched_waking was called for that pid.
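
As a rough user-space illustration (not kernel code), the bucketing
described above behaves like a counter keyed by pid.  The event list
and the ``count_hits`` name are made up for this sketch:

```python
from collections import Counter


def count_hits(waking_pids):
    """Return a pid -> hitcount mapping, one bucket per distinct pid."""
    buckets = Counter()
    for pid in waking_pids:
        # Every event whose key lands in this bucket bumps its hitcount.
        buckets[pid] += 1
    return buckets


# Hypothetical stream of sched_waking events, identified only by pid.
events = [10, 1728, 10, 18249, 1728, 10]
hist = count_hits(events)
for pid, hitcount in sorted(hist.items(), key=lambda kv: kv[1]):
    print(f"{{ pid: {pid:10d} }} hitcount: {hitcount:10d}")
```

The real histogram keeps these buckets in a preallocated hash table
rather than a growable dictionary, which is why its totals also report
a Dropped count.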
70
71Each histogram is represented by a hist_data struct.
72
73To keep track of each key and value field in the histogram, hist_data
74keeps an array of these fields named fields[].  The fields[] array is
75an array containing struct hist_field representations of each
76histogram val and key in the histogram (variables are also included
here, but are discussed later). The above histogram therefore has one
key and one value.  The value is hitcount, which every histogram has
whether or not the trigger names it explicitly; the trigger above does
not.
81
82Each struct hist_field contains a pointer to the ftrace_event_field
83from the event's trace_event_file along with various bits related to
84that such as the size, offset, type, and a hist_field_fn_t function,
85which is used to grab the field's data from the ftrace event buffer
86(in most cases - some hist_fields such as hitcount don't directly map
87to an event field in the trace buffer - in these cases the function
88implementation gets its value from somewhere else).  The flags field
89indicates which type of field it is - key, value, variable, variable
90reference, etc., with value being the default.
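
The role of the per-field function can be pictured with a small
user-space model.  Everything here is an assumption made for
illustration: the ``HistField`` class, the record layout, the
``FL_KEY`` bit value and the two accessor functions are not the
kernel's definitions:

```python
import struct
from dataclasses import dataclass
from typing import Callable, Optional

FL_KEY = 1 << 0  # illustrative flag bit, not the kernel's value


@dataclass
class HistField:
    name: str
    size: int = 8
    offset: int = 0
    flags: int = 0  # 0 stands for a plain value, the default
    fn: Optional[Callable[["HistField", bytes], int]] = None


def fn_counter(field, record):
    # hitcount has no backing event field: it always contributes 1.
    return 1


def fn_s32(field, record):
    # Read a signed 32-bit little-endian integer at the field's offset.
    return struct.unpack_from("<i", record, field.offset)[0]


# Hypothetical 8-byte record: 4 bytes of header, then a 32-bit pid.
record = struct.pack("<Ii", 0, 18249)
hitcount = HistField("hitcount", fn=fn_counter)
pid = HistField("pid", size=4, offset=4, flags=FL_KEY, fn=fn_s32)

print(hitcount.fn(hitcount, record), pid.fn(pid, record))
```

The point of the indirection is that code walking the fields does not
need to know whether a value comes from the trace record or from
somewhere else; it just calls the field's function.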
91
92The other important hist_data data structure in addition to the
93fields[] array is the tracing_map instance created for the histogram,
94which is held in the .map member.  The tracing_map implements the
95lock-free hash table used to implement histograms (see
96kernel/trace/tracing_map.h for much more discussion about the
97low-level data structures implementing the tracing_map).  For the
98purposes of this discussion, the tracing_map contains a number of
99buckets, each bucket corresponding to a particular tracing_map_elt
100object hashed by a given histogram key.
101
102Below is a diagram the first part of which describes the hist_data and
103associated key and value fields for the histogram described above.  As
104you can see, there are two fields in the fields array, one val field
105for the hitcount and one key field for the pid key.
106
Below that is a diagram of a run-time snapshot of what the tracing_map
might look like for a given run.  It attempts to show the
relationships between the hist_data fields and the tracing_map
elements for a couple of hypothetical keys and values::
111
112  +------------------+
113  | hist_data        |
114  +------------------+     +----------------+
115    | .fields[]      |---->| val = hitcount |----------------------------+
116    +----------------+     +----------------+                            |
117    | .map           |       | .size        |                            |
118    +----------------+       +--------------+                            |
119                             | .offset      |                            |
120                             +--------------+                            |
121                             | .fn()        |                            |
122                             +--------------+                            |
123                                   .                                     |
124                                   .                                     |
125                                   .                                     |
126                           +----------------+ <--- n_vals                |
127                           | key = pid      |----------------------------|--+
128                           +----------------+                            |  |
129                             | .size        |                            |  |
130                             +--------------+                            |  |
131                             | .offset      |                            |  |
132                             +--------------+                            |  |
133                             | .fn()        |                            |  |
134                           +----------------+ <--- n_fields              |  |
135                           | unused         |                            |  |
136                           +----------------+                            |  |
137                             |              |                            |  |
138                             +--------------+                            |  |
139                             |              |                            |  |
140                             +--------------+                            |  |
141                             |              |                            |  |
142                             +--------------+                            |  |
143                                            n_keys = n_fields - n_vals   |  |
144
145The hist_data n_vals and n_fields delineate the extent of the fields[]   |  |
146array and separate keys from values for the rest of the code.            |  |
147
148Below is a run-time representation of the tracing_map part of the        |  |
149histogram, with pointers from various parts of the fields[] array        |  |
150to corresponding parts of the tracing_map.                               |  |
151
152The tracing_map consists of an array of tracing_map_entrys and a set     |  |
153of preallocated tracing_map_elts (abbreviated below as map_entry and     |  |
154map_elt).  The total number of map_entrys in the hist_data.map array =   |  |
155map->max_elts (actually map->map_size but only max_elts of those are     |  |
156used.  This is a property required by the map_insert() algorithm).       |  |
157
158If a map_entry is unused, meaning no key has yet hashed into it, its     |  |
159.key value is 0 and its .val pointer is NULL.  Once a map_entry has      |  |
160been claimed, the .key value contains the key's hash value and the       |  |
161.val member points to a map_elt containing the full key and an entry     |  |
162for each key or value in the map_elt.fields[] array.  There is an        |  |
163entry in the map_elt.fields[] array corresponding to each hist_field     |  |
164in the histogram, and this is where the continually aggregated sums      |  |
165corresponding to each histogram value are kept.                          |  |
166
167The diagram attempts to show the relationship between the                |  |
168hist_data.fields[] and the map_elt.fields[] with the links drawn         |  |
169between diagrams::
170
171  +-----------+		                                                 |  |
172  | hist_data |		                                                 |  |
173  +-----------+		                                                 |  |
174    | .fields |		                                                 |  |
175    +---------+     +-----------+		                         |  |
176    | .map    |---->| map_entry |		                         |  |
177    +---------+     +-----------+		                         |  |
178                      | .key    |---> 0		                         |  |
179                      +---------+		                         |  |
180                      | .val    |---> NULL		                 |  |
181                    +-----------+                                        |  |
182                    | map_entry |                                        |  |
183                    +-----------+                                        |  |
184                      | .key    |---> pid = 999                          |  |
185                      +---------+    +-----------+                       |  |
186                      | .val    |--->| map_elt   |                       |  |
187                      +---------+    +-----------+                       |  |
188                           .           | .key    |---> full key *        |  |
189                           .           +---------+    +---------------+  |  |
190			   .           | .fields |--->| .sum (val)    |<-+  |
191                    +-----------+      +---------+    | 2345          |  |  |
192                    | map_entry |                     +---------------+  |  |
193                    +-----------+                     | .offset (key) |<----+
194                      | .key    |---> 0               | 0             |  |  |
195                      +---------+                     +---------------+  |  |
196                      | .val    |---> NULL                    .          |  |
197                    +-----------+                             .          |  |
198                    | map_entry |                             .          |  |
199                    +-----------+                     +---------------+  |  |
200                      | .key    |                     | .sum (val) or |  |  |
201                      +---------+    +---------+      | .offset (key) |  |  |
202                      | .val    |--->| map_elt |      +---------------+  |  |
203                    +-----------+    +---------+      | .sum (val) or |  |  |
204                    | map_entry |                     | .offset (key) |  |  |
205                    +-----------+                     +---------------+  |  |
206                      | .key    |---> pid = 4444                         |  |
207                      +---------+    +-----------+                       |  |
208                      | .val    |    | map_elt   |                       |  |
209                      +---------+    +-----------+                       |  |
210                                       | .key    |---> full key *        |  |
211                                       +---------+    +---------------+  |  |
212			               | .fields |--->| .sum (val)    |<-+  |
213                                       +---------+    | 65523         |     |
214                                                      +---------------+     |
215                                                      | .offset (key) |<----+
216                                                      | 0             |
217                                                      +---------------+
218                                                              .
219                                                              .
220                                                              .
221                                                      +---------------+
222                                                      | .sum (val) or |
223                                                      | .offset (key) |
224                                                      +---------------+
225                                                      | .sum (val) or |
226                                                      | .offset (key) |
227                                                      +---------------+
228
229Abbreviations used in the diagrams::
230
231  hist_data = struct hist_trigger_data
232  hist_data.fields = struct hist_field
233  fn = hist_field_fn_t
234  map_entry = struct tracing_map_entry
235  map_elt = struct tracing_map_elt
236  map_elt.fields = struct tracing_map_field
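
The entry and element relationship drawn above can be approximated in
user space as follows.  This is only a sketch under stated
assumptions: it is single-threaded (the real map is lock-free), it
uses Python's ``hash()`` and linear probing, and it assumes the entry
array is twice the size of the element pool.  The ``ToyMap``,
``MapEntry`` and ``MapElt`` names are invented here:

```python
class MapElt:
    def __init__(self, n_fields):
        self.key = None             # full key, filled in when claimed
        self.sums = [0] * n_fields  # stands in for map_elt.fields[].sum


class MapEntry:
    def __init__(self):
        self.key = 0     # 0 means "unused"
        self.val = None  # becomes a MapElt once claimed


class ToyMap:
    def __init__(self, max_elts, n_fields):
        # Assumption: more entries than elements, so a probe always
        # finds either the key or an unused entry.
        self.entries = [MapEntry() for _ in range(max_elts * 2)]
        self.free_elts = [MapElt(n_fields) for _ in range(max_elts)]
        self.drops = 0

    def insert(self, key):
        """Return the MapElt for key, claiming a free one if needed."""
        key_hash = hash(key) or 1  # never store 0, it marks "unused"
        idx = key_hash % len(self.entries)
        for _ in range(len(self.entries)):
            entry = self.entries[idx]
            if entry.key == key_hash and entry.val.key == key:
                return entry.val          # key already present
            if entry.key == 0:
                if not self.free_elts:    # element pool exhausted
                    self.drops += 1
                    return None
                elt = self.free_elts.pop()
                elt.key = key
                entry.key, entry.val = key_hash, elt
                return elt
            idx = (idx + 1) % len(self.entries)
        return None


toy = ToyMap(max_elts=2, n_fields=2)
toy.insert(999).sums[0] += 1
toy.insert(999).sums[0] += 1
print(toy.insert(999).sums)
```

Because the element pool is smaller than the entry array, running out
of elements rather than entries is what ends insertion in this model;
the ``drops`` counter loosely corresponds to the Dropped total in the
histogram output.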
237
238Whenever a new event occurs and it has a hist trigger associated with
239it, event_hist_trigger() is called.  event_hist_trigger() first deals
240with the key: for each subkey in the key (in the above example, there
241is just one subkey corresponding to pid), the hist_field that
242represents that subkey is retrieved from hist_data.fields[] and the
243hist_field_fn_t fn() associated with that field, along with the
244field's size and offset, is used to grab that subkey's data from the
245current trace record.
246
247Once the complete key has been retrieved, it's used to look that key
248up in the tracing_map.  If there's no tracing_map_elt associated with
249that key, an empty one is claimed and inserted in the map for the new
250key.  In either case, the tracing_map_elt associated with that key is
251returned.
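
A minimal sketch of these two steps, assuming a two-subkey key and an
invented record layout, might look like the following.  The
``SUBKEYS`` table, ``build_key()`` and ``lookup_or_claim()`` are
hypothetical names, and a plain dictionary stands in for the
tracing_map:

```python
import struct

# (name, offset in the trace record, size, offset in the compound key)
# These layouts are invented for the example.
SUBKEYS = [
    ("common_pid", 4, 4, 0),
    ("call_site", 8, 8, 8),
]
KEY_SIZE = 16


def build_key(record):
    """Copy each subkey out of the record into one fixed-size key."""
    key = bytearray(KEY_SIZE)
    for _name, rec_off, size, key_off in SUBKEYS:
        key[key_off:key_off + size] = record[rec_off:rec_off + size]
    return bytes(key)


def lookup_or_claim(table, key):
    """Return the element for key, creating an empty one on first use."""
    return table.setdefault(key, {"sums": [0]})


# Hypothetical records: 4-byte header, 32-bit pid, 64-bit call site.
rec_a = struct.pack("<IiQ", 0, 999, 0xFFFFFFFF81000000)
rec_b = struct.pack("<IiQ", 0, 999, 0xFFFFFFFF81000000)
rec_c = struct.pack("<IiQ", 0, 4444, 0xFFFFFFFF81000000)

table = {}
for rec in (rec_a, rec_b, rec_c):
    lookup_or_claim(table, build_key(rec))["sums"][0] += 1
```

Records ``rec_a`` and ``rec_b`` produce the same compound key and so
share one element, while ``rec_c`` differs in one subkey and gets its
own.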
252
Once a tracing_map_elt is available, hist_trigger_elt_update() is called.
As the name implies, this updates the element, which basically means
updating the element's fields.  There's a tracing_map_field associated
with each key and value in the histogram, and each of these corresponds
to the key and value hist_fields created when the histogram was
258created.  hist_trigger_elt_update() goes through each value hist_field
259and, as for the keys, uses the hist_field's fn() and size and offset
260to grab the field's value from the current trace record.  Once it has
261that value, it simply adds that value to that field's
262continually-updated tracing_map_field.sum member.  Some hist_field
263fn()s, such as for the hitcount, don't actually grab anything from the
264trace record (the hitcount fn() just increments the counter sum by 1),
265but the idea is the same.
266
267Once all the values have been updated, hist_trigger_elt_update() is
268done and returns.  Note that there are also tracing_map_fields for
269each subkey in the key, but hist_trigger_elt_update() doesn't look at
270them or update anything - those exist only for sorting, which can
271happen later.
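
The update step can be sketched in a few lines.  This is a simplified
model, not the kernel function: records are dictionaries, the field
functions take only the record, and ``elt_update()`` is a hypothetical
stand-in for hist_trigger_elt_update():

```python
def fn_counter(record):
    # hitcount: nothing is read from the record, each event counts as 1.
    return 1


def fn_bytes_req(record):
    return record["bytes_req"]


def fn_pid(record):
    return record["common_pid"]


# Values first, keys after them, as in hist_data.fields[].
FIELDS = [fn_counter, fn_bytes_req, fn_pid]
N_VALS = 2


def elt_update(sums, record):
    """Add each value field's contribution; key slots are left alone."""
    for i in range(N_VALS):
        sums[i] += FIELDS[i](record)


sums = [0] * len(FIELDS)
for rec in ({"common_pid": 7, "bytes_req": 64},
            {"common_pid": 7, "bytes_req": 192}):
    elt_update(sums, rec)
print(sums)
```

After the two events, the hitcount slot holds 2, the bytes_req slot
holds 256, and the key slot is still 0 because the loop stops at
``N_VALS``.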
272
273Basic histogram test
274--------------------
275
276This is a good example to try.  It produces 3 value fields and 2 key
277fields in the output::
278
279  # echo 'hist:keys=common_pid,call_site.sym:values=bytes_req,bytes_alloc,hitcount' >> events/kmem/kmalloc/trigger
280
To see the debug data, cat the kmem/kmalloc 'hist_debug' file. It
282will show the trigger info of the histogram it corresponds to, along
283with the address of the hist_data associated with the histogram, which
284will become useful in later examples.  It then displays the number of
285total hist_fields associated with the histogram along with a count of
286how many of those correspond to keys and how many correspond to values.
287
288It then goes on to display details for each field, including the
289field's flags and the position of each field in the hist_data's
290fields[] array, which is useful information for verifying that things
291internally appear correct or not, and which again will become even
292more useful in further examples::
293
294  # cat events/kmem/kmalloc/hist_debug
295
296  # event histogram
297  #
298  # trigger info: hist:keys=common_pid,call_site.sym:vals=hitcount,bytes_req,bytes_alloc:sort=hitcount:size=2048 [active]
299  #
300
301  hist_data: 000000005e48c9a5
302
303  n_vals: 3
304  n_keys: 2
305  n_fields: 5
306
307  val fields:
308
309    hist_data->fields[0]:
310      flags:
311        VAL: HIST_FIELD_FL_HITCOUNT
312      type: u64
313      size: 8
314      is_signed: 0
315
316    hist_data->fields[1]:
317      flags:
318        VAL: normal u64 value
319      ftrace_event_field name: bytes_req
320      type: size_t
321      size: 8
322      is_signed: 0
323
324    hist_data->fields[2]:
325      flags:
326        VAL: normal u64 value
327      ftrace_event_field name: bytes_alloc
328      type: size_t
329      size: 8
330      is_signed: 0
331
332  key fields:
333
334    hist_data->fields[3]:
335      flags:
336        HIST_FIELD_FL_KEY
337      ftrace_event_field name: common_pid
338      type: int
339      size: 8
340      is_signed: 1
341
342    hist_data->fields[4]:
343      flags:
344        HIST_FIELD_FL_KEY
345      ftrace_event_field name: call_site
346      type: unsigned long
347      size: 8
348      is_signed: 0
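
The ordering seen in the debug output, with hitcount first, the other
values next and the keys last, can be reproduced with a short sketch.
``layout_fields()`` is a hypothetical helper written for this
example; it mirrors only the ordering and counts shown above, not how
the kernel parses a trigger:

```python
def layout_fields(keys, vals):
    """Order fields the way the hist_debug output lists them."""
    # hitcount comes first even if the trigger names it last.
    ordered_vals = ["hitcount"] + [v for v in vals if v != "hitcount"]
    fields = ordered_vals + list(keys)
    n_vals = len(ordered_vals)
    n_fields = len(fields)
    return fields, n_vals, n_fields - n_vals, n_fields


fields, n_vals, n_keys, n_fields = layout_fields(
    keys=["common_pid", "call_site"],
    vals=["bytes_req", "bytes_alloc", "hitcount"],
)
for idx, name in enumerate(fields):
    kind = "val" if idx < n_vals else "key"
    print(f"fields[{idx}]: {kind} {name}")
```

For the kmalloc trigger this gives n_vals of 3, n_keys of 2 and
n_fields of 5, with the keys occupying indices 3 and 4 as in the
listing.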
349
The command below can be used to clean things up for the next test::
351
352  # echo '!hist:keys=common_pid,call_site.sym:values=bytes_req,bytes_alloc,hitcount' >> events/kmem/kmalloc/trigger
353
354Variables
355=========
356
Variables allow data to be saved by one hist trigger and retrieved by
another hist trigger.  For example, a trigger on the sched_waking
event can capture a timestamp for a particular pid, and a later
sched_switch event that switches to that pid can grab the timestamp
and use it to calculate a time delta between the two events::
363
  # echo 'hist:keys=pid:ts0=common_timestamp.usecs' >> \
          events/sched/sched_waking/trigger

  # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0' >> \
          events/sched/sched_switch/trigger
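
The effect of the two triggers above can be modelled in user space
with a pair of dictionaries keyed by pid.  This is a sketch only: the
handler names and timestamps are invented, and consuming the saved
timestamp when it is read is an assumption of this model:

```python
ts0 = {}         # per-pid variable saved by the sched_waking trigger
wakeup_lat = {}  # per-pid result computed by the sched_switch trigger


def on_sched_waking(pid, timestamp_usecs):
    # keys=pid:ts0=common_timestamp.usecs
    ts0[pid] = timestamp_usecs


def on_sched_switch(next_pid, timestamp_usecs):
    # keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0
    if next_pid in ts0:
        # Consume the saved value so a stale timestamp is not reused
        # (an assumption of this sketch).
        wakeup_lat[next_pid] = timestamp_usecs - ts0.pop(next_pid)


on_sched_waking(999, 1000)
on_sched_waking(4444, 1010)
on_sched_switch(999, 1042)
on_sched_switch(4444, 1100)
on_sched_switch(5, 1200)  # never woken in this trace: no ts0, no latency
print(wakeup_lat)
```

The match between the two events works because both triggers use the
same key value: pid on the waking side and next_pid on the switch
side.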
369
370In terms of the histogram data structures, variables are implemented
371as another type of hist_field and for a given hist trigger are added
372to the hist_data.fields[] array just after all the val fields.  To
373distinguish them from the existing key and val fields, they're given a
374new flag type, HIST_FIELD_FL_VAR (abbreviated FL_VAR) and they also
375make use of a new .var.idx field member in struct hist_field, which
376maps them to an index in a new map_elt.vars[] array added to the
377map_elt specifically designed to store and retrieve variable values.
378The diagram below shows those new elements and adds a new variable
379entry, ts0, corresponding to the ts0 variable in the sched_waking
380trigger above.
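
In rough terms, a variable's index is handed out once per histogram
and then used to address a slot in every element's vars[] array.  The
sketch below models that with invented names (``ToyMapLayout``,
``add_var()``, ``Elt``) and an illustrative ``FL_VAR`` bit value; it
is not the kernel implementation:

```python
FL_VAR = 1 << 0  # illustrative bit value only


class ToyMapLayout:
    """Hands out slots in each element's vars[] array."""

    def __init__(self):
        self.n_vars = 0

    def add_var(self):
        idx = self.n_vars
        self.n_vars += 1
        return idx


class Elt:
    def __init__(self, n_vars):
        self.vars = [0] * n_vars


layout = ToyMapLayout()
ts0_field = {"name": "ts0", "flags": FL_VAR, "var_idx": layout.add_var()}

# One element per key (pid); each has its own copy of every variable.
elts = {999: Elt(layout.n_vars), 4444: Elt(layout.n_vars)}
elts[999].vars[ts0_field["var_idx"]] = 113345679876
elts[4444].vars[ts0_field["var_idx"]] = 213499240729
```

The same index selects the ts0 slot in both elements, while the stored
timestamps are independent per key.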
381
sched_waking histogram
----------------------

::
384
385  +------------------+
386  | hist_data        |<-------------------------------------------------------+
387  +------------------+   +-------------------+                                |
388    | .fields[]      |-->| val = hitcount    |                                |
389    +----------------+   +-------------------+                                |
390    | .map           |     | .size           |                                |
391    +----------------+     +-----------------+                                |
392                           | .offset         |                                |
393                           +-----------------+                                |
394                           | .fn()           |                                |
395                           +-----------------+                                |
396                           | .flags          |                                |
397                           +-----------------+                                |
398                           | .var.idx        |                                |
399                         +-------------------+                                |
400                         | var = ts0         |                                |
401                         +-------------------+                                |
402                           | .size           |                                |
403                           +-----------------+                                |
404                           | .offset         |                                |
405                           +-----------------+                                |
406                           | .fn()           |                                |
407                           +-----------------+                                |
408                           | .flags & FL_VAR |                                |
409                           +-----------------+                                |
410                           | .var.idx        |----------------------------+-+ |
411                           +-----------------+                            | | |
412			            .                                     | | |
413				    .                                     | | |
414                                    .                                     | | |
415                         +-------------------+ <--- n_vals                | | |
416                         | key = pid         |                            | | |
417                         +-------------------+                            | | |
418                           | .size           |                            | | |
419                           +-----------------+                            | | |
420                           | .offset         |                            | | |
421                           +-----------------+                            | | |
422                           | .fn()           |                            | | |
423                           +-----------------+                            | | |
424                           | .flags & FL_KEY |                            | | |
425                           +-----------------+                            | | |
426                           | .var.idx        |                            | | |
427                         +-------------------+ <--- n_fields              | | |
428                         | unused            |                            | | |
429                         +-------------------+                            | | |
430                           |                 |                            | | |
431                           +-----------------+                            | | |
432                           |                 |                            | | |
433                           +-----------------+                            | | |
434                           |                 |                            | | |
435                           +-----------------+                            | | |
436                           |                 |                            | | |
437                           +-----------------+                            | | |
438                           |                 |                            | | |
439                           +-----------------+                            | | |
440                                             n_keys = n_fields - n_vals   | | |
441                                                                          | | |
442
443This is very similar to the basic case.  In the above diagram, we can     | | |
see that a new .flags member has been added to struct hist_field, and     | | |
a new entry has been added to hist_data.fields representing the ts0       | | |
446variable.  For a normal val hist_field, .flags is just 0 (modulo          | | |
447modifier flags), but if the value is defined as a variable, the .flags    | | |
448contains a set FL_VAR bit.                                                | | |
449
450As you can see, the ts0 entry's .var.idx member contains the index        | | |
451into the tracing_map_elts' .vars[] array containing variable values.      | | |
452This idx is used whenever the value of the variable is set or read.       | | |
The map_elt.vars idx for the given variable is assigned and saved         | | |
in .var.idx by create_tracing_map_fields() after it calls                 | | |
455tracing_map_add_var().                                                    | | |
456
457Below is a representation of the histogram at run-time, which             | | |
458populates the map, along with correspondence to the above hist_data and   | | |
459hist_field data structures.                                               | | |
460
461The diagram attempts to show the relationship between the                 | | |
462hist_data.fields[] and the map_elt.fields[] and map_elt.vars[] with       | | |
463the links drawn between diagrams.  For each of the map_elts, you can      | | |
464see that the .fields[] members point to the .sum or .offset of a key      | | |
465or val and the .vars[] members point to the value of a variable.  The     | | |
466arrows between the two diagrams show the linkages between those           | | |
467tracing_map members and the field definitions in the corresponding        | | |
hist_data fields[] members::
469
470  +-----------+		                                                  | | |
471  | hist_data |		                                                  | | |
472  +-----------+		                                                  | | |
473    | .fields |		                                                  | | |
474    +---------+     +-----------+		                          | | |
475    | .map    |---->| map_entry |		                          | | |
476    +---------+     +-----------+		                          | | |
477                      | .key    |---> 0		                          | | |
478                      +---------+		                          | | |
479                      | .val    |---> NULL		                  | | |
480                    +-----------+                                         | | |
481                    | map_entry |                                         | | |
482                    +-----------+                                         | | |
483                      | .key    |---> pid = 999                           | | |
484                      +---------+    +-----------+                        | | |
485                      | .val    |--->| map_elt   |                        | | |
486                      +---------+    +-----------+                        | | |
487                           .           | .key    |---> full key *         | | |
488                           .           +---------+    +---------------+   | | |
489			   .           | .fields |--->| .sum (val)    |   | | |
490                           .           +---------+    | 2345          |   | | |
491                           .        +--| .vars   |    +---------------+   | | |
492                           .        |  +---------+    | .offset (key) |   | | |
493                           .        |                 | 0             |   | | |
494                           .        |                 +---------------+   | | |
495                           .        |                         .           | | |
496                           .        |                         .           | | |
497                           .        |                         .           | | |
498                           .        |                 +---------------+   | | |
499                           .        |                 | .sum (val) or |   | | |
500                           .        |                 | .offset (key) |   | | |
501                           .        |                 +---------------+   | | |
502                           .        |                 | .sum (val) or |   | | |
503                           .        |                 | .offset (key) |   | | |
504                           .        |                 +---------------+   | | |
505                           .        |                                     | | |
506                           .        +---------------->+---------------+   | | |
507			   .                          | ts0           |<--+ | |
508                           .                          | 113345679876  |   | | |
509                           .                          +---------------+   | | |
510                           .                          | unused        |   | | |
511                           .                          |               |   | | |
512                           .                          +---------------+   | | |
513                           .                                  .           | | |
514                           .                                  .           | | |
515                           .                                  .           | | |
516                           .                          +---------------+   | | |
517                           .                          | unused        |   | | |
518                           .                          |               |   | | |
519                           .                          +---------------+   | | |
520                           .                          | unused        |   | | |
521                           .                          |               |   | | |
522                           .                          +---------------+   | | |
523                           .                                              | | |
524                    +-----------+                                         | | |
525                    | map_entry |                                         | | |
526                    +-----------+                                         | | |
527                      | .key    |---> pid = 4444                          | | |
528                      +---------+    +-----------+                        | | |
529                      | .val    |--->| map_elt   |                        | | |
530                      +---------+    +-----------+                        | | |
531                           .           | .key    |---> full key *         | | |
532                           .           +---------+    +---------------+   | | |
533			   .           | .fields |--->| .sum (val)    |   | | |
534                                       +---------+    | 2345          |   | | |
535                                    +--| .vars   |    +---------------+   | | |
536                                    |  +---------+    | .offset (key) |   | | |
537                                    |                 | 0             |   | | |
538                                    |                 +---------------+   | | |
539                                    |                         .           | | |
540                                    |                         .           | | |
541                                    |                         .           | | |
542                                    |                 +---------------+   | | |
543                                    |                 | .sum (val) or |   | | |
544                                    |                 | .offset (key) |   | | |
545                                    |                 +---------------+   | | |
546                                    |                 | .sum (val) or |   | | |
547                                    |                 | .offset (key) |   | | |
548                                    |                 +---------------+   | | |
549                                    |                                     | | |
550                                    |                 +---------------+   | | |
                                       +---------------->| ts0           |<--+ | |
552                                                      | 213499240729  |     | |
553                                                      +---------------+     | |
554                                                      | unused        |     | |
555                                                      |               |     | |
556                                                      +---------------+     | |
557                                                              .             | |
558                                                              .             | |
559                                                              .             | |
560                                                      +---------------+     | |
561                                                      | unused        |     | |
562                                                      |               |     | |
563                                                      +---------------+     | |
564                                                      | unused        |     | |
565                                                      |               |     | |
566                                                      +---------------+     | |
567
For each used map entry, there's a map_elt whose .vars array holds
the current values of the variables associated with that histogram
entry.  So in the above, the timestamp associated with pid 999 is
113345679876, and the timestamp variable at the same var.idx for
pid 4444 is 213499240729.
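
As a rough illustration of the paragraph above, the sketch below models the per-entry .vars array in user-space Python.  It is not the kernel implementation: MapElt, set_var() and read_var() are hypothetical stand-ins for struct tracing_map_elt and its accessors, and the slot count of 4 is arbitrary.  The pids and timestamps are the ones used in the diagram:

```python
from dataclasses import dataclass, field

N_VARS = 4  # arbitrary number of variable slots for this sketch


@dataclass
class MapElt:
    """Hypothetical stand-in for struct tracing_map_elt."""
    key: int
    vars: list = field(default_factory=lambda: [0] * N_VARS)

    def set_var(self, idx, value):
        self.vars[idx] = value

    def read_var(self, idx):
        return self.vars[idx]


TS0_IDX = 0  # var.idx given to ts0 when the trigger was created

# One element per histogram entry, keyed by pid as in the diagram.
elts = {pid: MapElt(key=pid) for pid in (999, 4444)}
elts[999].set_var(TS0_IDX, 113345679876)
elts[4444].set_var(TS0_IDX, 213499240729)

# The same var.idx names the same variable in every element, but each
# element holds its own value.
for pid, elt in elts.items():
    print(pid, elt.read_var(TS0_IDX))
```

The point of the model is only that a variable is identified by its index, while its value lives separately in each histogram entry.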
573
sched_switch histogram
----------------------

The sched_switch histogram paired with the above sched_waking
histogram is shown below.  The most important aspect of the
sched_switch histogram is that it references a variable on the
sched_waking histogram above.

The histogram diagram is very similar to those displayed so far,
but it adds variable references.  You can see the normal hitcount and
key fields along with a new wakeup_lat variable implemented in the
same way as the sched_waking ts0 variable, but in addition there's an
entry with the new FL_VAR_REF (short for HIST_FIELD_FL_VAR_REF) flag.

Associated with the new var ref field are a couple of new hist_field
members, var.hist_data and var_ref_idx.  For a variable reference,
var.hist_data and var.idx together uniquely identify a particular
variable on a particular histogram.  The var_ref_idx is just the index
into the var_ref_vals[] array that caches the value of each referenced
variable whenever a hist trigger is updated.  Those cached values are
then accessed by other code, such as the trace action code, which uses
the var_ref_idx values to assign param values.

The diagram below describes the situation for the sched_switch
histogram referred to before::
599
600  # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0' >>     | |
601          events/sched/sched_switch/trigger                                 | |
602                                                                            | |
603  +------------------+                                                      | |
604  | hist_data        |                                                      | |
605  +------------------+   +-----------------------+                          | |
606    | .fields[]      |-->| val = hitcount        |                          | |
607    +----------------+   +-----------------------+                          | |
608    | .map           |     | .size               |                          | |
609    +----------------+     +---------------------+                          | |
610 +--| .var_refs[]    |     | .offset             |                          | |
611 |  +----------------+     +---------------------+                          | |
612 |                         | .fn()               |                          | |
613 |   var_ref_vals[]        +---------------------+                          | |
614 |  +-------------+        | .flags              |                          | |
615 |  | $ts0        |<---+   +---------------------+                          | |
616 |  +-------------+    |   | .var.idx            |                          | |
617 |  |             |    |   +---------------------+                          | |
618 |  +-------------+    |   | .var.hist_data      |                          | |
619 |  |             |    |   +---------------------+                          | |
620 |  +-------------+    |   | .var_ref_idx        |                          | |
621 |  |             |    | +-----------------------+                          | |
622 |  +-------------+    | | var = wakeup_lat      |                          | |
623 |         .           | +-----------------------+                          | |
624 |         .           |   | .size               |                          | |
625 |         .           |   +---------------------+                          | |
626 |  +-------------+    |   | .offset             |                          | |
627 |  |             |    |   +---------------------+                          | |
628 |  +-------------+    |   | .fn()               |                          | |
629 |  |             |    |   +---------------------+                          | |
630 |  +-------------+    |   | .flags & FL_VAR     |                          | |
631 |                     |   +---------------------+                          | |
632 |                     |   | .var.idx            |                          | |
633 |                     |   +---------------------+                          | |
634 |                     |   | .var.hist_data      |                          | |
635 |                     |   +---------------------+                          | |
636 |                     |   | .var_ref_idx        |                          | |
637 |                     |   +---------------------+                          | |
638 |                     |             .                                      | |
639 |                     |             .                                      | |
640 |                     |             .                                      | |
641 |                     | +-----------------------+ <--- n_vals              | |
642 |                     | | key = pid             |                          | |
643 |                     | +-----------------------+                          | |
644 |                     |   | .size               |                          | |
645 |                     |   +---------------------+                          | |
646 |                     |   | .offset             |                          | |
647 |                     |   +---------------------+                          | |
648 |                     |   | .fn()               |                          | |
649 |                     |   +---------------------+                          | |
650 |                     |   | .flags              |                          | |
651 |                     |   +---------------------+                          | |
652 |                     |   | .var.idx            |                          | |
653 |                     | +-----------------------+ <--- n_fields            | |
654 |                     | | unused                |                          | |
655 |                     | +-----------------------+                          | |
656 |                     |   |                     |                          | |
657 |                     |   +---------------------+                          | |
658 |                     |   |                     |                          | |
659 |                     |   +---------------------+                          | |
660 |                     |   |                     |                          | |
661 |                     |   +---------------------+                          | |
662 |                     |   |                     |                          | |
663 |                     |   +---------------------+                          | |
664 |                     |   |                     |                          | |
665 |                     |   +---------------------+                          | |
666 |                     |                         n_keys = n_fields - n_vals | |
667 |                     |                                                    | |
    |                     |                                                    | |
669 |                     | +-----------------------+                          | |
670 +---------------------->| var_ref = $ts0        |                          | |
671                       | +-----------------------+                          | |
672                       |   | .size               |                          | |
673                       |   +---------------------+                          | |
674                       |   | .offset             |                          | |
675                       |   +---------------------+                          | |
676                       |   | .fn()               |                          | |
677                       |   +---------------------+                          | |
678                       |   | .flags & FL_VAR_REF |                          | |
679                       |   +---------------------+                          | |
680                       |   | .var.idx            |--------------------------+ |
681                       |   +---------------------+                            |
682                       |   | .var.hist_data      |----------------------------+
683                       |   +---------------------+
684                       +---| .var_ref_idx        |
685                           +---------------------+
686
687Abbreviations used in the diagrams::
688
689  hist_data = struct hist_trigger_data
690  hist_data.fields = struct hist_field
691  fn = hist_field_fn_t
692  FL_KEY = HIST_FIELD_FL_KEY
693  FL_VAR = HIST_FIELD_FL_VAR
694  FL_VAR_REF = HIST_FIELD_FL_VAR_REF
695
When a hist trigger makes use of a variable, a new hist_field is
created with flag HIST_FIELD_FL_VAR_REF.  For a VAR_REF field, the
var.idx and var.hist_data take the same values as those of the
referenced variable, as do the size, type, and is_signed values.  The
VAR_REF field's .name is set to the name of the variable it
references.  If a variable reference was created using the explicit
system.event.$var_ref notation, the hist_field's system and event_name
members are also set.
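
The copying described above can be sketched in a few lines.  This is a simplified user-space model, not the code in trace_events_hist.c: HistField and make_var_ref() are hypothetical names, the flags are plain strings, and the hist_data value is simply the address printed in the hist_debug example in this document:

```python
from dataclasses import dataclass
from typing import Optional

FL_VAR = "HIST_FIELD_FL_VAR"
FL_VAR_REF = "HIST_FIELD_FL_VAR_REF"


@dataclass
class HistField:
    """Simplified, hypothetical model of struct hist_field."""
    flags: str
    name: str
    var_idx: int
    var_hist_data: int          # identifies the histogram owning the variable
    size: int
    type: str
    is_signed: bool
    system: Optional[str] = None
    event_name: Optional[str] = None


def make_var_ref(var, system=None, event_name=None):
    """Build a VAR_REF field mirroring the variable it references."""
    return HistField(
        flags=FL_VAR_REF,
        name=var.name,
        var_idx=var.var_idx,
        var_hist_data=var.var_hist_data,
        size=var.size,
        type=var.type,
        is_signed=var.is_signed,
        system=system,
        event_name=event_name,
    )


ts0 = HistField(FL_VAR, "ts0", 0, 0x9536F554, 8, "u64", False)

implicit_ref = make_var_ref(ts0)                           # $ts0
explicit_ref = make_var_ref(ts0, "sched", "sched_waking")  # sched.sched_waking.$ts0
```

Only the explicit form records the system and event name; everything else is taken from the referenced variable.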
704
So, in order to handle an event for the sched_switch histogram,
because we have a reference to a variable on another histogram, we
need to resolve all variable references first.  This is done via the
resolve_var_refs() calls made from event_hist_trigger().  These calls
walk the var_refs[] array of the hist_data representing the
sched_switch histogram.  For each reference, the referenced
variable's var.hist_data along with the current key is used to look up
the corresponding tracing_map_elt in that histogram.  Once found, the
referenced variable's var.idx is used to look up the variable's value
using tracing_map_read_var(elt, var.idx), which yields the value of
the variable for that element, ts0 in the case above.  Note that the
hist_fields representing the variable and the variable reference have
the same var.idx, so this is straightforward.
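
The lookup just described can be modelled as below.  This is a minimal sketch in user-space Python rather than the kernel function of the same name: HistData and VarRef are hypothetical types, each histogram is reduced to a dict mapping a key to that element's vars[] list, and returning None when no element exists for the key is an assumption of the sketch:

```python
from dataclasses import dataclass, field


@dataclass
class HistData:
    """Hypothetical model of one histogram: key -> per-element vars[]."""
    name: str
    elts: dict = field(default_factory=dict)


@dataclass
class VarRef:
    """Hypothetical model of a VAR_REF hist_field."""
    var_hist_data: HistData   # histogram that owns the variable
    var_idx: int              # slot in that histogram's element vars[]
    var_ref_idx: int          # slot in var_ref_vals[] for the result


def resolve_var_refs(var_refs, key):
    """Return var_ref_vals[] for key, or None if a lookup fails."""
    var_ref_vals = [0] * len(var_refs)
    for ref in var_refs:
        elt_vars = ref.var_hist_data.elts.get(key)
        if elt_vars is None:
            return None       # no matching entry in the other histogram
        var_ref_vals[ref.var_ref_idx] = elt_vars[ref.var_idx]
    return var_ref_vals


# sched_waking stored ts0 (var.idx 0) for pid 4444.
sched_waking = HistData("sched_waking", {4444: [213499240729]})

# The sched_switch trigger holds one reference, $ts0.
var_refs = [VarRef(sched_waking, var_idx=0, var_ref_idx=0)]

print(resolve_var_refs(var_refs, key=4444))   # next_pid == 4444
print(resolve_var_refs(var_refs, key=1234))   # no sched_waking entry
```

Because the reference carries the same var.idx as the variable, no translation step is needed between finding the element and reading the value.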
718
Variable and variable reference test
------------------------------------
721
722This example creates a variable on the sched_waking event, ts0, and
723uses it in the sched_switch trigger.  The sched_switch trigger also
724creates its own variable, wakeup_lat, but nothing yet uses it::
725
726  # echo 'hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
727
728  # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0' >> events/sched/sched_switch/trigger
729
730Looking at the sched_waking 'hist_debug' output, in addition to the
731normal key and value hist_fields, in the val fields section we see a
732field with the HIST_FIELD_FL_VAR flag, which indicates that that field
733represents a variable.  Note that in addition to the variable name,
734contained in the var.name field, it includes the var.idx, which is the
735index into the tracing_map_elt.vars[] array of the actual variable
736location.  Note also that the output shows that variables live in the
737same part of the hist_data->fields[] array as normal values::
738
739  # cat events/sched/sched_waking/hist_debug
740
741  # event histogram
742  #
743  # trigger info: hist:keys=pid:vals=hitcount:ts0=common_timestamp.usecs:sort=hitcount:size=2048:clock=global [active]
744  #
745
746  hist_data: 000000009536f554
747
748  n_vals: 2
749  n_keys: 1
750  n_fields: 3
751
752  val fields:
753
754    hist_data->fields[0]:
755      flags:
756        VAL: HIST_FIELD_FL_HITCOUNT
757      type: u64
758      size: 8
759      is_signed: 0
760
761    hist_data->fields[1]:
762      flags:
763        HIST_FIELD_FL_VAR
764      var.name: ts0
765      var.idx (into tracing_map_elt.vars[]): 0
766      type: u64
767      size: 8
768      is_signed: 0
769
770  key fields:
771
772    hist_data->fields[2]:
773      flags:
774        HIST_FIELD_FL_KEY
775      ftrace_event_field name: pid
776      type: pid_t
777      size: 8
778      is_signed: 1
779
780Moving on to the sched_switch trigger hist_debug output, in addition
781to the unused wakeup_lat variable, we see a new section displaying
782variable references.  Variable references are displayed in a separate
783section because in addition to being logically separate from
784variables and values, they actually live in a separate hist_data
785array, var_refs[].
786
787In this example, the sched_switch trigger has a reference to a
788variable on the sched_waking trigger, $ts0.  Looking at the details,
789we can see that the var.hist_data value of the referenced variable
790matches the previously displayed sched_waking trigger, and the var.idx
791value matches the previously displayed var.idx value for that
792variable.  Also displayed is the var_ref_idx value for that variable
793reference, which is where the value for that variable is cached for
794use when the trigger is invoked::
795
796  # cat events/sched/sched_switch/hist_debug
797
798  # event histogram
799  #
800  # trigger info: hist:keys=next_pid:vals=hitcount:wakeup_lat=common_timestamp.usecs-$ts0:sort=hitcount:size=2048:clock=global [active]
801  #
802
803  hist_data: 00000000f4ee8006
804
805  n_vals: 2
806  n_keys: 1
807  n_fields: 3
808
809  val fields:
810
811    hist_data->fields[0]:
812      flags:
813        VAL: HIST_FIELD_FL_HITCOUNT
814      type: u64
815      size: 8
816      is_signed: 0
817
818    hist_data->fields[1]:
819      flags:
820        HIST_FIELD_FL_VAR
821      var.name: wakeup_lat
822      var.idx (into tracing_map_elt.vars[]): 0
823      type: u64
824      size: 0
825      is_signed: 0
826
827  key fields:
828
829    hist_data->fields[2]:
830      flags:
831        HIST_FIELD_FL_KEY
832      ftrace_event_field name: next_pid
833      type: pid_t
834      size: 8
835      is_signed: 1
836
837  variable reference fields:
838
839    hist_data->var_refs[0]:
840      flags:
841        HIST_FIELD_FL_VAR_REF
842      name: ts0
843      var.idx (into tracing_map_elt.vars[]): 0
844      var.hist_data: 000000009536f554
845      var_ref_idx (into hist_data->var_refs[]): 0
846      type: u64
847      size: 8
848      is_signed: 0
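
The matching between the two hist_debug outputs can also be checked mechanically.  The sketch below assumes the hist_debug layout shown in this document (it is debugging output, so the format may differ between kernel versions); parse_hist_debug() is a hypothetical helper, and the two sample strings are trimmed copies of the output above:

```python
import re

WAKING = """\
hist_data: 000000009536f554

  hist_data->fields[1]:
    flags:
      HIST_FIELD_FL_VAR
    var.name: ts0
    var.idx (into tracing_map_elt.vars[]): 0
"""

SWITCH = """\
hist_data: 00000000f4ee8006

  hist_data->var_refs[0]:
    flags:
      HIST_FIELD_FL_VAR_REF
    name: ts0
    var.idx (into tracing_map_elt.vars[]): 0
    var.hist_data: 000000009536f554
"""


def parse_hist_debug(text):
    """Return (hist_data address, {field header: {attribute: value}})."""
    addr, fields, current = None, {}, None
    for line in text.splitlines():
        line = line.strip()
        if line.startswith("hist_data:"):
            addr = line.split(":", 1)[1].strip()
        elif line.startswith("hist_data->"):
            current = fields.setdefault(line.rstrip(":"), {})
        elif current is not None and ": " in line:
            name, value = line.split(": ", 1)
            # Drop the "(into ...)" annotation after names such as var.idx.
            current[re.sub(r"\s*\(.*\)$", "", name)] = value
    return addr, fields


waking_addr, waking_fields = parse_hist_debug(WAKING)
_, switch_fields = parse_hist_debug(SWITCH)

var = waking_fields["hist_data->fields[1]"]
ref = switch_fields["hist_data->var_refs[0]"]

# A reference matches a variable when both the owning histogram and
# the slot within its elements agree.
same_variable = (ref["var.hist_data"] == waking_addr
                 and ref["var.idx"] == var["var.idx"])
print(ref["name"], "->", var["var.name"], same_variable)
```

On a live system the two strings would instead be read from the respective hist_debug files.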
849
850The commands below can be used to clean things up for the next test::
851
852  # echo '!hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0' >> events/sched/sched_switch/trigger
853
854  # echo '!hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
855
Actions and Handlers
====================
858
859Adding onto the previous example, we will now do something with that
860wakeup_lat variable, namely send it and another field as a synthetic
861event.
862
863The onmatch() action below basically says that whenever we have a
864sched_switch event, if we have a matching sched_waking event, in this
865case if we have a pid in the sched_waking histogram that matches the
866next_pid field on this sched_switch event, we retrieve the
867variables specified in the wakeup_latency() trace action, and use
868them to generate a new wakeup_latency event into the trace stream.
869
Note that because of the way trace actions such as wakeup_latency()
(which could equivalently be written
trace(wakeup_latency,$wakeup_lat,next_pid)) are implemented, the
parameters specified to the trace action must be variables.  In this
case, $wakeup_lat is obviously a variable, but next_pid isn't, since
it's just naming a field in the sched_switch trace event.  Since this
is something that almost every trace() and save() action does, a
special shortcut is implemented to allow field names to be used
directly in those cases.  How it works is that under the covers, a
temporary variable is created for the named field, and this variable
is what is actually passed to the trace action.  In the code and
documentation, this type of variable is called a 'field variable'.
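
The shortcut can be pictured with a small model.  This is a simplified user-space sketch, not the kernel's action parser: convert_params() is a hypothetical helper, variables are recognised only by their leading '$', and the field set is the list of sched_switch event fields:

```python
def convert_params(params, event_fields):
    """Turn action params into variable references.

    Returns (refs, field_vars): refs names the variable each param ends
    up referencing, field_vars lists the event fields that needed a
    temporary field variable.
    """
    refs, field_vars = [], []
    for param in params:
        if param.startswith("$"):
            refs.append(param[1:])          # already a variable
        elif param in event_fields:
            field_vars.append(param)        # create a field variable ...
            refs.append(param)              # ... and reference it instead
        else:
            raise ValueError(f"unknown field or variable: {param}")
    return refs, field_vars


SCHED_SWITCH_FIELDS = {"prev_comm", "prev_pid", "prev_prio", "prev_state",
                       "next_comm", "next_pid", "next_prio"}

refs, field_vars = convert_params(["$wakeup_lat", "next_pid"],
                                  SCHED_SWITCH_FIELDS)
print(refs, field_vars)
```

After the conversion every parameter is backed by a variable, which is what the action needs.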
882
Fields on other trace events' histograms can be used as well.  In that
case we have to generate a new histogram and an unfortunately named
'synthetic_field' (the use of synthetic here has nothing to do with
synthetic events) and use that special histogram field as a variable.
887
888The diagram below illustrates the new elements described above in the
889context of the sched_switch histogram using the onmatch() handler and
890the trace() action.
891
892First, we define the wakeup_latency synthetic event::
893
894  # echo 'wakeup_latency u64 lat; pid_t pid' >> synthetic_events
895
896Next, the sched_waking hist trigger as before::
897
  # echo 'hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
900
901Finally, we create a hist trigger on the sched_switch event that
902generates a wakeup_latency() trace event.  In this case we pass
903next_pid into the wakeup_latency synthetic event invocation, which
904means it will be automatically converted into a field variable::
905
  # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid)' >> /sys/kernel/debug/tracing/events/sched/sched_switch/trigger
909
910The diagram for the sched_switch event is similar to previous examples
911but shows the additional field_vars[] array for hist_data and shows
912the linkages between the field_vars and the variables and references
913created to implement the field variables.  The details are discussed
914below::
915
916    +------------------+
917    | hist_data        |
918    +------------------+   +-----------------------+
919      | .fields[]      |-->| val = hitcount        |
920      +----------------+   +-----------------------+
921      | .map           |     | .size               |
922      +----------------+     +---------------------+
923  +---| .field_vars[]  |     | .offset             |
924  |   +----------------+     +---------------------+
925  |+--| .var_refs[]    |     | .offset             |
926  ||  +----------------+     +---------------------+
927  ||                         | .fn()               |
928  ||   var_ref_vals[]        +---------------------+
929  ||  +-------------+        | .flags              |
930  ||  | $ts0        |<---+   +---------------------+
931  ||  +-------------+    |   | .var.idx            |
932  ||  | $next_pid   |<-+ |   +---------------------+
933  ||  +-------------+  | |   | .var.hist_data      |
934  ||+>| $wakeup_lat |  | |   +---------------------+
935  ||| +-------------+  | |   | .var_ref_idx        |
936  ||| |             |  | | +-----------------------+
937  ||| +-------------+  | | | var = wakeup_lat      |
938  |||        .         | | +-----------------------+
939  |||        .         | |   | .size               |
940  |||        .         | |   +---------------------+
941  ||| +-------------+  | |   | .offset             |
942  ||| |             |  | |   +---------------------+
943  ||| +-------------+  | |   | .fn()               |
944  ||| |             |  | |   +---------------------+
945  ||| +-------------+  | |   | .flags & FL_VAR     |
946  |||                  | |   +---------------------+
947  |||                  | |   | .var.idx            |
948  |||                  | |   +---------------------+
949  |||                  | |   | .var.hist_data      |
950  |||                  | |   +---------------------+
951  |||                  | |   | .var_ref_idx        |
952  |||                  | |   +---------------------+
953  |||                  | |              .
954  |||                  | |              .
955  |||                  | |              .
956  |||                  | |              .
957  ||| +--------------+ | |              .
958  +-->| field_var    | | |              .
959   || +--------------+ | |              .
960   ||   | var        | | |              .
961   ||   +------------+ | |              .
962   ||   | val        | | |              .
963   || +--------------+ | |              .
964   || | field_var    | | |              .
965   || +--------------+ | |              .
966   ||   | var        | | |              .
967   ||   +------------+ | |              .
968   ||   | val        | | |              .
969   ||   +------------+ | |              .
970   ||         .        | |              .
971   ||         .        | |              .
972   ||         .        | | +-----------------------+ <--- n_vals
973   || +--------------+ | | | key = pid             |
974   || | field_var    | | | +-----------------------+
975   || +--------------+ | |   | .size               |
976   ||   | var        |--+|   +---------------------+
977   ||   +------------+ |||   | .offset             |
978   ||   | val        |-+||   +---------------------+
979   ||   +------------+ |||   | .fn()               |
980   ||                  |||   +---------------------+
981   ||                  |||   | .flags              |
982   ||                  |||   +---------------------+
983   ||                  |||   | .var.idx            |
984   ||                  |||   +---------------------+ <--- n_fields
985   ||                  |||
986   ||                  |||                           n_keys = n_fields - n_vals
987   ||                  ||| +-----------------------+
988   ||                  |+->| var = next_pid        |
989   ||                  | | +-----------------------+
990   ||                  | |   | .size               |
991   ||                  | |   +---------------------+
992   ||                  | |   | .offset             |
993   ||                  | |   +---------------------+
994   ||                  | |   | .flags & FL_VAR     |
995   ||                  | |   +---------------------+
996   ||                  | |   | .var.idx            |
997   ||                  | |   +---------------------+
998   ||                  | |   | .var.hist_data      |
999   ||                  | | +-----------------------+
1000   ||                  +-->| val for next_pid      |
1001   ||                  | | +-----------------------+
1002   ||                  | |   | .size               |
1003   ||                  | |   +---------------------+
1004   ||                  | |   | .offset             |
1005   ||                  | |   +---------------------+
1006   ||                  | |   | .fn()               |
1007   ||                  | |   +---------------------+
1008   ||                  | |   | .flags              |
1009   ||                  | |   +---------------------+
1010   ||                  | |   |                     |
1011   ||                  | |   +---------------------+
1012   ||                  | |
1013   ||                  | |
1014   ||                  | | +-----------------------+
1015   +|------------------|-|>| var_ref = $ts0        |
1016    |                  | | +-----------------------+
1017    |                  | |   | .size               |
1018    |                  | |   +---------------------+
1019    |                  | |   | .offset             |
1020    |                  | |   +---------------------+
1021    |                  | |   | .fn()               |
1022    |                  | |   +---------------------+
1023    |                  | |   | .flags & FL_VAR_REF |
1024    |                  | |   +---------------------+
1025    |                  | +---| .var_ref_idx        |
1026    |                  |   +-----------------------+
1027    |                  |   | var_ref = $next_pid   |
1028    |                  |   +-----------------------+
1029    |                  |     | .size               |
1030    |                  |     +---------------------+
1031    |                  |     | .offset             |
1032    |                  |     +---------------------+
1033    |                  |     | .fn()               |
1034    |                  |     +---------------------+
1035    |                  |     | .flags & FL_VAR_REF |
1036    |                  |     +---------------------+
1037    |                  +-----| .var_ref_idx        |
1038    |                      +-----------------------+
1039    |                      | var_ref = $wakeup_lat |
1040    |                      +-----------------------+
1041    |                        | .size               |
1042    |                        +---------------------+
1043    |                        | .offset             |
1044    |                        +---------------------+
1045    |                        | .fn()               |
1046    |                        +---------------------+
1047    |                        | .flags & FL_VAR_REF |
1048    |                        +---------------------+
1049    +------------------------| .var_ref_idx        |
1050                             +---------------------+
1051
As you can see, for a field variable, two hist_fields are created: one
representing the variable, in this case next_pid, and one to actually
get the value of the field from the trace stream, like a normal val
field does.  These are created separately from normal variable
creation and are saved in the hist_data->field_vars[] array.  See
below for how these are used.  In addition, a reference hist_field is
also created, which is needed to reference field variables such as
the $next_pid variable in the trace() action.
1060
Note that $wakeup_lat is also a variable reference, referencing the
value of the expression common_timestamp.usecs-$ts0, and so also needs
to have a hist_field entry representing that reference created.
1064
When hist_trigger_elt_update() is called to get the normal key and
value fields, it also calls update_field_vars(), which goes through
each field_var created for the histogram and available from
hist_data->field_vars, calls val->fn() to get the data from the
current trace record, and then uses the var's var.idx to store that
value in the appropriate tracing_map_elt's elt->vars[var.idx] slot.
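
A minimal sketch of that update step is shown below.  It is a user-space model, not the kernel function of the same name: FieldVar is a hypothetical type in which val_fn plays the role of val->fn(), the trace record is a plain dict, and the var.idx of 1 for next_pid is assumed on the basis that wakeup_lat occupies slot 0:

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class FieldVar:
    """Hypothetical model of struct field_var: a (var, val) pair."""
    var_idx: int                        # var: slot in the element's vars[]
    val_fn: Callable[[dict], object]    # val: reads the field from a record


def update_field_vars(elt_vars, field_vars, rec):
    """Store each field variable's current value at elt_vars[var_idx]."""
    for fv in field_vars:
        elt_vars[fv.var_idx] = fv.val_fn(rec)


# Slot 0 belongs to wakeup_lat, slot 1 to the next_pid field variable.
field_vars = [FieldVar(var_idx=1, val_fn=lambda rec: rec["next_pid"])]

elt_vars = [0, 0]
rec = {"next_pid": 4444, "next_comm": "bash"}   # made-up trace record
update_field_vars(elt_vars, field_vars, rec)
print(elt_vars)
```

Once the value sits in the element's vars[] it can be resolved like any other variable.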
1072
1073Once all the variables have been updated, resolve_var_refs() can be
1074called from event_hist_trigger(), and not only can our $ts0 and
1075$next_pid references be resolved but the $wakeup_lat reference as
1076well.  At this point, the trace() action can simply access the values
1077assembled in the var_ref_vals[] array and generate the trace event.
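
Putting the pieces together, the whole wakeup latency flow can be modelled as below.  All names are hypothetical and the histograms are reduced to dicts of vars[] lists; the ordering (resolve $ts0, update the element, then resolve the remaining references) is one arrangement consistent with the description above, not a transcription of event_hist_trigger():

```python
# Per-histogram element storage: key -> vars[] (all names hypothetical).
waking_elts = {}     # sched_waking: vars[0] = ts0
switch_elts = {}     # sched_switch: vars[0] = wakeup_lat, vars[1] = next_pid

trace = []           # stands in for the trace buffer


def on_sched_waking(pid, now_usecs):
    waking_elts[pid] = [now_usecs]          # ts0=common_timestamp.usecs


def on_sched_switch(next_pid, now_usecs):
    var_ref_vals = [0, 0, 0]                # $ts0, $next_pid, $wakeup_lat

    # Resolve the reference to the other histogram first; without a
    # matching sched_waking entry there is nothing to compute.
    waking_vars = waking_elts.get(next_pid)
    if waking_vars is None:
        return
    var_ref_vals[0] = waking_vars[0]

    # Update this histogram's element: the wakeup_lat expression and
    # the next_pid field variable.
    elt_vars = switch_elts.setdefault(next_pid, [0, 0])
    elt_vars[0] = now_usecs - var_ref_vals[0]
    elt_vars[1] = next_pid

    # Resolve the references to this histogram's own variables.
    var_ref_vals[1] = elt_vars[1]
    var_ref_vals[2] = elt_vars[0]

    # trace action: wakeup_latency($wakeup_lat,next_pid)
    trace.append(("wakeup_latency",
                  {"lat": var_ref_vals[2], "pid": var_ref_vals[1]}))


on_sched_waking(pid=4444, now_usecs=1_000_000)
on_sched_switch(next_pid=4444, now_usecs=1_000_042)
on_sched_switch(next_pid=5555, now_usecs=1_000_050)   # no wakeup seen
print(trace)
```

The action itself never touches the histograms; it only reads the slots of var_ref_vals[] named by each parameter's var_ref_idx.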
1078
1079The same process occurs for the field variables associated with the
1080save() action.
1081
1082Abbreviations used in the diagram::
1083
1084  hist_data = struct hist_trigger_data
1085  hist_data.fields = struct hist_field
1086  field_var = struct field_var
1087  fn = hist_field_fn_t
1088  FL_KEY = HIST_FIELD_FL_KEY
1089  FL_VAR = HIST_FIELD_FL_VAR
1090  FL_VAR_REF = HIST_FIELD_FL_VAR_REF
1091
trace() action field variable test
----------------------------------
1094
1095This example adds to the previous test example by finally making use
1096of the wakeup_lat variable, but in addition also creates a couple of
1097field variables that then are all passed to the wakeup_latency() trace
1098action via the onmatch() handler.
1099
1100First, we create the wakeup_latency synthetic event::
1101
1102  # echo 'wakeup_latency u64 lat; pid_t pid; char comm[16]' >> synthetic_events
1103
1104Next, the sched_waking trigger from previous examples::
1105
1106  # echo 'hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1107
Finally, as in the previous test example, we calculate the wakeup
latency using the $ts0 reference from the sched_waking trigger, assign
it to the wakeup_lat variable, and use it along with a couple of
sched_switch event fields, next_pid and next_comm, to generate a
wakeup_latency trace event.  The next_pid and next_comm event fields
are automatically converted into field variables for this purpose::
1114
1115  # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid,next_comm)' >> /sys/kernel/debug/tracing/events/sched/sched_switch/trigger
1116
1117The sched_waking hist_debug output shows the same data as in the
1118previous test example::
1119
1120  # cat events/sched/sched_waking/hist_debug
1121
1122  # event histogram
1123  #
1124  # trigger info: hist:keys=pid:vals=hitcount:ts0=common_timestamp.usecs:sort=hitcount:size=2048:clock=global [active]
1125  #
1126
1127  hist_data: 00000000d60ff61f
1128
1129  n_vals: 2
1130  n_keys: 1
1131  n_fields: 3
1132
1133  val fields:
1134
1135    hist_data->fields[0]:
1136      flags:
1137        VAL: HIST_FIELD_FL_HITCOUNT
1138      type: u64
1139      size: 8
1140      is_signed: 0
1141
1142    hist_data->fields[1]:
1143      flags:
1144        HIST_FIELD_FL_VAR
1145      var.name: ts0
1146      var.idx (into tracing_map_elt.vars[]): 0
1147      type: u64
1148      size: 8
1149      is_signed: 0
1150
1151  key fields:
1152
1153    hist_data->fields[2]:
1154      flags:
1155        HIST_FIELD_FL_KEY
1156      ftrace_event_field name: pid
1157      type: pid_t
1158      size: 8
1159      is_signed: 1
1160
The sched_switch hist_debug output shows the same key and value fields
as in the previous test example.  Note that wakeup_lat is still in the
val fields section, but that the new field variables are not there:
although the field variables are variables, they're held separately in
the hist_data's field_vars[] array.  Although the field variables and
the normal variables are located in separate places, you can see that
the actual variable locations for those variables in the
tracing_map_elt.vars[] do have increasing indices as expected:
wakeup_lat takes the var.idx = 0 slot, while the field variables for
next_pid and next_comm have values var.idx = 1 and var.idx = 2.  Note
also that those are the same values displayed for the variable
references corresponding to those variables in the variable reference
fields section.  Since there are two triggers and thus two hist_data
addresses, those addresses also need to be accounted for when doing
the matching: you can see that the first variable reference refers to
var.idx 0 on the previous hist trigger (see the hist_data address
associated with that trigger), while the second refers to var.idx 0 on
the sched_switch hist trigger itself.  All the remaining variable
references also refer to variables on the sched_switch hist trigger.
1180
1181Finally, the action tracking variables section just shows the system
1182and event name for the onmatch() handler::
1183
1184  # cat events/sched/sched_switch/hist_debug
1185
1186  # event histogram
1187  #
1188  # trigger info: hist:keys=next_pid:vals=hitcount:wakeup_lat=common_timestamp.usecs-$ts0:sort=hitcount:size=2048:clock=global:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid,next_comm) [active]
1189  #
1190
1191  hist_data: 0000000008f551b7
1192
1193  n_vals: 2
1194  n_keys: 1
1195  n_fields: 3
1196
1197  val fields:
1198
1199    hist_data->fields[0]:
1200      flags:
1201        VAL: HIST_FIELD_FL_HITCOUNT
1202      type: u64
1203      size: 8
1204      is_signed: 0
1205
1206    hist_data->fields[1]:
1207      flags:
1208        HIST_FIELD_FL_VAR
1209      var.name: wakeup_lat
1210      var.idx (into tracing_map_elt.vars[]): 0
1211      type: u64
1212      size: 0
1213      is_signed: 0
1214
1215  key fields:
1216
1217    hist_data->fields[2]:
1218      flags:
1219        HIST_FIELD_FL_KEY
1220      ftrace_event_field name: next_pid
1221      type: pid_t
1222      size: 8
1223      is_signed: 1
1224
1225  variable reference fields:
1226
1227    hist_data->var_refs[0]:
1228      flags:
1229        HIST_FIELD_FL_VAR_REF
1230      name: ts0
1231      var.idx (into tracing_map_elt.vars[]): 0
1232      var.hist_data: 00000000d60ff61f
1233      var_ref_idx (into hist_data->var_refs[]): 0
1234      type: u64
1235      size: 8
1236      is_signed: 0
1237
1238    hist_data->var_refs[1]:
1239      flags:
1240        HIST_FIELD_FL_VAR_REF
1241      name: wakeup_lat
1242      var.idx (into tracing_map_elt.vars[]): 0
1243      var.hist_data: 0000000008f551b7
1244      var_ref_idx (into hist_data->var_refs[]): 1
1245      type: u64
1246      size: 0
1247      is_signed: 0
1248
1249    hist_data->var_refs[2]:
1250      flags:
1251        HIST_FIELD_FL_VAR_REF
1252      name: next_pid
1253      var.idx (into tracing_map_elt.vars[]): 1
1254      var.hist_data: 0000000008f551b7
1255      var_ref_idx (into hist_data->var_refs[]): 2
1256      type: pid_t
1257      size: 4
1258      is_signed: 0
1259
1260    hist_data->var_refs[3]:
1261      flags:
1262        HIST_FIELD_FL_VAR_REF
1263      name: next_comm
1264      var.idx (into tracing_map_elt.vars[]): 2
1265      var.hist_data: 0000000008f551b7
1266      var_ref_idx (into hist_data->var_refs[]): 3
1267      type: char[16]
1268      size: 256
1269      is_signed: 0
1270
1271  field variables:
1272
1273    hist_data->field_vars[0]:
1274
1275      field_vars[0].var:
1276      flags:
1277        HIST_FIELD_FL_VAR
1278      var.name: next_pid
1279      var.idx (into tracing_map_elt.vars[]): 1
1280
1281      field_vars[0].val:
1282      ftrace_event_field name: next_pid
1283      type: pid_t
1284      size: 4
1285      is_signed: 1
1286
1287    hist_data->field_vars[1]:
1288
1289      field_vars[1].var:
1290      flags:
1291        HIST_FIELD_FL_VAR
1292      var.name: next_comm
1293      var.idx (into tracing_map_elt.vars[]): 2
1294
1295      field_vars[1].val:
1296      ftrace_event_field name: next_comm
1297      type: char[16]
1298      size: 256
1299      is_signed: 0
1300
1301  action tracking variables (for onmax()/onchange()/onmatch()):
1302
1303    hist_data->actions[0].match_data.event_system: sched
1304    hist_data->actions[0].match_data.event: sched_waking
1305
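When reading longer hist_debug output, it can help to pull out each
variable reference together with the hist_data address and var.idx it
points at.  The following is a minimal sketch in Python, assuming only
the text layout shown above; parse_var_refs() and SAMPLE are
hypothetical names, and SAMPLE is an abbreviated excerpt of the output
above rather than a complete dump.

.. code-block:: python

  import re

  SAMPLE = """
    variable reference fields:

      hist_data->var_refs[0]:
        name: ts0
        var.idx (into tracing_map_elt.vars[]): 0
        var.hist_data: 00000000d60ff61f

      hist_data->var_refs[1]:
        name: wakeup_lat
        var.idx (into tracing_map_elt.vars[]): 0
        var.hist_data: 0000000008f551b7

      hist_data->var_refs[2]:
        name: next_pid
        var.idx (into tracing_map_elt.vars[]): 1
        var.hist_data: 0000000008f551b7

    field variables:

      hist_data->field_vars[0]:
        var.idx (into tracing_map_elt.vars[]): 1
  """


  def parse_var_refs(text):
      """Return (name, var.hist_data, var.idx) for each var_refs[] entry."""
      refs = []
      current = None
      for raw in text.splitlines():
          line = raw.strip()
          if line.startswith("hist_data->"):
              # A new entry begins; only var_refs[] entries are collected.
              is_ref = re.match(r"hist_data->var_refs\[\d+\]:", line)
              current = {} if is_ref else None
              if current is not None:
                  refs.append(current)
          elif current is not None and ":" in line:
              key, value = line.rsplit(":", 1)
              if key == "name":
                  current["name"] = value.strip()
              elif key.startswith("var.idx"):
                  current["idx"] = int(value)
              elif key == "var.hist_data":
                  current["hist_data"] = value.strip()
      return [(r["name"], r["hist_data"], r["idx"]) for r in refs]


  refs = parse_var_refs(SAMPLE)
  own_hist_data = "0000000008f551b7"   # the sched_switch hist_data above
  remote = [name for name, hist_data, _ in refs if hist_data != own_hist_data]

With the excerpt above, remote contains only ts0, the one reference
that points into the sched_waking histogram.
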
1306The commands below can be used to clean things up for the next test::
1307
1308  # echo '!hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid,next_comm)' >> /sys/kernel/debug/tracing/events/sched/sched_switch/trigger
1309
1310  # echo '!hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1311
1312  # echo '!wakeup_latency u64 lat; pid_t pid; char comm[16]' >> synthetic_events
1313
action_data and the trace() action
----------------------------------

As mentioned above, when the trace() action generates a synthetic
event, all the parameters to the synthetic event either already are
variables or are converted into variables (via field variables).  The
values of all those variables are then collected, via references to
them, into a var_ref_vals[] array.

The values in the var_ref_vals[] array, however, don't necessarily
follow the same ordering as the synthetic event params.  To address
that, struct action_data contains another array, var_ref_idx[], that
maps the trace action params to the var_ref_vals[] values.  Below is a
diagram illustrating that for the wakeup_latency() synthetic event::
1328
  +------------------+     wakeup_latency()
  | action_data      |       event params               var_ref_vals[]
  +------------------+    +-----------------+        +-----------------+
    | .var_ref_idx[] |--->| $wakeup_lat idx |---+    |                 |
    +----------------+    +-----------------+   |    +-----------------+
    | .synth_event   |    | $next_pid idx   |---|-+  | $wakeup_lat val |
    +----------------+    +-----------------+   | |  +-----------------+
                                   .            | +->| $next_pid val   |
                                   .            |    +-----------------+
                                   .            |           .
                          +-----------------+   |           .
                          |                 |   |           .
                          +-----------------+   |    +-----------------+
                                                +--->| $wakeup_lat val |
                                                     +-----------------+
1344
In the synthetic event probe function, trace_event_raw_event_synth(),
this mapping ends up being used roughly as follows::
1347
1348  for each field i in .synth_event
1349    val_idx = .var_ref_idx[i]
1350    val = var_ref_vals[val_idx]
1351
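The same lookup can be written out as a small runnable model.  This is
an illustrative Python sketch, not the kernel implementation:
collect_synth_fields() is a hypothetical helper, the sample values are
made up, and the index values assume the var_refs[] ordering shown in
the earlier sched_switch hist_debug output, where slot 0 holds the ts0
reference.

.. code-block:: python

  def collect_synth_fields(field_names, var_ref_idx, var_ref_vals):
      """Pair each synthetic event field with the value its param maps to."""
      values = {}
      for i, name in enumerate(field_names):
          val_idx = var_ref_idx[i]              # .var_ref_idx[i]
          values[name] = var_ref_vals[val_idx]  # var_ref_vals[val_idx]
      return values


  # Slot 0 holds the ts0 reference, which is not an event param, so the
  # params $wakeup_lat, next_pid and next_comm map to slots 1, 2 and 3.
  var_ref_vals = [1000175, 75, 2783, "cyclictest"]  # made-up sample values
  var_ref_idx = [1, 2, 3]

  event = collect_synth_fields(["lat", "pid", "comm"],
                               var_ref_idx, var_ref_vals)

The extra level of indirection is what allows var_ref_vals[] to hold
references, such as ts0, that are not themselves params of the
synthetic event.
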
action_data and the onXXX() handlers
------------------------------------

The hist trigger onXXX() actions other than onmatch(), such as onmax()
and onchange(), also internally create and make use of hidden
variables.  Information about these variables is kept in the
action_data.track_data struct, and is also visible in the hist_debug
output, as described in the example below.

Typically, the onmax() or onchange() handlers are used in conjunction
with the save() and snapshot() actions.  For example::
1363
  # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmax($wakeup_lat).save(next_comm,prev_pid,prev_prio,prev_comm)' >> events/sched/sched_switch/trigger

or::

  # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmax($wakeup_lat).snapshot()' >> events/sched/sched_switch/trigger
1373
save() action field variable test
---------------------------------

For this example, instead of generating a synthetic event, the save()
action is used to save field values whenever an onmax() handler
detects that a new max latency has been hit.  As in the previous
example, the values being saved are also field values, but in this
case they are kept in a separate hist_data array named save_vars[].

As in previous test examples, we set up the sched_waking trigger::
1384
1385  # echo 'hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1386
In this case, however, we set up the sched_switch trigger to save some
sched_switch field values whenever we hit a new maximum latency.
Variables will be created for both the onmax() handler and the save()
action, and we can use the hist_debug files to examine them::
1391
1392  # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmax($wakeup_lat).save(next_comm,prev_pid,prev_prio,prev_comm)' >> events/sched/sched_switch/trigger
1393
1394The sched_waking hist_debug output shows the same data as in the
1395previous test examples::
1396
1397  # cat events/sched/sched_waking/hist_debug
1398
1399  #
1400  # trigger info: hist:keys=pid:vals=hitcount:ts0=common_timestamp.usecs:sort=hitcount:size=2048:clock=global [active]
1401  #
1402
1403  hist_data: 00000000e6290f48
1404
1405  n_vals: 2
1406  n_keys: 1
1407  n_fields: 3
1408
1409  val fields:
1410
1411    hist_data->fields[0]:
1412      flags:
1413        VAL: HIST_FIELD_FL_HITCOUNT
1414      type: u64
1415      size: 8
1416      is_signed: 0
1417
1418    hist_data->fields[1]:
1419      flags:
1420        HIST_FIELD_FL_VAR
1421      var.name: ts0
1422      var.idx (into tracing_map_elt.vars[]): 0
1423      type: u64
1424      size: 8
1425      is_signed: 0
1426
1427  key fields:
1428
1429    hist_data->fields[2]:
1430      flags:
1431        HIST_FIELD_FL_KEY
1432      ftrace_event_field name: pid
1433      type: pid_t
1434      size: 8
1435      is_signed: 1
1436
The output of the sched_switch trigger shows the same val and key
fields as before, but also shows a couple of new sections.

First, the action tracking variables section now shows the
actions[].track_data information describing the special tracking
variables and references used to track, in this case, the running
maximum value.  The actions[].track_data.var_ref member contains the
reference to the variable being tracked, in this case the $wakeup_lat
variable.  In order to perform the onmax() handler function, there
also needs to be a variable that tracks the current maximum by getting
updated whenever a new maximum is hit.  In this case, we can see that
an auto-generated variable named '__max' has been created and is
visible in the actions[].track_data.track_var member.

Finally, in the new 'save action variables' section, we can see that
the 4 params to the save() function have resulted in 4 field variables
being created for the purposes of saving the values of the named
fields when the max is hit.  These variables are kept in a separate
save_vars[] array off of hist_data, so are displayed in a separate
section::
1457
1458  # cat events/sched/sched_switch/hist_debug
1459
1460  # event histogram
1461  #
1462  # trigger info: hist:keys=next_pid:vals=hitcount:wakeup_lat=common_timestamp.usecs-$ts0:sort=hitcount:size=2048:clock=global:onmax($wakeup_lat).save(next_comm,prev_pid,prev_prio,prev_comm) [active]
1463  #
1464
1465  hist_data: 0000000057bcd28d
1466
1467  n_vals: 2
1468  n_keys: 1
1469  n_fields: 3
1470
1471  val fields:
1472
1473    hist_data->fields[0]:
1474      flags:
1475        VAL: HIST_FIELD_FL_HITCOUNT
1476      type: u64
1477      size: 8
1478      is_signed: 0
1479
1480    hist_data->fields[1]:
1481      flags:
1482        HIST_FIELD_FL_VAR
1483      var.name: wakeup_lat
1484      var.idx (into tracing_map_elt.vars[]): 0
1485      type: u64
1486      size: 0
1487      is_signed: 0
1488
1489  key fields:
1490
1491    hist_data->fields[2]:
1492      flags:
1493        HIST_FIELD_FL_KEY
1494      ftrace_event_field name: next_pid
1495      type: pid_t
1496      size: 8
1497      is_signed: 1
1498
1499  variable reference fields:
1500
1501    hist_data->var_refs[0]:
1502      flags:
1503        HIST_FIELD_FL_VAR_REF
1504      name: ts0
1505      var.idx (into tracing_map_elt.vars[]): 0
1506      var.hist_data: 00000000e6290f48
1507      var_ref_idx (into hist_data->var_refs[]): 0
1508      type: u64
1509      size: 8
1510      is_signed: 0
1511
1512    hist_data->var_refs[1]:
1513      flags:
1514        HIST_FIELD_FL_VAR_REF
1515      name: wakeup_lat
1516      var.idx (into tracing_map_elt.vars[]): 0
1517      var.hist_data: 0000000057bcd28d
1518      var_ref_idx (into hist_data->var_refs[]): 1
1519      type: u64
1520      size: 0
1521      is_signed: 0
1522
1523  action tracking variables (for onmax()/onchange()/onmatch()):
1524
1525    hist_data->actions[0].track_data.var_ref:
1526      flags:
1527        HIST_FIELD_FL_VAR_REF
1528      name: wakeup_lat
1529      var.idx (into tracing_map_elt.vars[]): 0
1530      var.hist_data: 0000000057bcd28d
1531      var_ref_idx (into hist_data->var_refs[]): 1
1532      type: u64
1533      size: 0
1534      is_signed: 0
1535
1536    hist_data->actions[0].track_data.track_var:
1537      flags:
1538        HIST_FIELD_FL_VAR
1539      var.name: __max
1540      var.idx (into tracing_map_elt.vars[]): 1
1541      type: u64
1542      size: 8
1543      is_signed: 0
1544
1545  save action variables (save() params):
1546
1547    hist_data->save_vars[0]:
1548
1549      save_vars[0].var:
1550      flags:
1551        HIST_FIELD_FL_VAR
1552      var.name: next_comm
1553      var.idx (into tracing_map_elt.vars[]): 2
1554
1555      save_vars[0].val:
1556      ftrace_event_field name: next_comm
1557      type: char[16]
1558      size: 256
1559      is_signed: 0
1560
1561    hist_data->save_vars[1]:
1562
1563      save_vars[1].var:
1564      flags:
1565        HIST_FIELD_FL_VAR
1566      var.name: prev_pid
1567      var.idx (into tracing_map_elt.vars[]): 3
1568
1569      save_vars[1].val:
1570      ftrace_event_field name: prev_pid
1571      type: pid_t
1572      size: 4
1573      is_signed: 1
1574
1575    hist_data->save_vars[2]:
1576
1577      save_vars[2].var:
1578      flags:
1579        HIST_FIELD_FL_VAR
1580      var.name: prev_prio
1581      var.idx (into tracing_map_elt.vars[]): 4
1582
1583      save_vars[2].val:
1584      ftrace_event_field name: prev_prio
1585      type: int
1586      size: 4
1587      is_signed: 1
1588
1589    hist_data->save_vars[3]:
1590
1591      save_vars[3].var:
1592      flags:
1593        HIST_FIELD_FL_VAR
1594      var.name: prev_comm
1595      var.idx (into tracing_map_elt.vars[]): 5
1596
1597      save_vars[3].val:
1598      ftrace_event_field name: prev_comm
1599      type: char[16]
1600      size: 256
1601      is_signed: 0
1602
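The way the tracking variable and the save variables work together can
be sketched as follows.  This is a toy Python model for a single
histogram entry, not kernel code: the OnMaxSave class is hypothetical,
the event values are made up, and the model assumes that the saved
values are only replaced when the tracked value is strictly greater
than the current maximum.

.. code-block:: python

  class OnMaxSave:
      """Toy model of onmax($var).save(fields...) for one histogram entry."""

      def __init__(self, save_fields):
          self.save_fields = list(save_fields)
          self.track_var = 0    # plays the role of the hidden __max variable
          self.save_vars = {}   # plays the role of hist_data->save_vars[]

      def update(self, tracked_val, event):
          """tracked_val plays the role of track_data.var_ref ($wakeup_lat)."""
          if tracked_val <= self.track_var:
              return False      # assumed: not a new maximum, nothing saved
          self.track_var = tracked_val
          self.save_vars = {name: event[name] for name in self.save_fields}
          return True


  entry = OnMaxSave(["next_comm", "prev_pid", "prev_prio", "prev_comm"])
  events = [
      (30, {"next_comm": "bash", "prev_pid": 0,
            "prev_prio": 120, "prev_comm": "swapper/0"}),
      (80, {"next_comm": "cyclictest", "prev_pid": 1200,
            "prev_prio": 120, "prev_comm": "kworker/1:2"}),
      (50, {"next_comm": "sshd", "prev_pid": 0,
            "prev_prio": 120, "prev_comm": "swapper/0"}),
  ]
  hits = [entry.update(lat, fields) for lat, fields in events]

After the three updates, the model's track_var holds 80 and save_vars
holds the field values from the event that produced that maximum.
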
1603The commands below can be used to clean things up for the next test::
1604
1605  # echo '!hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmax($wakeup_lat).save(next_comm,prev_pid,prev_prio,prev_comm)' >> events/sched/sched_switch/trigger
1606
1607  # echo '!hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1608
A couple special cases
======================

While the above covers the basics of the histogram internals, there
are a couple of special cases that should be discussed, since they
tend to create even more confusion.  Those are field variables on other
histograms, and aliases, both described below through example tests
using the hist_debug files.
1617
Test of field variables on other histograms
-------------------------------------------

This example is similar to the previous examples, but in this case,
the sched_switch trigger references a hist trigger field on another
event, namely the sched_waking event.  In order to accomplish this, a
field variable is created for the other event.  Since existing
histograms are immutable, the variable can't be added to the existing
sched_waking histogram; instead, a new histogram with a matching
variable is created and used, and we'll see that reflected in the
hist_debug output shown below.

First, we create the wakeup_latency synthetic event.  Note the
addition of the prio field::
1631
1632  # echo 'wakeup_latency u64 lat; pid_t pid; int prio' >> synthetic_events
1633
1634As in previous test examples, we set up the sched_waking trigger::
1635
1636  # echo 'hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1637
Here we set up a hist trigger on sched_switch to send a wakeup_latency
event using an onmatch() handler naming the sched_waking event.  Note
that the third param being passed to wakeup_latency() is prio, which
is a field name that needs to have a field variable created for it.
There isn't, however, any prio field on the sched_switch event, so it
would seem that it wouldn't be possible to create a field variable for
it.  The matching sched_waking event does have a prio field, so it
should be possible to make use of it for this purpose.  The problem
with that is that it's not currently possible to define a new variable
on an existing histogram, so it's not possible to add a new prio field
variable to the existing sched_waking histogram.  It is, however,
possible to create an additional new 'matching' sched_waking histogram
for the same event, meaning that it uses the same key and filters, and
define the new prio field variable on that.
1652
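The decision described above can be sketched as a small model.  This is
an illustrative Python sketch only: place_field_var() is a
hypothetical helper, the field sets are abbreviated lists assumed for
the example, and the generated trigger string only approximates the
trigger info shown in the hist_debug output below.

.. code-block:: python

  # Abbreviated field lists, assumed for illustration.
  SCHED_SWITCH_FIELDS = {"prev_comm", "prev_pid", "prev_prio",
                         "next_comm", "next_pid", "next_prio"}
  SCHED_WAKING_FIELDS = {"comm", "pid", "prio", "target_cpu"}


  def place_field_var(field, target_fields, match_fields, match_key):
      """Decide where the field variable for an action param would live.

      Returns (where, variable name, extra trigger or None).
      """
      if field in target_fields:
          # The field exists on the event the trigger is attached to, so
          # a normal field variable in field_vars[] is enough.
          return ("target", field, None)
      if field in match_fields:
          # The field only exists on the onmatch() event: model a new
          # 'matching' histogram holding a synthetic_<field> variable.
          var_name = "synthetic_" + field
          trigger = "hist:keys=%s:%s=%s" % (match_key, var_name, field)
          return ("match", var_name, trigger)
      raise ValueError("no such field on either event: " + field)


  prio = place_field_var("prio", SCHED_SWITCH_FIELDS,
                         SCHED_WAKING_FIELDS, "pid")
  next_pid = place_field_var("next_pid", SCHED_SWITCH_FIELDS,
                             SCHED_WAKING_FIELDS, "pid")

In this model next_pid stays on the sched_switch histogram, while prio
results in a separate sched_waking histogram keyed on pid.
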
1653Here's the sched_switch trigger::
1654
1655  # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid,prio)' >> events/sched/sched_switch/trigger
1656
And here's the output of the hist_debug information for the
sched_waking hist trigger.  Note that there are two histograms
displayed in the output: the first is the normal sched_waking
histogram we've seen in the previous examples, and the second is the
special histogram that was created to provide the prio field variable.

Looking at the second histogram below, we see a variable with the name
synthetic_prio.  This is the field variable created for the prio field
on that sched_waking histogram::
1666
1667  # cat events/sched/sched_waking/hist_debug
1668
1669  # event histogram
1670  #
1671  # trigger info: hist:keys=pid:vals=hitcount:ts0=common_timestamp.usecs:sort=hitcount:size=2048:clock=global [active]
1672  #
1673
1674  hist_data: 00000000349570e4
1675
1676  n_vals: 2
1677  n_keys: 1
1678  n_fields: 3
1679
1680  val fields:
1681
1682    hist_data->fields[0]:
1683      flags:
1684        VAL: HIST_FIELD_FL_HITCOUNT
1685      type: u64
1686      size: 8
1687      is_signed: 0
1688
1689    hist_data->fields[1]:
1690      flags:
1691        HIST_FIELD_FL_VAR
1692      var.name: ts0
1693      var.idx (into tracing_map_elt.vars[]): 0
1694      type: u64
1695      size: 8
1696      is_signed: 0
1697
1698  key fields:
1699
1700    hist_data->fields[2]:
1701      flags:
1702        HIST_FIELD_FL_KEY
1703      ftrace_event_field name: pid
1704      type: pid_t
1705      size: 8
1706      is_signed: 1
1707
1708
1709  # event histogram
1710  #
1711  # trigger info: hist:keys=pid:vals=hitcount:synthetic_prio=prio:sort=hitcount:size=2048 [active]
1712  #
1713
1714  hist_data: 000000006920cf38
1715
1716  n_vals: 2
1717  n_keys: 1
1718  n_fields: 3
1719
1720  val fields:
1721
1722    hist_data->fields[0]:
1723      flags:
1724        VAL: HIST_FIELD_FL_HITCOUNT
1725      type: u64
1726      size: 8
1727      is_signed: 0
1728
1729    hist_data->fields[1]:
1730      flags:
1731        HIST_FIELD_FL_VAR
1732      ftrace_event_field name: prio
1733      var.name: synthetic_prio
1734      var.idx (into tracing_map_elt.vars[]): 0
1735      type: int
1736      size: 4
1737      is_signed: 1
1738
1739  key fields:
1740
1741    hist_data->fields[2]:
1742      flags:
1743        HIST_FIELD_FL_KEY
1744      ftrace_event_field name: pid
1745      type: pid_t
1746      size: 8
1747      is_signed: 1
1748
Looking at the sched_switch histogram below, we can see a reference to
the synthetic_prio variable on sched_waking, and looking at the
associated hist_data address we see that it is indeed associated with
the new histogram.  Note also that the other references are to the ts0
variable on the original sched_waking histogram, to a normal variable,
wakeup_lat, and to a normal field variable, next_pid, the details of
which are in the field variables section::
1755
1756  # cat events/sched/sched_switch/hist_debug
1757
1758  # event histogram
1759  #
1760  # trigger info: hist:keys=next_pid:vals=hitcount:wakeup_lat=common_timestamp.usecs-$ts0:sort=hitcount:size=2048:clock=global:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid,prio) [active]
1761  #
1762
1763  hist_data: 00000000a73b67df
1764
1765  n_vals: 2
1766  n_keys: 1
1767  n_fields: 3
1768
1769  val fields:
1770
1771    hist_data->fields[0]:
1772      flags:
1773        VAL: HIST_FIELD_FL_HITCOUNT
1774      type: u64
1775      size: 8
1776      is_signed: 0
1777
1778    hist_data->fields[1]:
1779      flags:
1780        HIST_FIELD_FL_VAR
1781      var.name: wakeup_lat
1782      var.idx (into tracing_map_elt.vars[]): 0
1783      type: u64
1784      size: 0
1785      is_signed: 0
1786
1787  key fields:
1788
1789    hist_data->fields[2]:
1790      flags:
1791        HIST_FIELD_FL_KEY
1792      ftrace_event_field name: next_pid
1793      type: pid_t
1794      size: 8
1795      is_signed: 1
1796
1797  variable reference fields:
1798
1799    hist_data->var_refs[0]:
1800      flags:
1801        HIST_FIELD_FL_VAR_REF
1802      name: ts0
1803      var.idx (into tracing_map_elt.vars[]): 0
1804      var.hist_data: 00000000349570e4
1805      var_ref_idx (into hist_data->var_refs[]): 0
1806      type: u64
1807      size: 8
1808      is_signed: 0
1809
1810    hist_data->var_refs[1]:
1811      flags:
1812        HIST_FIELD_FL_VAR_REF
1813      name: wakeup_lat
1814      var.idx (into tracing_map_elt.vars[]): 0
1815      var.hist_data: 00000000a73b67df
1816      var_ref_idx (into hist_data->var_refs[]): 1
1817      type: u64
1818      size: 0
1819      is_signed: 0
1820
1821    hist_data->var_refs[2]:
1822      flags:
1823        HIST_FIELD_FL_VAR_REF
1824      name: next_pid
1825      var.idx (into tracing_map_elt.vars[]): 1
1826      var.hist_data: 00000000a73b67df
1827      var_ref_idx (into hist_data->var_refs[]): 2
1828      type: pid_t
1829      size: 4
1830      is_signed: 0
1831
1832    hist_data->var_refs[3]:
1833      flags:
1834        HIST_FIELD_FL_VAR_REF
1835      name: synthetic_prio
1836      var.idx (into tracing_map_elt.vars[]): 0
1837      var.hist_data: 000000006920cf38
1838      var_ref_idx (into hist_data->var_refs[]): 3
1839      type: int
1840      size: 4
1841      is_signed: 1
1842
1843  field variables:
1844
1845    hist_data->field_vars[0]:
1846
1847      field_vars[0].var:
1848      flags:
1849        HIST_FIELD_FL_VAR
1850      var.name: next_pid
1851      var.idx (into tracing_map_elt.vars[]): 1
1852
1853      field_vars[0].val:
1854      ftrace_event_field name: next_pid
1855      type: pid_t
1856      size: 4
1857      is_signed: 1
1858
1859  action tracking variables (for onmax()/onchange()/onmatch()):
1860
1861    hist_data->actions[0].match_data.event_system: sched
1862    hist_data->actions[0].match_data.event: sched_waking
1863
1864The commands below can be used to clean things up for the next test::
1865
1866  # echo '!hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid,prio)' >> events/sched/sched_switch/trigger
1867
1868  # echo '!hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1869
1870  # echo '!wakeup_latency u64 lat; pid_t pid; int prio' >> synthetic_events
1871
Alias test
----------

This example is very similar to previous examples, but demonstrates
the alias flag.

First, we create the wakeup_latency synthetic event::
1879
1880  # echo 'wakeup_latency u64 lat; pid_t pid; char comm[16]' >> synthetic_events
1881
1882Next, we create a sched_waking trigger similar to previous examples,
1883but in this case we save the pid in the waking_pid variable::
1884
1885  # echo 'hist:keys=pid:waking_pid=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1886
1887For the sched_switch trigger, instead of using $waking_pid directly in
1888the wakeup_latency synthetic event invocation, we create an alias of
1889$waking_pid named $woken_pid, and use that in the synthetic event
1890invocation instead::
1891
1892  # echo 'hist:keys=next_pid:woken_pid=$waking_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,$woken_pid,next_comm)' >> events/sched/sched_switch/trigger
1893
1894Looking at the sched_waking hist_debug output, in addition to the
1895normal fields, we can see the waking_pid variable::
1896
1897  # cat events/sched/sched_waking/hist_debug
1898
1899  # event histogram
1900  #
1901  # trigger info: hist:keys=pid:vals=hitcount:waking_pid=pid,ts0=common_timestamp.usecs:sort=hitcount:size=2048:clock=global [active]
1902  #
1903
1904  hist_data: 00000000a250528c
1905
1906  n_vals: 3
1907  n_keys: 1
1908  n_fields: 4
1909
1910  val fields:
1911
1912    hist_data->fields[0]:
1913      flags:
1914        VAL: HIST_FIELD_FL_HITCOUNT
1915      type: u64
1916      size: 8
1917      is_signed: 0
1918
1919    hist_data->fields[1]:
1920      flags:
1921        HIST_FIELD_FL_VAR
1922      ftrace_event_field name: pid
1923      var.name: waking_pid
1924      var.idx (into tracing_map_elt.vars[]): 0
1925      type: pid_t
1926      size: 4
1927      is_signed: 1
1928
1929    hist_data->fields[2]:
1930      flags:
1931        HIST_FIELD_FL_VAR
1932      var.name: ts0
1933      var.idx (into tracing_map_elt.vars[]): 1
1934      type: u64
1935      size: 8
1936      is_signed: 0
1937
1938  key fields:
1939
1940    hist_data->fields[3]:
1941      flags:
1942        HIST_FIELD_FL_KEY
1943      ftrace_event_field name: pid
1944      type: pid_t
1945      size: 8
1946      is_signed: 1
1947
The sched_switch hist_debug output shows that a variable named
woken_pid has been created but that it also has the
HIST_FIELD_FL_ALIAS flag set.  It also has the HIST_FIELD_FL_VAR flag
set, which is why it appears in the val fields section.

Despite that implementation detail, an alias variable is actually more
like a variable reference; in fact it can be thought of as a reference
to a reference.  The implementation copies the var_ref->fn() from the
variable reference being referenced, in this case the waking_pid
fn(), which is hist_field_var_ref(), and makes that the fn() of the
alias.  The hist_field_var_ref() fn() requires the var_ref_idx of the
variable reference it's using, so waking_pid's var_ref_idx is also
copied to the alias.  The end result is that when the value of the
alias is retrieved, it just does the same thing the original reference
would have done and retrieves the same value from the var_ref_vals[]
array.  You can verify this in the output by noting that the
var_ref_idx of the alias, in this case woken_pid, is the same as the
var_ref_idx of the reference, waking_pid, in the variable reference
fields section.

Additionally, once it gets that value, since it is also a variable, it
then saves that value into its var.idx slot.  So the var.idx of the
woken_pid alias is 0, which it fills with the value from var_ref_idx 0
when its fn() is called to update itself.  You'll also notice that
there's a woken_pid var_ref in the variable reference fields section.
That is the reference to the woken_pid alias variable, and you can see
that it retrieves the value from the same var.idx as the woken_pid
alias, 0, and then in turn saves that value in its own var_ref_idx
slot, 3.  The value at this position is finally what gets assigned to
the $woken_pid slot in the trace event invocation::
1978
1979  # cat events/sched/sched_switch/hist_debug
1980
1981  # event histogram
1982  #
1983  # trigger info: hist:keys=next_pid:vals=hitcount:woken_pid=$waking_pid,wakeup_lat=common_timestamp.usecs-$ts0:sort=hitcount:size=2048:clock=global:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,$woken_pid,next_comm) [active]
1984  #
1985
1986  hist_data: 0000000055d65ed0
1987
1988  n_vals: 3
1989  n_keys: 1
1990  n_fields: 4
1991
1992  val fields:
1993
1994    hist_data->fields[0]:
1995      flags:
1996        VAL: HIST_FIELD_FL_HITCOUNT
1997      type: u64
1998      size: 8
1999      is_signed: 0
2000
2001    hist_data->fields[1]:
2002      flags:
2003        HIST_FIELD_FL_VAR
2004        HIST_FIELD_FL_ALIAS
2005      var.name: woken_pid
2006      var.idx (into tracing_map_elt.vars[]): 0
2007      var_ref_idx (into hist_data->var_refs[]): 0
2008      type: pid_t
2009      size: 4
2010      is_signed: 1
2011
2012    hist_data->fields[2]:
2013      flags:
2014        HIST_FIELD_FL_VAR
2015      var.name: wakeup_lat
2016      var.idx (into tracing_map_elt.vars[]): 1
2017      type: u64
2018      size: 0
2019      is_signed: 0
2020
2021  key fields:
2022
2023    hist_data->fields[3]:
2024      flags:
2025        HIST_FIELD_FL_KEY
2026      ftrace_event_field name: next_pid
2027      type: pid_t
2028      size: 8
2029      is_signed: 1
2030
2031  variable reference fields:
2032
2033    hist_data->var_refs[0]:
2034      flags:
2035        HIST_FIELD_FL_VAR_REF
2036      name: waking_pid
2037      var.idx (into tracing_map_elt.vars[]): 0
2038      var.hist_data: 00000000a250528c
2039      var_ref_idx (into hist_data->var_refs[]): 0
2040      type: pid_t
2041      size: 4
2042      is_signed: 1
2043
2044    hist_data->var_refs[1]:
2045      flags:
2046        HIST_FIELD_FL_VAR_REF
2047      name: ts0
2048      var.idx (into tracing_map_elt.vars[]): 1
2049      var.hist_data: 00000000a250528c
2050      var_ref_idx (into hist_data->var_refs[]): 1
2051      type: u64
2052      size: 8
2053      is_signed: 0
2054
2055    hist_data->var_refs[2]:
2056      flags:
2057        HIST_FIELD_FL_VAR_REF
2058      name: wakeup_lat
2059      var.idx (into tracing_map_elt.vars[]): 1
2060      var.hist_data: 0000000055d65ed0
2061      var_ref_idx (into hist_data->var_refs[]): 2
2062      type: u64
2063      size: 0
2064      is_signed: 0
2065
2066    hist_data->var_refs[3]:
2067      flags:
2068        HIST_FIELD_FL_VAR_REF
2069      name: woken_pid
2070      var.idx (into tracing_map_elt.vars[]): 0
2071      var.hist_data: 0000000055d65ed0
2072      var_ref_idx (into hist_data->var_refs[]): 3
2073      type: pid_t
2074      size: 4
2075      is_signed: 1
2076
2077    hist_data->var_refs[4]:
2078      flags:
2079        HIST_FIELD_FL_VAR_REF
2080      name: next_comm
2081      var.idx (into tracing_map_elt.vars[]): 2
2082      var.hist_data: 0000000055d65ed0
2083      var_ref_idx (into hist_data->var_refs[]): 4
2084      type: char[16]
2085      size: 256
2086      is_signed: 0
2087
2088  field variables:
2089
2090    hist_data->field_vars[0]:
2091
2092      field_vars[0].var:
2093      flags:
2094        HIST_FIELD_FL_VAR
2095      var.name: next_comm
2096      var.idx (into tracing_map_elt.vars[]): 2
2097
2098      field_vars[0].val:
2099      ftrace_event_field name: next_comm
2100      type: char[16]
2101      size: 256
2102      is_signed: 0
2103
2104  action tracking variables (for onmax()/onchange()/onmatch()):
2105
2106    hist_data->actions[0].match_data.event_system: sched
2107    hist_data->actions[0].match_data.event: sched_waking
2108
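The chain of copies described above can be traced with a small model.
This is an illustrative Python sketch, not kernel code:
run_alias_example() is a hypothetical function, the sample values are
made up, and the slot numbers are taken from the var.idx and
var_ref_idx values in the hist_debug output above.  The ordering of
the steps follows the description in the text rather than the exact
kernel call sequence.

.. code-block:: python

  def run_alias_example(waking_vars, now_usecs, next_comm):
      """Return the values passed to wakeup_latency() in the alias test."""
      var_ref_vals = [None] * 5   # one slot per hist_data->var_refs[] entry
      switch_vars = [None] * 3    # sched_switch tracing_map_elt.vars[]

      # References into the sched_waking histogram.
      var_ref_vals[0] = waking_vars[0]    # waking_pid, var.idx 0
      var_ref_vals[1] = waking_vars[1]    # ts0, var.idx 1

      # The alias uses the copied var_ref_idx (0) to fetch the value, then
      # stores it in its own var.idx slot (0) because it is also a variable.
      alias_var_ref_idx = 0
      switch_vars[0] = var_ref_vals[alias_var_ref_idx]   # woken_pid
      switch_vars[1] = now_usecs - var_ref_vals[1]       # wakeup_lat
      switch_vars[2] = next_comm                         # field variable

      # References to the sched_switch histogram's own variables.
      var_ref_vals[2] = switch_vars[1]    # wakeup_lat
      var_ref_vals[3] = switch_vars[0]    # woken_pid, via the alias
      var_ref_vals[4] = switch_vars[2]    # next_comm

      # wakeup_latency($wakeup_lat,$woken_pid,next_comm)
      return [var_ref_vals[i] for i in (2, 3, 4)]


  params = run_alias_example(waking_vars=[2783, 1000175],
                             now_usecs=1000250, next_comm="cyclictest")

The second element of params is the pid that was originally saved in
waking_pid, which reaches the synthetic event through slot 3.
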
2109The commands below can be used to clean things up for the next test::
2110
2111  # echo '!hist:keys=next_pid:woken_pid=$waking_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,$woken_pid,next_comm)' >> events/sched/sched_switch/trigger
2112
  # echo '!hist:keys=pid:waking_pid=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
2114
2115  # echo '!wakeup_latency u64 lat; pid_t pid; char comm[16]' >> synthetic_events
2116