==========================
BFQ (Budget Fair Queueing)
==========================

BFQ is a proportional-share I/O scheduler, with some extra
low-latency capabilities. In addition to cgroups support (blkio or io
controllers), BFQ's main features are:

- BFQ guarantees a high system and application responsiveness, and a
  low latency for time-sensitive applications, such as audio or video
  players;
- BFQ distributes bandwidth, and not just time, among processes or
  groups (switching back to time distribution when needed to keep
  throughput high).

In its default configuration, BFQ privileges latency over
throughput. So, when needed for achieving a lower latency, BFQ builds
schedules that may lead to a lower throughput. If your main or only
goal, for a given device, is to achieve the maximum-possible
throughput at all times, then do switch off all low-latency heuristics
for that device, by setting low_latency to 0. See Section 3 for
details on how to configure BFQ for the desired tradeoff between
latency and throughput, or on how to maximize throughput.

Like every I/O scheduler, BFQ adds some overhead to per-I/O-request
processing. To give an idea of this overhead, the total,
single-lock-protected, per-request processing time of BFQ---i.e., the
sum of the execution times of the request insertion, dispatch and
completion hooks---is, e.g., 1.9 us on an Intel Core i7-2760QM@2.40GHz
(dated CPU for notebooks; time measured with simple code
instrumentation, and using the throughput-sync.sh script of the S
suite [3], in performance-profiling mode). To put this result into
context, the total, single-lock-protected, per-request execution time
of the lightest I/O scheduler available in blk-mq, mq-deadline, is 0.7
us (mq-deadline is ~800 LOC, against ~10500 LOC for BFQ).

Scheduling overhead further limits the maximum IOPS that a CPU can
process (already limited by the execution of the rest of the I/O
stack). To give an idea of the limits with BFQ on slow or average
CPUs, here are, first, the maximum IOPS sustainable with BFQ on three
different CPUs, found in, respectively, an average laptop, an old
desktop, and a cheap embedded system, when full hierarchical support
is enabled (i.e., CONFIG_BFQ_GROUP_IOSCHED is set) but
CONFIG_BFQ_CGROUP_DEBUG is not (Section 4-2):

- Intel i7-4850HQ: 400 KIOPS
- AMD A8-3850: 250 KIOPS
- ARM Cortex-A53 Octa-core: 80 KIOPS

If CONFIG_BFQ_CGROUP_DEBUG is set (and of course full hierarchical
support is enabled), then the sustainable throughput with BFQ
decreases, because all blkio.bfq* statistics are created and updated
(Section 4-2). For BFQ, this leads to the following maximum
sustainable throughputs, on the same systems as above:

- Intel i7-4850HQ: 310 KIOPS
- AMD A8-3850: 200 KIOPS
- ARM Cortex-A53 Octa-core: 56 KIOPS

BFQ works for multi-queue devices too.

.. The table of contents follows. Impatient readers can jump directly to Section 3.

.. CONTENTS

   1. When may BFQ be useful?
    1-1 Personal systems
    1-2 Server systems
   2. How does BFQ work?
   3. What are BFQ's tunables and how to properly configure BFQ?
   4. BFQ group scheduling
    4-1 Service guarantees provided
    4-2 Interface

1. When may BFQ be useful?
==========================

BFQ provides the following benefits on personal and server systems.

1-1 Personal systems
--------------------

Low latency for interactive applications
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Regardless of the actual background workload, BFQ guarantees that, for
interactive tasks, the storage device is virtually as responsive as if
it were idle. For example, even if one or more of the following
background workloads are being executed:

- one or more large files are being read, written or copied,
- a tree of source files is being compiled,
- one or more virtual machines are performing I/O,
- a software update is in progress,
- indexing daemons are scanning filesystems and updating their
  databases,

starting an application or loading a file from within an application
takes about the same time as if the storage device were idle. As a
comparison, with CFQ, NOOP or DEADLINE, and in the same conditions,
applications experience high latencies, or even become unresponsive
until the background workload terminates (also on SSDs).

Low latency for soft real-time applications
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Soft real-time applications, such as audio and video
players/streamers, also enjoy a low latency and a low drop rate,
regardless of the background I/O workload. As a consequence, these
applications experience almost no glitches due to the background
workload.

Higher speed for code-development tasks
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

If some additional workload happens to be executed in parallel, then
BFQ executes the I/O-related components of typical code-development
tasks (compilation, checkout, merge, ...) much more quickly than CFQ,
NOOP or DEADLINE.

High throughput
^^^^^^^^^^^^^^^

On hard disks, BFQ achieves up to 30% higher throughput than CFQ, and
up to 150% higher throughput than DEADLINE and NOOP, with all the
sequential workloads considered in our tests. With random workloads,
and with all the workloads on flash-based devices, BFQ achieves,
instead, about the same throughput as the other schedulers.

Strong fairness, bandwidth and delay guarantees
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

BFQ distributes the device throughput, and not just the device time,
among I/O-bound applications in proportion to their weights, with any
workload and regardless of the device parameters. From these bandwidth
guarantees, it is possible to compute tight per-I/O-request delay
guarantees by a simple formula. If not configured for strict service
guarantees, BFQ switches to time-based resource sharing (only) for
applications that would otherwise cause a throughput loss.
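
As a concrete illustration of the kind of calculation meant here, the
sketch below derives a rough, back-of-the-envelope delay estimate from
the bandwidth guarantee alone. It is not the exact bound proved for
B-WF2Q+ in [2], and all names and figures in it are invented for the
example::

  # Rough delay estimate that follows from BFQ's bandwidth guarantee.
  # Not the exact B-WF2Q+ bound from [2]; the figures below are made up.

  def worst_case_delay(req_bytes, backlog_bytes, weight, total_weight,
                       device_throughput_bps):
      """A queue is guaranteed at least weight/total_weight of the device
      throughput, so a request completes roughly once that share has
      drained the backlog ahead of it plus the request itself."""
      guaranteed_bps = device_throughput_bps * weight / total_weight
      return (backlog_bytes + req_bytes) / guaranteed_bps

  # Example: a 1 MiB read behind 4 MiB already queued by the same
  # application, weight 100 out of a total active weight of 400, on a
  # device sustaining 500 MB/s.
  delay_s = worst_case_delay(1 << 20, 4 << 20, 100, 400, 500e6)
  print(f"approximate worst-case delay: {delay_s * 1000:.1f} ms")  # ~41.9 ms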

1-2 Server systems
------------------

Most benefits for server systems follow from the same service
properties as above. In particular, regardless of whether additional,
possibly heavy workloads are being served, BFQ guarantees:

* audio and video-streaming with zero or very low jitter and drop
  rate;

* fast retrieval of WEB pages and embedded objects;

* real-time recording of data in live-dumping applications (e.g.,
  packet logging);

* responsiveness in local and remote access to a server.


2. How does BFQ work?
=====================

BFQ is a proportional-share I/O scheduler, whose general structure,
plus a lot of code, is borrowed from CFQ.

- Each process doing I/O on a device is associated with a weight and a
  `(bfq_)queue`.

- BFQ grants exclusive access to the device, for a while, to one queue
  (process) at a time, and implements this service model by
  associating every queue with a budget, measured in number of
  sectors (an illustrative toy sketch of this service model is given
  at the end of this section).

  - After a queue is granted access to the device, the budget of the
    queue is decremented, on each request dispatch, by the size of the
    request.

  - The in-service queue is expired, i.e., its service is suspended,
    only if one of the following events occurs: 1) the queue finishes
    its budget, 2) the queue empties, 3) a "budget timeout" fires.

    - The budget timeout prevents processes doing random I/O from
      holding the device for too long and dramatically reducing
      throughput.

    - Actually, as in CFQ, a queue associated with a process issuing
      sync requests may not be expired immediately when it empties.
      Instead, BFQ may idle the device for a short time interval,
      giving the process the chance to go on being served if it issues
      a new request in time. Device idling typically boosts the
      throughput on rotational devices and on non-queueing flash-based
      devices, if processes do synchronous and sequential I/O. In
      addition, under BFQ, device idling is also instrumental in
      guaranteeing the desired throughput fraction to processes
      issuing sync requests (see the description of the slice_idle
      tunable in this document, or [1, 2], for more details).

      - With respect to idling for service guarantees, if several
        processes are competing for the device at the same time, but
        all processes and groups have the same weight, then BFQ
        guarantees the expected throughput distribution without ever
        idling the device. Throughput is thus as high as possible in
        this common scenario.

      - On flash-based storage with internal queueing of commands
        (typically NCQ), device idling happens to be always detrimental
        for throughput. So, with these devices, BFQ performs idling
        only when strictly needed for service guarantees, i.e., for
        guaranteeing low latency or fairness. In these cases, overall
        throughput may be sub-optimal. No solution currently exists to
        provide both strong service guarantees and optimal throughput
        on devices with internal queueing.

  - If low-latency mode is enabled (default configuration), BFQ
    executes some special heuristics to detect interactive and soft
    real-time applications (e.g., video or audio players/streamers),
    and to reduce their latency. The most important action taken to
    achieve this goal is to give to the queues associated with these
    applications more than their fair share of the device
    throughput. For brevity, we simply call "weight-raising" the whole
    set of actions taken by BFQ to privilege these queues. In
    particular, BFQ provides a milder form of weight-raising for
    interactive applications, and a stronger form for soft real-time
    applications.

  - BFQ automatically deactivates idling for queues born in a burst of
    queue creations. In fact, these queues are usually associated with
    the processes of applications and services that benefit mostly
    from a high throughput. Examples are systemd during boot, or git
    grep.

  - Like CFQ, BFQ merges queues performing interleaved I/O, i.e.,
    performing random I/O that becomes mostly sequential if
    merged. Unlike CFQ, BFQ achieves this goal with a more
    reactive mechanism, called Early Queue Merge (EQM). EQM is so
    responsive in detecting interleaved I/O (cooperating processes),
    that it enables BFQ to achieve a high throughput, by queue
    merging, even for queues for which CFQ needs a different
    mechanism, preemption, to get a high throughput. As such, EQM is a
    unified mechanism to achieve a high throughput with interleaved
    I/O.

  - Queues are scheduled according to a variant of WF2Q+, named
    B-WF2Q+, and implemented using an augmented rb-tree to preserve an
    O(log N) overall complexity. See [2] for more details. B-WF2Q+ is
    also ready for hierarchical scheduling; see Section 4 for details.

  - B-WF2Q+ guarantees a tight deviation with respect to an ideal,
    perfectly fair, and smooth service. In particular, B-WF2Q+
    guarantees that each queue receives a fraction of the device
    throughput proportional to its weight, even if the throughput
    fluctuates, and regardless of the device parameters, the current
    workload and the budgets assigned to the queue.

  - The last property, budget independence (although probably
    counterintuitive at first), is definitely beneficial, for the
    following reasons:

    - First, with any proportional-share scheduler, the maximum
      deviation with respect to an ideal service is proportional to
      the maximum budget (slice) assigned to queues. As a consequence,
      BFQ can keep this deviation tight not only because of the
      accurate service of B-WF2Q+, but also because BFQ *does not*
      need to assign a larger budget to a queue to let the queue
      receive a higher fraction of the device throughput.

    - Second, BFQ is free to choose, for every process (queue), the
      budget that best fits the needs of the process, or best
      leverages the I/O pattern of the process. In particular, BFQ
      updates queue budgets with a simple feedback-loop algorithm that
      allows a high throughput to be achieved, while still providing
      tight latency guarantees to time-sensitive applications. When
      the in-service queue expires, this algorithm computes the next
      budget of the queue so as to:

      - Let large budgets be eventually assigned to the queues
        associated with I/O-bound applications performing sequential
        I/O: in fact, the longer these applications are served once
        they get access to the device, the higher the throughput is.

      - Let small budgets be eventually assigned to the queues
        associated with time-sensitive applications (which typically
        perform sporadic and short I/O), because the smaller the
        budget assigned to a queue waiting for service is, the sooner
        B-WF2Q+ will serve that queue (Subsec 3.3 in [2]).

- If several processes are competing for the device at the same time,
  but all processes and groups have the same weight, then BFQ
  guarantees the expected throughput distribution without ever idling
  the device. It uses preemption instead. Throughput is then much
  higher in this common scenario.

- ioprio classes are served in strict priority order, i.e.,
  lower-priority queues are not served as long as there are
  higher-priority queues. Among queues in the same class, the
  bandwidth is distributed in proportion to the weight of each
  queue. A very thin extra bandwidth is however guaranteed to
  the Idle class, to prevent it from starving.
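
To tie the pieces above together, here is a deliberately simplified,
self-contained sketch of the service model described in this section:
per-queue weights and budgets, expiration when the budget is exhausted
or the queue empties, and selection of the next queue by weighted
virtual finish times in the spirit of B-WF2Q+. It is a toy model for
illustration only, not BFQ's actual code (which also handles idling,
budget timeouts, weight-raising, queue merging, and much more); all
names and numbers are invented::

  # Toy illustration of BFQ's budget-based, proportional-share service model.
  # Not the real implementation: no idling, no budget timeout, no
  # weight-raising, no queue merging.

  class ToyQueue:
      def __init__(self, name, weight, requests):
          self.name = name                # owning process
          self.weight = weight            # share of the device throughput
          self.requests = list(requests)  # pending request sizes, in sectors
          self.budget = 64                # sectors allowed per service slot
          self.vfinish = 0.0              # virtual finish time (B-WF2Q+-like)

  def serve(queues, rounds):
      """Grant the device to one queue at a time; decrement its budget per
      dispatch and expire it when the budget is exhausted or it empties."""
      for _ in range(rounds):
          backlogged = [q for q in queues if q.requests]
          if not backlogged:
              break
          # Pick the queue with the smallest virtual finish time.
          q = min(backlogged, key=lambda x: x.vfinish)
          budget, served = q.budget, 0
          while q.requests and q.requests[0] <= budget:
              size = q.requests.pop(0)
              budget -= size
              served += size
          # Charge the service to the queue's virtual time, scaled by
          # 1/weight, so higher-weight queues are selected again sooner.
          q.vfinish += served / q.weight
          print(f"served {served:3d} sectors of {q.name}")

  queues = [
      ToyQueue("seq-reader", weight=100, requests=[64] * 8),
      ToyQueue("player",     weight=300, requests=[8] * 64),
  ]
  serve(queues, rounds=12)

Running the sketch shows the higher-weight "player" queue being granted
roughly three service slots for each slot of "seq-reader" while both
are backlogged, i.e., bandwidth in proportion to the weights.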


3. What are BFQ's tunables and how to properly configure BFQ?
=============================================================

Most BFQ tunables affect service guarantees (basically latency and
fairness) and throughput. For full details on how to choose the
desired tradeoff between service guarantees and throughput, see the
parameters slice_idle, strict_guarantees and low_latency. For details
on how to maximize throughput, see slice_idle, timeout_sync and
max_budget. The other performance-related parameters have been
inherited from, and have been preserved mostly for compatibility
with, CFQ. So far, no performance improvement has been reported after
changing the latter parameters in BFQ.

In particular, the tunables back_seek_max, back_seek_penalty,
fifo_expire_async and fifo_expire_sync below are the same as in
CFQ. Their description is just copied from that for CFQ. Some
considerations in the description of slice_idle are copied from CFQ
too.

per-process ioprio and weight
-----------------------------

Unless the cgroups interface is used (see "4. BFQ group scheduling"),
weights can be assigned to processes only indirectly, through I/O
priorities, and according to the relation:
weight = (IOPRIO_BE_NR - ioprio) * 10.
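
As a concrete illustration of this mapping (assuming the usual
IOPRIO_BE_NR value of 8, i.e., best-effort priorities 0..7), the
snippet below just tabulates the arithmetic of the relation above; the
values are not read back from the kernel::

  # ioprio -> weight mapping used when weights are set only through I/O
  # priorities. Assumes IOPRIO_BE_NR == 8 (best-effort priorities 0..7).
  IOPRIO_BE_NR = 8

  for ioprio in range(IOPRIO_BE_NR):
      weight = (IOPRIO_BE_NR - ioprio) * 10
      print(f"ioprio {ioprio} -> weight {weight}")
  # ioprio 0 (highest priority) -> weight 80, ..., ioprio 7 -> weight 10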

Beware that, if low-latency is set, then BFQ automatically raises the
weight of the queues associated with interactive and soft real-time
applications. Unset this tunable if you need/want to control weights.

slice_idle
----------

This parameter specifies how long BFQ should idle for the next I/O
request, when certain sync BFQ queues become empty. By default,
slice_idle is a non-zero value. Idling has a double purpose: boosting
throughput and making sure that the desired throughput distribution is
respected (see the description of how BFQ works, and, if needed, the
papers referred to there).

As for throughput, idling can be very helpful on highly seeky media
like single-spindle SATA/SAS disks, where it cuts down the overall
number of seeks and thereby improves throughput.

Setting slice_idle to 0 removes all idling on queues, and should yield
an overall throughput improvement on faster storage devices, such as
multiple SATA/SAS disks in a hardware RAID configuration, as well as
flash-based storage with internal command queueing (and parallelism).

So, depending on storage and workload, it might be useful to set
slice_idle=0. In general, keeping slice_idle enabled should be useful
for SATA/SAS disks and for software RAID of SATA/SAS disks. For
configurations with multiple spindles behind a single LUN (host-based
hardware RAID controllers or storage arrays), or with fast flash-based
storage, setting slice_idle=0 might result in better throughput and
acceptable latencies.

Idling is however necessary to have service guarantees enforced in
case of differentiated weights or differentiated I/O-request lengths.
To see why, suppose that a given BFQ queue A must get several I/O
requests served for each request served for another queue B. Idling
ensures that, if A makes a new I/O request slightly after becoming
empty, then no request of B is dispatched in the middle, and thus A
does not lose the possibility to get more than one request dispatched
before the next request of B is dispatched. Note that idling
guarantees the desired differentiated treatment of queues only in
terms of I/O-request dispatches. To guarantee that the actual service
order then corresponds to the dispatch order, the strict_guarantees
tunable must be set too.

There is an important flip side to idling: apart from the above cases,
where it is beneficial also for throughput, idling can severely impact
throughput. One important case is random workloads. Because of this
issue, BFQ tends to avoid idling as much as possible, when it is not
beneficial also for throughput (as detailed in Section 2). As a
consequence of this behavior, and of further issues described for the
strict_guarantees tunable, short-term service guarantees may be
occasionally violated. And, in some cases, these guarantees may be
more important than guaranteeing maximum throughput. For example, in
video playing/streaming, a very low drop rate may be more important
than maximum throughput. In these cases, consider setting the
strict_guarantees parameter.

slice_idle_us
-------------

Controls the same tuning parameter as slice_idle, but in microseconds.
Either tunable can be used to set idling behavior. Afterwards, the
other tunable will reflect the newly set value in sysfs.
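
For completeness, here is one way to inspect and change these tunables
from user space. The per-device BFQ tunables live under the iosched
directory of the request queue in sysfs; the sketch below needs root
privileges, uses "sda" purely as an example device name, and assumes
that bfq is already the active scheduler for that device::

  # Minimal sketch: inspect and set BFQ tunables through sysfs.
  from pathlib import Path

  queue = Path("/sys/block/sda/queue")          # "sda" is just an example

  # The active scheduler is shown in brackets, e.g. "mq-deadline [bfq] none".
  print("scheduler:", (queue / "scheduler").read_text().strip())

  iosched = queue / "iosched"
  for name in ("slice_idle", "slice_idle_us", "low_latency",
               "strict_guarantees"):
      print(name, "=", (iosched / name).read_text().strip())

  # Example: disable idling to favor raw throughput on a fast device with
  # internal queueing (see the discussion of slice_idle above).
  (iosched / "slice_idle").write_text("0\n")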

strict_guarantees
-----------------

If this parameter is set (default: unset), then BFQ

- always performs idling when the in-service queue becomes empty;

- forces the device to serve one I/O request at a time, by dispatching a
  new request only if there is no outstanding request.

In the presence of differentiated weights or I/O-request sizes, both
the above conditions are needed to guarantee that every BFQ queue
receives its allotted share of the bandwidth. The first condition is
needed for the reasons explained in the description of the slice_idle
tunable. The second condition is needed because all modern storage
devices reorder internally-queued requests, which may trivially break
the service guarantees enforced by the I/O scheduler.

Setting strict_guarantees may evidently affect throughput.

back_seek_max
-------------

This specifies, in Kbytes, the maximum "distance" for backward seeking.
The distance is the amount of space between the current head position
and the sectors that lie behind it.

This parameter allows the scheduler to anticipate requests in the
"backward" direction and consider them as being the "next" if they are
within this distance from the current head position.

back_seek_penalty
-----------------

This parameter is used to compute the cost of backward seeking. If the
backward distance of a request is just 1/back_seek_penalty of the
distance of a "front" request, then the seek cost of the two requests
is considered equivalent.

So the scheduler will not bias toward one or the other request
(otherwise, it would bias toward the front request). The default value
of back_seek_penalty is 2.
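
To make the interplay between back_seek_max and back_seek_penalty
concrete, here is a small, illustrative sketch of how a backward
candidate could be compared with a forward one; it is not BFQ's actual
request-selection code, and the default used for back_seek_max below
is just an assumption for the example::

  # Illustrative-only comparison of a forward and a backward candidate.
  # Positions and distances are in 512-byte sectors; back_seek_max is in KB.

  def pick_next(head, forward_pos, backward_pos,
                back_seek_max_kb=16 * 1024, back_seek_penalty=2):
      fwd_dist = forward_pos - head             # sectors ahead of the head
      back_dist = head - backward_pos           # sectors behind the head
      if back_dist > back_seek_max_kb * 2:      # 1 KB = 2 sectors
          return "forward"                      # backward request too far
      # Backward seeks are charged back_seek_penalty times their distance.
      return "backward" if back_dist * back_seek_penalty < fwd_dist else "forward"

  print(pick_next(head=1_000_000, forward_pos=1_050_000, backward_pos=990_000))
  # -> "backward", since 10_000 * 2 < 50_000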

fifo_expire_async
-----------------

This parameter is used to set the timeout of asynchronous requests. Its
default value is 248 ms.

fifo_expire_sync
----------------

This parameter is used to set the timeout of synchronous requests. Its
default value is 124 ms. To favor synchronous requests over
asynchronous ones, decrease this value relative to fifo_expire_async.

low_latency
-----------

This parameter is used to enable/disable BFQ's low latency mode. By
default, low latency mode is enabled. If enabled, interactive and soft
real-time applications are privileged and experience a lower latency,
as explained in more detail in the description of how BFQ works.

DISABLE this mode if you need full control over bandwidth
distribution. In fact, if it is enabled, then BFQ automatically
increases the bandwidth share of privileged applications, as the main
means to guarantee a lower latency to them.

In addition, as already highlighted at the beginning of this document,
DISABLE this mode if your only goal is to achieve a high throughput.
In fact, privileging the I/O of some application over the rest may
entail a lower throughput. To achieve the highest-possible throughput
on a non-rotational device, setting slice_idle to 0 may be needed too
(at the cost of giving up any strong guarantee on fairness and low
latency).

timeout_sync
------------

Maximum amount of device time that can be given to a task (queue) once
it has been selected for service. On devices with costly seeks,
increasing this time usually increases maximum throughput. On the
opposite end, increasing this time coarsens the granularity of the
short-term bandwidth and latency guarantees, especially if the
following parameter is set to zero.

max_budget
----------

Maximum amount of service, measured in sectors, that can be provided
to a BFQ queue once it is set in service (of course within the limits
of the above timeout). As explained in the description of the
algorithm, larger values increase the throughput in proportion to the
percentage of sequential I/O requests issued. The price of larger
values is that they coarsen the granularity of short-term bandwidth
and latency guarantees.

The default value is 0, which enables auto-tuning: BFQ sets max_budget
to the maximum number of sectors that can be served during
timeout_sync, according to the estimated peak rate.

For specific devices, some users have occasionally reported reaching a
higher throughput by setting max_budget explicitly, i.e., by setting
max_budget to a value higher than 0. In particular, they have set
max_budget to values higher than those that BFQ would have chosen with
auto-tuning. An alternative way to achieve this goal is to just
increase the value of timeout_sync, leaving max_budget equal to 0.
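
To see what auto-tuning amounts to numerically, the snippet below
computes how many sectors fit in timeout_sync at a given peak rate,
which is roughly the value that auto-tuning would derive for
max_budget; the peak rate and timeout used here are arbitrary example
figures, not measured defaults::

  # Rough arithmetic behind max_budget auto-tuning: the number of 512-byte
  # sectors that can be served within timeout_sync at the estimated peak
  # rate. The figures below are arbitrary examples.
  SECTOR_SIZE = 512  # bytes

  def auto_tuned_max_budget(peak_rate_bytes_per_s, timeout_sync_ms):
      return int(peak_rate_bytes_per_s * timeout_sync_ms / 1000 / SECTOR_SIZE)

  # e.g. a device sustaining ~400 MB/s with a 125 ms timeout_sync:
  print(auto_tuned_max_budget(400e6, 125), "sectors")  # 97656 (~48 MB)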

weights
-------

Read-only parameter, used to show the weights of the currently active
BFQ queues.


4. Group scheduling with BFQ
============================

BFQ supports both cgroups-v1 and cgroups-v2 io controllers, namely
blkio and io. In particular, BFQ supports weight-based proportional
share. To activate cgroups support, set CONFIG_BFQ_GROUP_IOSCHED.

4-1 Service guarantees provided
-------------------------------

With BFQ, proportional share means true proportional share of the
device bandwidth, according to group weights. For example, a group
with weight 200 gets twice the bandwidth, and not just twice the time,
of a group with weight 100.

BFQ supports hierarchies (group trees) of any depth. Bandwidth is
distributed among groups and processes in the expected way: for each
group, the children of the group share the whole bandwidth of the
group in proportion to their weights. In particular, this implies
that, for each leaf group, every process of the group receives the
same share of the whole group bandwidth, unless the ioprio of the
process is modified.

The resource-sharing guarantee for a group may partially or totally
switch from bandwidth to time, if providing bandwidth guarantees to
the group lowers the throughput too much. This switch occurs on a
per-process basis: if a process of a leaf group would cause a
throughput loss if served in such a way as to receive its share of the
bandwidth, then BFQ switches back to just time-based proportional
share for that process.

4-2 Interface
-------------

To get proportional sharing of bandwidth with BFQ for a given device,
BFQ must of course be the active scheduler for that device.

Within each group directory, the names of the files associated with
BFQ-specific cgroup parameters and stats begin with the "bfq."
prefix. So, with cgroups-v1 or cgroups-v2, the full prefix for
BFQ-specific files is "blkio.bfq." or "io.bfq." For example, the group
parameter to set the weight of a group with BFQ is blkio.bfq.weight
or io.bfq.weight.

As for cgroups-v1 (blkio controller), the exact set of stat files
created, and kept up-to-date by bfq, depends on whether
CONFIG_BFQ_CGROUP_DEBUG is set. If it is set, then bfq creates all
the stat files documented in
Documentation/cgroup-v1/blkio-controller.rst. If, instead,
CONFIG_BFQ_CGROUP_DEBUG is not set, then bfq creates only the files::

  blkio.bfq.io_service_bytes
  blkio.bfq.io_service_bytes_recursive
  blkio.bfq.io_serviced
  blkio.bfq.io_serviced_recursive

The value of CONFIG_BFQ_CGROUP_DEBUG greatly influences the maximum
throughput sustainable with bfq, because updating the blkio.bfq.*
stats is rather costly, especially for some of the stats enabled by
CONFIG_BFQ_CGROUP_DEBUG.

Parameters to set
-----------------

For each group, there is only the following parameter to set.

weight (namely blkio.bfq.weight or io.bfq.weight): the weight of the
group inside its parent. Available values: 1..10000 (default 100). The
linear mapping between ioprio and weights, described at the beginning
of the tunables section, is still valid, but all weights higher than
IOPRIO_BE_NR*10 are mapped to ioprio 0.

Recall that, if low-latency is set, then BFQ automatically raises the
weight of the queues associated with interactive and soft real-time
applications. Unset this tunable if you need/want to control weights.
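
Putting the interface together, the sketch below assigns a BFQ weight
to a group. It assumes a cgroups-v2 hierarchy mounted at
/sys/fs/cgroup, with the io controller enabled for the subtree; the
group name, weight and PID are arbitrary examples, and BFQ must be the
active scheduler on the devices of interest::

  # Minimal cgroups-v2 sketch: create a group and give it a BFQ weight.
  # Needs root; the group name, weight and PID below are placeholders.
  from pathlib import Path

  group = Path("/sys/fs/cgroup/media")
  group.mkdir(exist_ok=True)

  # Twice the default weight (default 100, available range 1..10000).
  (group / "io.bfq.weight").write_text("200\n")

  # Move a process into the group so that its I/O is accounted to it.
  (group / "cgroup.procs").write_text("1234\n")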


[1]
    P. Valente, A. Avanzini, "Evolution of the BFQ Storage I/O
    Scheduler", Proceedings of the First Workshop on Mobile System
    Technologies (MST-2015), May 2015.

    http://algogroup.unimore.it/people/paolo/disk_sched/mst-2015.pdf

[2]
    P. Valente and M. Andreolini, "Improving Application
    Responsiveness with the BFQ Disk I/O Scheduler", Proceedings of
    the 5th Annual International Systems and Storage Conference
    (SYSTOR '12), June 2012.

    Slightly extended version:

    http://algogroup.unimore.it/people/paolo/disk_sched/bfq-v1-suite-results.pdf

[3]
    https://github.com/Algodev-github/S