xref: /freebsd/usr.sbin/nfsd/nfsv4.4 (revision 45486372)
1.\" Copyright (c) 2009 Rick Macklem, University of Guelph
2.\" All rights reserved.
3.\"
4.\" Redistribution and use in source and binary forms, with or without
5.\" modification, are permitted provided that the following conditions
6.\" are met:
7.\" 1. Redistributions of source code must retain the above copyright
8.\"    notice, this list of conditions and the following disclaimer.
9.\" 2. Redistributions in binary form must reproduce the above copyright
10.\"    notice, this list of conditions and the following disclaimer in the
11.\"    documentation and/or other materials provided with the distribution.
12.\"
13.\" THIS SOFTWARE IS PROVIDED BY THE AUTHOR AND CONTRIBUTORS ``AS IS'' AND
14.\" ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
15.\" IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
16.\" ARE DISCLAIMED.  IN NO EVENT SHALL THE AUTHOR OR CONTRIBUTORS BE LIABLE
17.\" FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
18.\" DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
19.\" OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
20.\" HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
21.\" LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
22.\" OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
23.\" SUCH DAMAGE.
24.\"
25.Dd January 8, 2024
26.Dt NFSV4 4
27.Os
28.Sh NAME
29.Nm NFSv4
30.Nd NFS Version 4 Protocol
31.Sh DESCRIPTION
32The NFS client and server provides support for the
33.Tn NFSv4
34specification; see
35.%T "Network File System (NFS) Version 4 Protocol RFC 7530" ,
36.%T "Network File System (NFS) Version 4 Minor Version 1 Protocol RFC 5661" ,
37.%T "Network File System (NFS) Version 4 Minor Version 2 Protocol RFC 7862" ,
38.%T "File System Extended Attributes in NFSv4 RFC 8276" and
39.%T "Parallel NFS (pNFS) Flexible File Layout RFC 8435" .
40The protocol is somewhat similar to NFS Version 3, but differs in significant
41ways.
42It uses a single compound RPC that concatenates operations to-gether.
43Each of these operations are similar to the RPCs of NFS Version 3.
44The operations in the compound are performed in order, until one of
45them fails (returns an error) and then the RPC terminates at that point.
46.Pp
47It has
48integrated locking support, which implies that the server is no longer
49stateless.
50As such, the
51.Nm
52server remains in recovery mode for a grace period (always greater than the
53lease duration the server uses) after a reboot.
54During this grace period, clients may recover state but not perform other
55open/lock state changing operations.
56To provide for correct recovery semantics, a small file described by
57.Xr stablerestart 5
58is used by the server during the recovery phase.
59If this file is missing or empty, there is a backup copy maintained by
60.Xr nfsd 8
61that will be used.
62If either file is missing, they will be created by the
63.Xr nfsd 8 .
64If both the file and the backup copy are empty,
65it will result in the server starting without providing a grace period
66for recovery.
67Note that recovery only occurs when the server
68machine is rebooted, not when the
69.Xr nfsd 8
70are just restarted.
71.Pp
72It provides several optional features not present in NFS Version 3:
73.sp
74.Bd -literal -offset indent -compact
75- NFS Version 4 ACLs
76- Referrals, which redirect subtrees to other servers
77  (not yet implemented)
78- Delegations, which allow a client to operate on a file locally
79- pNFS, where I/O operations are separated from Metadata operations
80And for NFSv4.2 only
81- User namespace extended attributes
82- lseek(SEEK_DATA/SEEK_HOLE)
83- File copying done locally on the server for copy_file_range(2)
84- posix_fallocate(2)
85- posix_fadvise(POSIX_FADV_WILLNEED/POSIX_FADV_DONTNEED)
86.Ed
87.Pp
88The
89.Nm
90protocol does not use a separate mount protocol and assumes that the
91server provides a single file system tree structure, rooted at the point
92in the local file system tree specified by one or more
93.sp 1
94.Bd -literal -offset indent -compact
95V4: <rootdir> [-sec=secflavors] [host(s) or net]
96.Ed
97.sp 1
98line(s) in the
99.Xr exports 5
100file.
101(See
102.Xr exports 5
103for details.)
104The
105.Xr nfsd 8
106allows a limited subset of operations to be performed on non-exported subtrees
107of the local file system, so that traversal of the tree to the exported
108subtrees is possible.
109As such, the ``<rootdir>'' can be in a non-exported file system.
110The exception is ZFS, which checks exports and, as such, all ZFS file systems
111below the ``<rootdir>'' must be exported.
112However,
113the entire tree that is rooted at that point must be in local file systems
114that are of types that can be NFS exported.
115Since the
116.Nm
117file system is rooted at ``<rootdir>'', setting this to anything other
118than ``/'' will result in clients being required to use different mount
119paths for
120.Nm
121than for NFS Version 2 or 3.
122Unlike NFS Version 2 and 3, Version 4 allows a client mount to span across
123multiple server file systems, although not all clients are capable of doing
124this.
125.Pp
126.Nm
127uses strings for users and groups instead of numbers.
128On the wire, these strings can either have the numbers in the string or
129take the form:
130.sp
131.Bd -literal -offset indent -compact
132<user>@<dns.domain>
133.Ed
134.sp
135where ``<dns.domain>'' is not the same as the DNS domain used
136for host name lookups, but is usually set to the same string.
137Most systems set this ``<dns.domain>''
138to the domain name part of the machine's
139.Xr hostname 1
140by default.
141However, this can normally be overridden by a command line
142option or configuration file for the daemon used to do the name<->number
143mapping.
144Under
145.Fx ,
146the mapping daemon is called
147.Xr nfsuserd 8
148and has a command line option that overrides the domain component of the
149machine's hostname.
150For use of this form of string on
151.Nm ,
152either client or server, this daemon must be running.
153.Pp
154The form where the numbers are in the strings can only be used for AUTH_SYS.
155To configure your systems this way, the
156.Xr nfsuserd 8
157daemon does not need to be running on the server, but the following sysctls
158need to be set to 1 on the server.
159.sp
160.Bd -literal -offset indent -compact
161vfs.nfs.enable_uidtostring
162vfs.nfsd.enable_stringtouid
163.Ed
164.sp
165On the client, the sysctl
166.sp
167.Bd -literal -offset indent -compact
168vfs.nfs.enable_uidtostring
169.Ed
170.sp
171must be set to 1 and the
172.Xr nfsuserd 8
173daemon does not need to be running.
174.Pp
175If these strings are not configured correctly, ``ls -l'' will typically
176report a lot of ``nobody'' and ``nogroup'' ownerships.
177.Pp
178Although uid/gid numbers are no longer used in the
179.Nm
180protocol except optionally in the above strings, they will still be in the RPC
181authentication fields when using AUTH_SYS (sec=sys), which is the default.
182As such, in this case both the user/group name and number spaces must
183be consistent between the client and server.
184.Pp
185However, if you run
186.Nm
187with RPCSEC_GSS (sec=krb5, krb5i, krb5p), only names and KerberosV tickets
188will go on the wire.
189.Sh SERVER SETUP
190To set up the NFS server that supports
191.Nm ,
192you will need to set the variables in
193.Xr rc.conf 5
194as follows:
195.sp
196.Bd -literal -offset indent -compact
197nfs_server_enable="YES"
198nfsv4_server_enable="YES"
199.Ed
200.sp
201plus
202.sp
203.Bd -literal -offset indent -compact
204nfsuserd_enable="YES"
205.Ed
206.sp
207if the server is using the ``<user>@<domain>'' form of user/group strings or
208is using the ``-manage-gids'' option for
209.Xr nfsuserd 8 .
210.Pp
211In addition, you can set:
212.sp
213.Bd -literal -offset indent -compact
214nfsv4_server_only="YES"
215.Ed
216.sp
217to disable support for NFSv2 and NFSv3.
218.Pp
219You will also need to add at least one ``V4:'' line to the
220.Xr exports 5
221file for
222.Nm
223to work.
224.Pp
225If the file systems you are exporting are only being accessed via
226.Nm
227there are a couple of
228.Xr sysctl 8
229variables that you can change, which might improve performance.
230.Bl -tag -width Ds
231.It Cm vfs.nfsd.issue_delegations
232when set non-zero, allows the server to issue Open Delegations to
233clients.
234These delegations permit the client to manipulate the file
235locally on the client.
236Unfortunately, at this time, client use of
237delegations is limited, so performance gains may not be observed.
238This can only be enabled when the file systems being exported to
239.Nm
240clients are not being accessed locally on the server and, if being
241accessed via NFS Version 2 or 3 clients, these clients cannot be
242using the NLM.
243.It Cm vfs.nfsd.enable_locallocks
244can be set to 0 to disable acquisition of local byte range locks.
245Disabling local locking can only be done if neither local accesses
246to the exported file systems nor the NLM is operating on them.
247.El
248.sp
249Note that Samba server access would be considered ``local access'' for the above
250discussion.
251.Pp
252To build a kernel with the NFS server that supports
253.Nm
254linked into it, the
255.sp
256.Bd -literal -offset indent -compact
257options	NFSD
258.Ed
259.sp
260must be specified in the kernel's
261.Xr config 5
262file.
263.Sh CLIENT MOUNTS
264To do an
265.Nm
266mount, specify the ``nfsv4'' option on the
267.Xr mount_nfs 8
268command line.
269This will force use of the client that supports
270.Nm
271plus set ``tcp'' and
272.Nm .
273.Pp
274The
275.Xr nfsuserd 8
276must be running if name<->uid/gid mapping is being used, as above.
277Also, since an
278.Nm
279mount uses the host uuid to identify the client uniquely to the server,
280you cannot safely do an
281.Nm
282mount when
283.sp
284.Bd -literal -offset indent -compact
285hostid_enable="NO"
286.Ed
287.sp
288is set in
289.Xr rc.conf 5 .
290.sp
291If the
292.Nm
293server that is being mounted on supports delegations, you can start the
294.Xr nfscbd 8
295daemon to handle client side callbacks.
296This will occur if
297.sp
298.Bd -literal -offset indent -compact
299nfsuserd_enable="YES"	<-- If name<->uid/gid mapping is being used.
300nfscbd_enable="YES"
301.Ed
302.sp
303are set in
304.Xr rc.conf 5 .
305.sp
306Without a functioning callback path, a server will never issue Delegations
307to a client.
308.sp
309For NFSv4.0, by default, the callback address will be set to the IP address
310acquired via
311.Fn rtalloc
312in the kernel and port# 7745.
313To override the default port#, a command line option for
314.Xr nfscbd 8
315can be used.
316.sp
317To get callbacks to work when behind a NAT gateway, a port for the callback
318service will need to be set up on the NAT gateway and then the address
319of the NAT gateway (host IP plus port#) will need to be set by assigning the
320.Xr sysctl 8
321variable vfs.nfs.callback_addr to a string of the form:
322.sp
323N.N.N.N.N.N
324.sp
325where the first 4 Ns are the host IP address and the last two are the
326port# in network byte order (all decimal #s in the range 0-255).
327.Pp
328For NFSv4.1 and NFSv4.2, the callback path (called a backchannel) uses the
329same TCP connection as the mount, so none of the above applies and should
330work through gateways without any issues.
331.Pp
332To build a kernel with the client that supports
333.Nm
334linked into it, the option
335.sp
336.Bd -literal -offset indent -compact
337options	NFSCL
338.Ed
339.sp
340must be specified in the kernel's
341.Xr config 5
342file.
343.Pp
344Options can be specified for the
345.Xr nfsuserd 8
346and
347.Xr nfscbd 8
348daemons at boot time via the ``nfsuserd_flags'' and ``nfscbd_flags''
349.Xr rc.conf 5
350variables.
351.Pp
352NFSv4 mount(s) against exported volume(s) on the same host are not recommended,
353since this can result in a hung NFS server.
354It occurs when an nfsd thread tries to do an NFSv4
355.Fn VOP_RECLAIM
356/ Close RPC as part of acquiring a new vnode.
357If all other nfsd threads are blocked waiting for lock(s) held by this nfsd
358thread, then there is no nfsd thread to service the Close RPC.
359.Sh FILES
360.Bl -tag -width /var/db/nfs-stablerestart.bak -compact
361.It Pa /var/db/nfs-stablerestart
362NFS V4 stable restart file
363.It Pa /var/db/nfs-stablerestart.bak
364backup copy of the file
365.El
366.Sh SEE ALSO
367.Xr stablerestart 5 ,
368.Xr mountd 8 ,
369.Xr nfscbd 8 ,
370.Xr nfsd 8 ,
371.Xr nfsdumpstate 8 ,
372.Xr nfsrevoke 8 ,
373.Xr nfsuserd 8
374.Sh BUGS
375At this time, there is no recall of delegations for local file system
376operations.
377As such, delegations should only be enabled for file systems
378that are being used solely as NFS export volumes and are not being accessed
379via local system calls nor services such as Samba.
380