Thread (24 messages) 24 messages, 4 authors, 2021-12-10

Re: [PATCH RFC v6 2/2] nfsd: Initial implementation of NFSv4 Courteous Server

From: Chuck Lever III <hidden>
Date: 2021-12-08 15:58:57
Also in: linux-nfs

On Dec 8, 2021, at 10:54 AM, Dai Ngo [off-list ref] wrote:

On 12/6/21 11:55 AM, Chuck Lever III wrote:
quoted
quoted
+
+/*
+ * Function to check if the nfserr_share_denied error for 'fp' resulted
+ * from conflict with courtesy clients then release their state to resolve
+ * the conflict.
+ *
+ * Function returns:
+ *	 0 -  no conflict with courtesy clients
+ *	>0 -  conflict with courtesy clients resolved, try access/deny check again
+ *	-1 -  conflict with courtesy clients being resolved in background
+ *            return nfserr_jukebox to NFS client
+ */
+static int
+nfs4_destroy_clnts_with_sresv_conflict(struct svc_rqst *rqstp,
+			struct nfs4_file *fp, struct nfs4_ol_stateid *stp,
+			u32 access, bool share_access)
+{
+	int cnt = 0;
+	int async_cnt = 0;
+	bool no_retry = false;
+	struct nfs4_client *cl;
+	struct list_head *pos, *next, reaplist;
+	struct nfsd_net *nn = net_generic(SVC_NET(rqstp), nfsd_net_id);
+
+	INIT_LIST_HEAD(&reaplist);
+	spin_lock(&nn->client_lock);
+	list_for_each_safe(pos, next, &nn->client_lru) {
+		cl = list_entry(pos, struct nfs4_client, cl_lru);
+		/*
+		 * check all nfs4_ol_stateid of this client
+		 * for conflicts with 'access'mode.
+		 */
+		if (nfs4_check_deny_bmap(cl, fp, stp, access, share_access)) {
+			if (!test_bit(NFSD4_COURTESY_CLIENT, &cl->cl_flags)) {
+				/* conflict with non-courtesy client */
+				no_retry = true;
+				cnt = 0;
+				goto out;
+			}
+			/*
+			 * if too many to resolve synchronously
+			 * then do the rest in background
+			 */
+			if (cnt > 100) {
+				set_bit(NFSD4_DESTROY_COURTESY_CLIENT, &cl->cl_flags);
+				async_cnt++;
+				continue;
+			}
+			if (mark_client_expired_locked(cl))
+				continue;
+			cnt++;
+			list_add(&cl->cl_lru, &reaplist);
+		}
+	}
Bruce suggested simply returning NFS4ERR_DELAY for all cases.
That would simplify this quite a bit for what is a rare edge
case.
If we always do this asynchronously by returning NFS4ERR_DELAY
for all cases then the following pynfs tests need to be modified
to handle the error:

RENEW3   st_renew.testExpired                                     : FAILURE
LKU10    st_locku.testTimedoutUnlock                              : FAILURE
CLOSE9   st_close.testTimedoutClose2                              : FAILURE

and any new tests that opens file have to be prepared to handle
NFS4ERR_DELAY due to the lack of destroy_clientid in 4.0.

Do we still want to take this approach?
I'm still interested, but Bruce should chime in.

Maybe Calum could have a look under the covers of pynfs and see how difficult the change might be.


--
Chuck Lever


Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help