Thread (4 messages) 4 messages, 2 authors, 2014-06-02

Re: [PATCH] imsm: retry load_and_parse_mpb if we suspect mdmon has made modifications

From: Artur Paszkiewicz <hidden>
Date: 2014-06-02 13:02:59
Subsystem: the rest · Maintainer: Linus Torvalds

On 06/02/2014 04:36 AM, NeilBrown wrote:
On Fri, 30 May 2014 15:18:33 +0200 Artur Paszkiewicz
[off-list ref] wrote:
quoted
If the checksum verification fails in mdadm and mdmon is running, retry
the load to get a consistent snapshot of the mpb.

Based on db575f3b

Signed-off-by: Artur Paszkiewicz <redacted>
Reviewed-by: Pawel Baldysiak <redacted>
---
 super-intel.c | 17 +++++++++++++++++
 1 file changed, 17 insertions(+)
diff --git a/super-intel.c b/super-intel.c
index f0a7ab5..037c018 100644
--- a/super-intel.c
+++ b/super-intel.c
@@ -4422,6 +4422,7 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
 {
 	struct intel_super *super;
 	int rv;
+	int retry;
 
 	if (test_partition(fd))
 		/* IMSM not allowed on partitions */
@@ -4444,6 +4445,22 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
 	}
 	rv = load_and_parse_mpb(fd, super, devname, 0);
 
+	/* retry the load if we might have raced against mdmon */
+	if (rv == 3) {
+		struct mdstat_ent *mdstat = mdstat_by_component(fd2devnm(fd));
+
+		if (mdmon_running(mdstat->devnm) && getpid() != mdmon_pid(mdstat->devnm)) {
+			for (retry = 0; retry < 3; retry++) {
+				usleep(3000);
+				rv = load_and_parse_mpb(fd, super, devname, 0);
+				if (rv != 3)
+					break;
+			}
+		}
The only thing you use from mdstat is devnm, and that is the thing you passed
to mdstat_by_component to get mdstat....

Can you just do
   char *devnm = fd2devnm(fd);
   if (mdmon_running(devnm) && ......)

??
I can't do that because mdmon_running and mdmon_pid need a devnm of a
container device, and the only thing we have here is the file descriptor
of a component device. So I used mdstat_by_component to get the
container devnm. Do you have an idea how to get that reliably without
reading mdstat?

I have overlooked that mdstat_by_component can return NULL here. I've
added a check for this in the patch below.

Thanks,
Artur

From dfb12870a482654b405ec1d4d9d3a8ba69a6290c Mon Sep 17 00:00:00 2001
From: Artur Paszkiewicz <redacted>
Date: Tue, 27 May 2014 15:30:54 +0200
Subject: [PATCH] imsm: retry load_and_parse_mpb if we suspect mdmon has made
 modifications

If the checksum verification fails in mdadm and mdmon is running, retry
the load to get a consistent snapshot of the mpb.

Based on db575f3b

Signed-off-by: Artur Paszkiewicz <redacted>
---
 super-intel.c | 17 +++++++++++++++++
 1 file changed, 17 insertions(+)
diff --git a/super-intel.c b/super-intel.c
index f0a7ab5..9dd807a 100644
--- a/super-intel.c
+++ b/super-intel.c
@@ -4422,6 +4422,7 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
 {
 	struct intel_super *super;
 	int rv;
+	int retry;
 
 	if (test_partition(fd))
 		/* IMSM not allowed on partitions */
@@ -4444,6 +4445,22 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
 	}
 	rv = load_and_parse_mpb(fd, super, devname, 0);
 
+	/* retry the load if we might have raced against mdmon */
+	if (rv == 3) {
+		struct mdstat_ent *mdstat = mdstat_by_component(fd2devnm(fd));
+
+		if (mdstat && mdmon_running(mdstat->devnm) && getpid() != mdmon_pid(mdstat->devnm)) {
+			for (retry = 0; retry < 3; retry++) {
+				usleep(3000);
+				rv = load_and_parse_mpb(fd, super, devname, 0);
+				if (rv != 3)
+					break;
+			}
+		}
+
+		free_mdstat(mdstat);
+	}
+
 	if (rv) {
 		if (devname)
 			pr_err("Failed to load all information "
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help