Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns

CGroup Namespaces (v4) · serge@hallyn.com · 2015-11-16
[PATCH 4/8] cgroup: export cgroup_get() and cgroup_put() · serge@hallyn.com · 2015-11-16
Re: [PATCH 4/8] cgroup: export cgroup_get() and cgroup_put() · Tejun Heo <tj@kernel.org> · 2015-11-24
Re: [PATCH 4/8] cgroup: export cgroup_get() and cgroup_put() · Serge E. Hallyn <hidden> · 2015-11-24
[PATCH 2/8] sched: new clone flag CLONE_NEWCGROUP for cgroup namespace · serge@hallyn.com · 2015-11-16
[PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · serge@hallyn.com · 2015-11-16
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Tejun Heo <tj@kernel.org> · 2015-11-24
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Serge E. Hallyn <hidden> · 2015-11-25
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Tejun Heo <tj@kernel.org> · 2015-11-25
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Serge Hallyn <hidden> · 2015-11-25
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Tejun Heo <tj@kernel.org> · 2015-11-25
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Serge E. Hallyn <hidden> · 2015-11-27
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Tejun Heo <tj@kernel.org> · 2015-11-30
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Serge E. Hallyn <hidden> · 2015-12-01
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Tejun Heo <tj@kernel.org> · 2015-12-01
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Serge E. Hallyn <hidden> · 2015-12-01
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Tejun Heo <tj@kernel.org> · 2015-12-02
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Serge E. Hallyn <hidden> · 2015-12-02
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Tejun Heo <tj@kernel.org> · 2015-12-02
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Serge E. Hallyn <hidden> · 2015-12-02
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Tejun Heo <tj@kernel.org> · 2015-12-02
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Serge E. Hallyn <hidden> · 2015-12-03
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Tejun Heo <tj@kernel.org> · 2015-12-07
Re: [PATCH 7/8] cgroup: mount cgroupns-root when inside non-init cgroupns · Serge Hallyn <hidden> · 2015-12-07
[PATCH 8/8] cgroup: Add documentation for cgroup namespaces · serge@hallyn.com · 2015-11-16
Re: [PATCH 8/8] cgroup: Add documentation for cgroup namespaces · Tejun Heo <tj@kernel.org> · 2015-11-24
[PATCH 5/8] cgroup: introduce cgroup namespaces · serge@hallyn.com · 2015-11-16
Re: [PATCH 5/8] cgroup: introduce cgroup namespaces · Tejun Heo <tj@kernel.org> · 2015-11-24
[PATCH 6/8] cgroup: cgroup namespace setns support · serge@hallyn.com · 2015-11-16
Re: [PATCH 6/8] cgroup: cgroup namespace setns support · Tejun Heo <tj@kernel.org> · 2015-11-24
[PATCH 1/8] kernfs: Add API to generate relative kernfs path · serge@hallyn.com · 2015-11-16
Re: [PATCH 1/8] kernfs: Add API to generate relative kernfs path · Tejun Heo <tj@kernel.org> · 2015-11-24
Re: [PATCH 1/8] kernfs: Add API to generate relative kernfs path · Tejun Heo <tj@kernel.org> · 2015-11-24
Re: [PATCH 1/8] kernfs: Add API to generate relative kernfs path · Serge E. Hallyn <hidden> · 2015-11-24
Re: [PATCH 1/8] kernfs: Add API to generate relative kernfs path · Serge E. Hallyn <hidden> · 2015-11-27
Re: [PATCH 1/8] kernfs: Add API to generate relative kernfs path · Tejun Heo <tj@kernel.org> · 2015-11-30
Re: [PATCH 1/8] kernfs: Add API to generate relative kernfs path · Serge E. Hallyn <hidden> · 2015-11-30
Re: [PATCH 1/8] kernfs: Add API to generate relative kernfs path · Tejun Heo <tj@kernel.org> · 2015-11-30
Re: [PATCH 1/8] kernfs: Add API to generate relative kernfs path · Serge E. Hallyn <hidden> · 2015-12-01
[PATCH 3/8] cgroup: add function to get task's cgroup · serge@hallyn.com · 2015-11-16
Re: [PATCH 3/8] cgroup: add function to get task's cgroup · Tejun Heo <tj@kernel.org> · 2015-11-24
Re: [PATCH 3/8] cgroup: add function to get task's cgroup · Tejun Heo <tj@kernel.org> · 2015-11-24
Re: CGroup Namespaces (v4) · Richard Weinberger <hidden> · 2015-11-16
Re: CGroup Namespaces (v4) · "Serge E. Hallyn" <serge@hallyn.com> · 2015-11-16
Re: CGroup Namespaces (v4) · Richard Weinberger <richard@nod.at> · 2015-11-16
Re: CGroup Namespaces (v4) · "Serge E. Hallyn" <serge@hallyn.com> · 2015-11-16

From: Tejun Heo <hidden>
Date: 2015-11-24 17:16:17
Also in: cgroups, lkml

Hello,

On Mon, Nov 16, 2015 at 01:51:44PM -0600, serge-A9i7LUbDfNHQT0dZR+AlfA@public.gmane.org wrote:

+struct dentry *kernfs_obtain_root(struct super_block *sb,
+				  struct kernfs_node *kn)
+{
+	struct dentry *dentry;
+	struct inode *inode;
+
+	BUG_ON(sb->s_op != &kernfs_sops);
+
+	/* inode for the given kernfs_node should already exist. */
+	inode = ilookup(sb, kn->ino);
+	if (!inode) {
+		pr_debug("kernfs: could not get inode for '");
+		pr_cont_kernfs_path(kn);
+		pr_cont("'.\n");
+		return ERR_PTR(-EINVAL);
+	}

Hmmm... but inode might not have been instantiated yet.  Why not use
kernfs_get_inode()?

+	/* instantiate and link root dentry */
+	dentry = d_obtain_root(inode);
+	if (!dentry) {
+		pr_debug("kernfs: could not get dentry for '");
+		pr_cont_kernfs_path(kn);
+		pr_cont("'.\n");
+		return ERR_PTR(-ENOMEM);
+	}
+
+	/* If this is a new dentry, set it up. We need kernfs_mutex because this
+	 * may be called by callers other than kernfs_fill_super. */

Formatting.

+	mutex_lock(&kernfs_mutex);
+	if (!dentry->d_fsdata) {
+		kernfs_get(kn);
+		dentry->d_fsdata = kn;
+	} else {
+		WARN_ON(dentry->d_fsdata != kn);
+	}
+	mutex_unlock(&kernfs_mutex);
+
+	return dentry;
+}

Wouldn't it be simpler to walk dentry from kernfs root than
duplicating dentry instantiation?

quoted hunk ↗ jump to hunk

diff --git a/kernel/cgroup.c b/kernel/cgroup.c
index 1d696de..0a3e893 100644
--- a/kernel/cgroup.c
+++ b/kernel/cgroup.c

@@ -2112,11 +2120,31 @@ out_free:
 	kfree(opts.release_agent);
 	kfree(opts.name);
 
-	if (ret)
+	if (ret) {
+		put_cgroup_ns(ns);
 		return ERR_PTR(ret);
+	}
 
 	dentry = kernfs_mount(fs_type, flags, root->kf_root,
 				CGROUP_SUPER_MAGIC, &new_sb);
+
+	if (!IS_ERR(dentry)) {
+		/* In non-init cgroup namespace, instead of root cgroup's
+		 * dentry, we return the dentry corresponding to the
+		 * cgroupns->root_cgrp.
+		 */

Formatting.

+		if (ns != &init_cgroup_ns) {
+			struct dentry *nsdentry;
+			struct cgroup *cgrp;
+
+			cgrp = cset_cgroup_from_root(ns->root_cgrps, root);
+			nsdentry = kernfs_obtain_root(dentry->d_sb,
+				cgrp->kn);
+			dput(dentry);
+			dentry = nsdentry;
+		}
+	}

So, this would effectively allow namespace mounts to claim controllers
which aren't configured otherwise which doesn't seem like a good idea.
I think the right thing to do for namespace mounts is to always
require an existing superblock.

Thanks.

-- 
tejun

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help