Re: [RFC v4+ hot_track 09/19] vfs: add one work queue

[RFC v4+ hot_track 00/19] vfs: hot data tracking · <hidden> · 2012-10-29
[RFC v4+ hot_track 01/19] vfs: introduce private radix tree structures · <hidden> · 2012-10-29
[RFC v4+ hot_track 02/19] vfs: initialize and free data structures · <hidden> · 2012-10-29
Re: [RFC v4+ hot_track 02/19] vfs: initialize and free data structures · David Sterba <hidden> · 2012-11-06
Re: [RFC v4+ hot_track 02/19] vfs: initialize and free data structures · Zhi Yong Wu <hidden> · 2012-11-07
Re: [RFC v4+ hot_track 02/19] vfs: initialize and free data structures · Zhi Yong Wu <hidden> · 2012-11-16
[RFC v4+ hot_track 03/19] vfs: add I/O frequency update function · <hidden> · 2012-10-29
Re: [RFC v4+ hot_track 03/19] vfs: add I/O frequency update function · Steven Whitehouse <hidden> · 2012-11-05
Re: [RFC v4+ hot_track 03/19] vfs: add I/O frequency update function · Zhi Yong Wu <hidden> · 2012-11-05
Re: [RFC v4+ hot_track 03/19] vfs: add I/O frequency update function · David Sterba <hidden> · 2012-11-06
Re: [RFC v4+ hot_track 03/19] vfs: add I/O frequency update function · Zhi Yong Wu <hidden> · 2012-11-07
Re: [RFC v4+ hot_track 03/19] vfs: add I/O frequency update function · Darrick J. Wong <hidden> · 2012-11-06
Re: [RFC v4+ hot_track 03/19] vfs: add I/O frequency update function · Zhi Yong Wu <hidden> · 2012-11-07
Re: [RFC v4+ hot_track 03/19] vfs: add I/O frequency update function · Darrick J. Wong <hidden> · 2012-11-07
Re: [RFC v4+ hot_track 03/19] vfs: add I/O frequency update function · Zhi Yong Wu <hidden> · 2012-11-08
[RFC v4+ hot_track 04/19] vfs: add two map arrays · <hidden> · 2012-10-29
[RFC v4+ hot_track 05/19] vfs: add hooks to enable hot tracking · <hidden> · 2012-10-29
Re: [RFC v4+ hot_track 05/19] vfs: add hooks to enable hot tracking · David Sterba <hidden> · 2012-11-06
Re: [RFC v4+ hot_track 05/19] vfs: add hooks to enable hot tracking · Zhi Yong Wu <hidden> · 2012-11-07
[RFC v4+ hot_track 06/19] vfs: add temp calculation function · <hidden> · 2012-10-29
[RFC v4+ hot_track 07/19] vfs: add map info update function · <hidden> · 2012-10-29
[RFC v4+ hot_track 09/19] vfs: add one work queue · <hidden> · 2012-10-29
Re: [RFC v4+ hot_track 09/19] vfs: add one work queue · Steven Whitehouse <hidden> · 2012-11-05
Re: [RFC v4+ hot_track 09/19] vfs: add one work queue · Zhi Yong Wu <hidden> · 2012-11-05
Re: [RFC v4+ hot_track 09/19] vfs: add one work queue · Steven Whitehouse <hidden> · 2012-11-05
Re: [RFC v4+ hot_track 09/19] vfs: add one work queue · Zhi Yong Wu <hidden> · 2012-11-05
[RFC v4+ hot_track 10/19] vfs: introduce hot func register framework · <hidden> · 2012-10-29
Re: [RFC v4+ hot_track 10/19] vfs: introduce hot func register framework · David Sterba <hidden> · 2012-11-06
Re: [RFC v4+ hot_track 10/19] vfs: introduce hot func register framework · Zhi Yong Wu <hidden> · 2012-11-07
Re: [RFC v4+ hot_track 10/19] vfs: introduce hot func register framework · Darrick J. Wong <hidden> · 2012-11-06
Re: [RFC v4+ hot_track 10/19] vfs: introduce hot func register framework · Zhi Yong Wu <hidden> · 2012-11-07
Re: [RFC v4+ hot_track 10/19] vfs: introduce hot func register framework · Darrick J. Wong <hidden> · 2012-11-07
Re: [RFC v4+ hot_track 10/19] vfs: introduce hot func register framework · Zhi Yong Wu <hidden> · 2012-11-08
[RFC v4+ hot_track 11/19] vfs: register one shrinker · <hidden> · 2012-10-29
[RFC v4+ hot_track 12/19] vfs: add one ioctl interface · <hidden> · 2012-10-29
Re: [RFC v4+ hot_track 12/19] vfs: add one ioctl interface · David Sterba <hidden> · 2012-11-06
Re: [RFC v4+ hot_track 12/19] vfs: add one ioctl interface · Zhi Yong Wu <hidden> · 2012-11-07
[RFC v4+ hot_track 13/19] debugfs: introduce one function · <hidden> · 2012-10-29
Re: [RFC v4+ hot_track 13/19] debugfs: introduce one function · Greg KH <gregkh@linuxfoundation.org> · 2012-10-29
Re: [RFC v4+ hot_track 13/19] debugfs: introduce one function · Zhi Yong Wu <hidden> · 2012-10-29
Re: [RFC v4+ hot_track 13/19] debugfs: introduce one function · Greg KH <gregkh@linuxfoundation.org> · 2012-10-29
Re: [RFC v4+ hot_track 13/19] debugfs: introduce one function · Zhi Yong Wu <hidden> · 2012-10-29
Re: [RFC v4+ hot_track 13/19] debugfs: introduce one function · Greg KH <gregkh@linuxfoundation.org> · 2012-10-29
Re: [RFC v4+ hot_track 13/19] debugfs: introduce one function · Zhi Yong Wu <hidden> · 2012-10-29
[RFC v4+ hot_track 14/19] vfs: add debugfs support · <hidden> · 2012-10-29
Re: [RFC v4+ hot_track 14/19] vfs: add debugfs support · David Sterba <hidden> · 2012-11-06
Re: [RFC v4+ hot_track 14/19] vfs: add debugfs support · Zhi Yong Wu <hidden> · 2012-11-07
[RFC v4+ hot_track 15/19] sysfs: add two hot_track proc files · <hidden> · 2012-10-29
Re: [RFC v4+ hot_track 15/19] sysfs: add two hot_track proc files · Greg KH <gregkh@linuxfoundation.org> · 2012-10-29
Re: [RFC v4+ hot_track 15/19] sysfs: add two hot_track proc files · Zhi Yong Wu <hidden> · 2012-10-29
[RFC v4+ hot_track 16/19] btrfs: add hot tracking support · <hidden> · 2012-10-29
Re: [RFC v4+ hot_track 16/19] btrfs: add hot tracking support · David Sterba <hidden> · 2012-11-07
Re: [RFC v4+ hot_track 16/19] btrfs: add hot tracking support · Zhi Yong Wu <hidden> · 2012-11-07
[RFC v4+ hot_track 18/19] ext4: add hot tracking support · <hidden> · 2012-10-29
[RFC v4+ hot_track 19/19] vfs: add documentation · <hidden> · 2012-10-29
[RFC v4+ hot_track 08/19] vfs: add aging function · <hidden> · 2012-10-29
[RFC v4+ hot_track 17/19] xfs: add hot tracking support · <hidden> · 2012-10-29
Re: [RFC v4+ hot_track 00/19] vfs: hot data tracking · Zhi Yong Wu <hidden> · 2012-11-07

From: Zhi Yong Wu <hidden>
Date: 2012-11-05 11:55:40
Also in: linux-btrfs, linux-fsdevel, lkml

On Mon, Nov 5, 2012 at 7:21 PM, Steven Whitehouse [off-list ref] wrote:

Hi,

On Mon, 2012-10-29 at 12:30 +0800, zwu.kernel@gmail.com wrote:

quoted

From: Zhi Yong Wu <redacted>

  Add a per-superblock workqueue and a delayed_work
to run periodic work to update map info on each superblock.

Signed-off-by: Zhi Yong Wu <redacted>
---
 fs/hot_tracking.c            |   85 ++++++++++++++++++++++++++++++++++++++++++
 fs/hot_tracking.h            |    3 +
 include/linux/hot_tracking.h |    3 +
 3 files changed, 91 insertions(+), 0 deletions(-)

diff --git a/fs/hot_tracking.c b/fs/hot_tracking.c
index fff0038..0ef9cad 100644
--- a/fs/hot_tracking.c
+++ b/fs/hot_tracking.c

@@ -15,9 +15,12 @@
 #include <linux/module.h>
 #include <linux/spinlock.h>
 #include <linux/hardirq.h>
+#include <linux/kthread.h>
+#include <linux/freezer.h>
 #include <linux/fs.h>
 #include <linux/blkdev.h>
 #include <linux/types.h>
+#include <linux/list_sort.h>
 #include <linux/limits.h>
 #include "hot_tracking.h"

@@ -557,6 +560,67 @@ static void hot_map_array_exit(struct hot_info *root)
      }
 }

+/* Temperature compare function*/
+static int hot_temp_cmp(void *priv, struct list_head *a,
+                             struct list_head *b)
+{
+     struct hot_comm_item *ap =
+                     container_of(a, struct hot_comm_item, n_list);
+     struct hot_comm_item *bp =
+                     container_of(b, struct hot_comm_item, n_list);
+
+     int diff = ap->hot_freq_data.last_temp
+                             - bp->hot_freq_data.last_temp;
+     if (diff > 0)
+             return -1;
+     if (diff < 0)
+             return 1;
+     return 0;
+}
+
+/*
+ * Every sync period we update temperatures for
+ * each hot inode item and hot range item for aging
+ * purposes.
+ */
+static void hot_update_worker(struct work_struct *work)
+{
+     struct hot_info *root = container_of(to_delayed_work(work),
+                                     struct hot_info, update_work);
+     struct hot_inode_item *hi_nodes[8];
+     u64 ino = 0;
+     int i, n;
+
+     while (1) {
+             n = radix_tree_gang_lookup(&root->hot_inode_tree,
+                                (void **)hi_nodes, ino,
+                                ARRAY_SIZE(hi_nodes));
+             if (!n)
+                     break;
+
+             ino = hi_nodes[n - 1]->i_ino + 1;
+             for (i = 0; i < n; i++) {
+                     kref_get(&hi_nodes[i]->hot_inode.refs);
+                     hot_map_array_update(
+                             &hi_nodes[i]->hot_inode.hot_freq_data, root);
+                     hot_range_update(hi_nodes[i], root);
+                     hot_inode_item_put(hi_nodes[i]);
+             }
+     }
+
+     /* Sort temperature map info */
+     for (i = 0; i < HEAT_MAP_SIZE; i++) {
+             list_sort(NULL, &root->heat_inode_map[i].node_list,
+                     hot_temp_cmp);
+             list_sort(NULL, &root->heat_range_map[i].node_list,
+                     hot_temp_cmp);
+     }
+

If this list can potentially have one (or more) entries per inode, then

Only one hot_inode_item per inode, while maybe multiple
hot_range_items per inode.

filesystems with a lot of inodes (millions) may potentially exceed the
max size of list which list_sort() can handle. If that happens it still
works, but you'll get a warning message and it won't be as efficient.

I haven't do so large scale test. If we want to find that issue, we
need to do large scale performance test, before that, i want to make
sure the code change is correct at first.
To be honest, for that issue you pointed to, i also have such
concern.But list_sort() performance looks good from the test result of
the following URL:
https://lkml.org/lkml/2010/1/20/485

It is something that we've run into with list_sort() and GFS2, but it
only happens very rarely,

Beside list_sort(), do you have any other way to share? For this
concern, how does GFS2 resolve it?

Steve.



-- 
Regards,

Zhi Yong Wu

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help