Re: [PATCH v5 00/16] md/raid5: set STRIPE_SIZE as a configurable value

[PATCH v5 00/16] md/raid5: set STRIPE_SIZE as a configurable value · Yufen Yu <hidden> · 2020-07-02
[PATCH v5 01/16] md/raid456: covert macro define of STRIPE_* as members of struct r5conf · Yufen Yu <hidden> · 2020-07-02
Re: [PATCH v5 01/16] md/raid456: covert macro define of STRIPE_* as members of struct r5conf · kernel test robot <hidden> · 2020-07-02
Re: [PATCH v5 01/16] md/raid456: covert macro define of STRIPE_* as members of struct r5conf · kernel test robot <hidden> · 2020-07-02
Re: [PATCH v5 01/16] md/raid456: covert macro define of STRIPE_* as members of struct r5conf · Song Liu <song@kernel.org> · 2020-07-02
Re: [PATCH v5 01/16] md/raid456: covert macro define of STRIPE_* as members of struct r5conf · Paul Menzel <hidden> · 2020-07-02
Re: [PATCH v5 01/16] md/raid456: covert macro define of STRIPE_* as members of struct r5conf · antlists <hidden> · 2020-07-12
Re: [PATCH v5 01/16] md/raid456: covert macro define of STRIPE_* as members of struct r5conf · Guoqing Jiang <hidden> · 2020-07-06
Re: [PATCH v5 01/16] md/raid456: covert macro define of STRIPE_* as members of struct r5conf · Guoqing Jiang <hidden> · 2020-07-06
Re: [PATCH v5 01/16] md/raid456: covert macro define of STRIPE_* as members of struct r5conf · Yufen Yu <hidden> · 2020-07-08
[PATCH v5 02/16] md/raid5: add sysfs entry to set and show stripe_size · Yufen Yu <hidden> · 2020-07-02
Re: [PATCH v5 02/16] md/raid5: add sysfs entry to set and show stripe_size · Song Liu <song@kernel.org> · 2020-07-02
[PATCH v5 03/16] md/raid5: set default stripe_size as 4096 · Yufen Yu <hidden> · 2020-07-02
[PATCH v5 04/16] md/raid5: add a member of r5pages for struct stripe_head · Yufen Yu <hidden> · 2020-07-02
Re: [PATCH v5 04/16] md/raid5: add a member of r5pages for struct stripe_head · Song Liu <song@kernel.org> · 2020-07-02
Re: [PATCH v5 04/16] md/raid5: add a member of r5pages for struct stripe_head · Jason Yan <yanaijie@huawei.com> · 2020-07-03
[PATCH v5 05/16] md/raid5: allocate and free shared pages of r5pages · Yufen Yu <hidden> · 2020-07-02
[PATCH v5 06/16] md/raid5: set correct page offset for bi_io_vec in ops_run_io() · Yufen Yu <hidden> · 2020-07-02
[PATCH v5 07/16] md/raid5: set correct page offset for async_copy_data() · Yufen Yu <hidden> · 2020-07-02
[PATCH v5 08/16] md/raid5: resize stripes and set correct offset when reshape array · Yufen Yu <hidden> · 2020-07-02
[PATCH v5 09/16] md/raid5: add new xor function to support different page offset · Yufen Yu <hidden> · 2020-07-02
[PATCH v5 10/16] md/raid5: add offset array in scribble buffer · Yufen Yu <hidden> · 2020-07-02
[PATCH v5 11/16] md/raid5: compute xor with correct page offset · Yufen Yu <hidden> · 2020-07-02
[PATCH v5 12/16] md/raid5: support config stripe_size by sysfs entry · Yufen Yu <hidden> · 2020-07-02
Re: [PATCH v5 12/16] md/raid5: support config stripe_size by sysfs entry · Song Liu <song@kernel.org> · 2020-07-02
Re: [PATCH v5 12/16] md/raid5: support config stripe_size by sysfs entry · Yufen Yu <hidden> · 2020-07-04
[PATCH v5 13/16] md/raid6: let syndrome computor support different page offset · Yufen Yu <hidden> · 2020-07-02
[PATCH v5 14/16] md/raid6: let async recovery function support different page offset · Yufen Yu <hidden> · 2020-07-02
[PATCH v5 15/16] md/raid6: compute syndrome with correct page offset · Yufen Yu <hidden> · 2020-07-02
[PATCH v5 16/16] raid6test: adaptation with syndrome function · Yufen Yu <hidden> · 2020-07-02
Re: [PATCH v5 00/16] md/raid5: set STRIPE_SIZE as a configurable value · Song Liu <song@kernel.org> · 2020-07-02
Re: [PATCH v5 00/16] md/raid5: set STRIPE_SIZE as a configurable value · Yufen Yu <hidden> · 2020-07-08
Re: [PATCH v5 00/16] md/raid5: set STRIPE_SIZE as a configurable value · Song Liu <song@kernel.org> · 2020-07-08
Re: [PATCH v5 00/16] md/raid5: set STRIPE_SIZE as a configurable value · Yufen Yu <hidden> · 2020-07-09
Re: [PATCH v5 00/16] md/raid5: set STRIPE_SIZE as a configurable value · Song Liu <song@kernel.org> · 2020-07-10

From: Song Liu <song@kernel.org>
Date: 2020-07-10 16:09:53

On Thu, Jul 9, 2020 at 6:27 AM Yufen Yu [off-list ref] wrote:



On 2020/7/9 7:55, Song Liu wrote:

quoted

On Wed, Jul 8, 2020 at 6:15 AM Yufen Yu [off-list ref] wrote:

quoted



On 2020/7/3 7:00, Song Liu wrote:

quoted

On Thu, Jul 2, 2020 at 5:05 AM Yufen Yu [off-list ref] wrote:

quoted

Hi, all

   For now, STRIPE_SIZE is equal to the value of PAGE_SIZE. That means, RAID5
   will issue each bio to disk at least 64KB when PAGE_SIZE is 64KB in arm64.
   However, filesystem usually issue bio in the unit of 4KB. Then, RAID5 may
   waste resource of disk bandwidth.

   To solve the problem, this patchset try to set stripe_size as a configuare
   value. The default value is 4096. We will add a new sysfs entry and set it
   by writing a new value, likely:

          echo 16384 > /sys/block/md1/md/stripe_size

Higher level question: do we need to support page size that is NOT 4kB
times power
of 2? Meaning, do we need to support 12kB, 20kB, 24kB, etc. If we only
supports, 4kB,
8kB, 16kB, 32kB, etc. some of the logic can be simpler.

Yeah, I think we just support 4kb, 8kb, 16kb, 32kb... is enough.
But Sorry that I don't know what logic can be simpler in current implementation.
I mean it also need to allocate page, and record page offset.

I was thinking about replacing multiplication/division with bit
operations (shift left/right).
But I am not very sure how much that matters in modern ARM CPUs. Would you mind
running some benchmarks with this?

To test multiplication/division and bit operation, I write a simple test case:

$ cat normal.c

int page_size = 65536;
int stripe_size = 32768; //32KB

int main(int argc, char *argv[])
{
         int i, j, count;
         int page, offset;

         if (argc != 2)
                 return -1;

         count = atol(argv[1]);

         for (i = 0; i < count; i++) {
                 for (j = 0; j < 4; j++) {
                         page = page_size / stripe_size;
                         offset = j * stripe_size;
                 }
         }
}

$ cat shift.c

int page_shift = 16; //64KB
int stripe_shift = 15; //32KB

int main(int argc, char *argv[])
{
         int i, j, count;
         int page, offset;

         if (argc != 2)
                 return -1;

         count = atol(argv[1]);

         for (i = 0; i < count; i++) {
                 for (j = 0; j < 4; j++) {
                         page = 1 << (page_shift - stripe_shift);
                         offset = j << stripe_shift;
                 }
         }
}

Test them on a arm64 server, the result show there is a minor
performance gap between multiplication/division and shift operation.

[root@localhost shift]# time ./normal 104857600

real    0m1.199s
user    0m1.198s
sys     0m0.000s

[root@localhost shift]# time ./shift 104857600

real    0m1.166s
user    0m1.166s
sys     0m0.000s

For our implementation, page address and page offset are just computed
when allocate stripe_size. After that, we just use the recorded value
in sh->dev[i].page and sh->dev[i].offset. So, I think current implementation
may not cause much overhead.

Sounds good. Let's keep this part as-is.

Thanks,
Song

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help