Re: [PATCHv8 00/32] THP-enabled tmpfs/shmem using compound pages

[PATCHv8 00/32] THP-enabled tmpfs/shmem using compound pages · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 02/32] mm: do not pass mm_struct into handle_mm_fault · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 03/32] mm: introduce fault_env · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 04/32] mm: postpone page table allocation until we have page to map · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 05/32] rmap: support file thp · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 06/32] mm: introduce do_set_pmd() · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 07/32] thp, vmstats: add counters for huge file pages · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 08/32] thp: support file pages in zap_huge_pmd() · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 09/32] thp: handle file pages in split_huge_pmd() · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 10/32] thp: handle file COW faults · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 11/32] thp: skip file huge pmd on copy_huge_pmd() · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 12/32] thp: prepare change_huge_pmd() for file thp · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 13/32] thp: run vma_adjust_trans_huge() outside i_mmap_rwsem · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 14/32] thp: file pages support for split_huge_page() · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 15/32] thp, mlock: do not mlock PTE-mapped file huge pages · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 16/32] vmscan: split file huge pages before paging them out · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 17/32] page-flags: relax policy for PG_mappedtodisk and PG_reclaim · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 18/32] radix-tree: implement radix_tree_maybe_preload_order() · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 19/32] filemap: prepare find and delete operations for huge pages · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 20/32] truncate: handle file thp · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 21/32] mm, rmap: account shmem thp pages · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 24/32] shmem: add huge pages support · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 26/32] thp: update Documentation/vm/transhuge.txt · Kirill A. Shutemov <hidden> · 2016-05-12
Re: [PATCHv8 26/32] thp: update Documentation/vm/transhuge.txt · Julien Grall <hidden> · 2016-05-19
Re: [PATCHv8 26/32] thp: update Documentation/vm/transhuge.txt · Kirill A. Shutemov <hidden> · 2016-05-20
[PATCHv8 27/32] thp: extract khugepaged from mm/huge_memory.c · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 28/32] khugepaged: move up_read(mmap_sem) out of khugepaged_alloc_page() · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 29/32] shmem: make shmem_inode_info::lock irq-safe · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 30/32] khugepaged: add support of collapse for tmpfs/shmem pages · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 31/32] thp: introduce CONFIG_TRANSPARENT_HUGE_PAGECACHE · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 32/32] shmem: split huge pages beyond i_size under memory pressure · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 23/32] shmem: get_unmapped_area align huge page · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 01/32] thp, mlock: update unevictable-lru.txt · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 22/32] shmem: prepare huge= mount option and sysfs knob · Kirill A. Shutemov <hidden> · 2016-05-12
[PATCHv8 25/32] shmem, thp: respect MADV_{NO,}HUGEPAGE for file mappings · Kirill A. Shutemov <hidden> · 2016-05-12
Re: [PATCHv8 00/32] THP-enabled tmpfs/shmem using compound pages · neha agarwal <hidden> · 2016-05-25
Re: [PATCHv8 00/32] THP-enabled tmpfs/shmem using compound pages · Kirill A. Shutemov <hidden> · 2016-05-25
Re: [PATCHv8 00/32] THP-enabled tmpfs/shmem using compound pages · neha agarwal <hidden> · 2016-05-25
Re: [PATCHv8 00/32] THP-enabled tmpfs/shmem using compound pages · Kirill A. Shutemov <hidden> · 2016-05-25
Re: [PATCHv8 00/32] THP-enabled tmpfs/shmem using compound pages · neha agarwal <hidden> · 2016-05-27
Re: [PATCHv8 00/32] THP-enabled tmpfs/shmem using compound pages · Kirill A. Shutemov <hidden> · 2016-06-06
Re: [PATCHv8 00/32] THP-enabled tmpfs/shmem using compound pages · neha agarwal <hidden> · 2016-06-08
Re: [PATCHv8 00/32] THP-enabled tmpfs/shmem using compound pages · Kirill A. Shutemov <hidden> · 2016-06-13

From: Kirill A. Shutemov <hidden>
Date: 2016-06-06 13:51:40
Also in: linux-fsdevel, lkml

On Wed, May 25, 2016 at 03:11:55PM -0400, neha agarwal wrote:

Hi All,

I have been testing Hugh's and Kirill's huge tmpfs patch sets with
Cassandra (NoSQL database). I am seeing significant performance gap between
these two implementations (~30%). Hugh's implementation performs better
than Kirill's implementation. I am surprised why I am seeing this
performance gap. Following is my test setup.

Patchsets
========
- For Hugh's:
I checked out 4.6-rc3, applied Hugh's preliminary patches (01 to 10
patches) from here: https://lkml.org/lkml/2016/4/5/792 and then applied the
THP patches posted on April 16 (01 to 29 patches).

- For Kirill's:
I am using his branch  "git://
git.kernel.org/pub/scm/linux/kernel/git/kas/linux.git hugetmpfs/v8", which
is based off of 4.6-rc3, posted on May 12.


Khugepaged settings
================
cd /sys/kernel/mm/transparent_hugepage
echo 10 >khugepaged/alloc_sleep_millisecs
echo 10 >khugepaged/scan_sleep_millisecs
echo 511 >khugepaged/max_ptes_none


Mount options
===========
- For Hugh's:
sudo sysctl -w vm/shmem_huge=2
sudo mount -o remount,huge=1 /hugetmpfs

- For Kirill's:
sudo mount -o remount,huge=always /hugetmpfs
echo force > /sys/kernel/mm/transparent_hugepage/shmem_enabled
echo 511 >khugepaged/max_ptes_swap


Workload Setting
=============
Please look at the attached setup document for Cassandra (NoSQL database):
cassandra-setup.txt


Machine setup
===========
36-core (72 hardware thread) dual-socket x86 server with 512 GB RAM running
Ubuntu. I use control groups for resource isolation. Server and client
threads run on different sockets. Frequency governor set to "performance"
to remove any performance fluctuations due to frequency variation.


Throughput numbers
================
Hugh's implementation: 74522.08 ops/sec
Kirill's implementation: 54919.10 ops/sec

In my setup I don't see the difference:

v4.7-rc1 + my implementation:
[OVERALL], RunTime(ms), 822862.0
[OVERALL], Throughput(ops/sec), 60763.53021527304
ShmemPmdMapped:  4999168 kB

v4.6-rc2 + Hugh's implementation:
[OVERALL], RunTime(ms), 833157.0
[OVERALL], Throughput(ops/sec), 60012.698687042175
ShmemPmdMapped:  5021696 kB

It's basically within measuarment error. 'ShmemPmdMapped' indicate how
much memory is mapped with huge pages by the end of test.

It's on dual-socket 24-core machine with 64G of RAM.

I guess we have some configuration difference or something, but so far I
don't see the drastic performance difference you've pointed to.

May be my implementation behaves slower on bigger machines, I don't know..
There's no architectural reason for this.

I'll post my updated patchset today.

-- 
 Kirill A. Shutemov

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help