Thread (74 messages) 74 messages, 12 authors, 2003-07-04

Re: What to expect with the 2.6 VM

From: Andrea Arcangeli <hidden>
Date: 2003-07-03 19:13:27
Also in: lkml

On Thu, Jul 03, 2003 at 11:53:41AM -0700, William Lee Irwin III wrote:
On Thu, 3 Jul 2003, Andrea Arcangeli wrote:
quoted
quoted
even if you don't use largepages as you should, the ram cost of the pte
is nothing on 64bit archs, all you care about is to use all the mhz and
tlb entries of the cpu.
On Thu, Jul 03, 2003 at 09:06:32AM -0400, Rik van Riel wrote:
quoted
That depends on the number of Oracle processes you have.
Say that page tables need 0.1% of the space of the virtual
space they map.  With 1000 Oracle users you'd end up needing
as much memory in page tables as your shm segment is large.
Of course, in this situation either the application should
use large pages or the kernel should simply reclaim the
page tables (possible while holding the mmap_sem for write).
No, it is not true that pagetable space can be wantonly wasted
on 64-bit.

Try mmap()'ing something sufficiently huge and accessing on average
every PAGE_SIZE'th virtual page, in a single-threaded single process.
e.g. various indexing schemes might do this. This is 1 pagetable page
per page of data (worse if shared), which blows major goats.
that's the very old exploit that touches 1 page per pmd.
There's a reason why those things use inverted pagetables... at any
rate, compacting virtualspace with remap_file_pages() solves it too.

Large pages won't help, since the data isn't contiguous.
if you can't use a sane design it's not a kernel issue.  this is bad
userspace code seeking like crazy on disk too, working around it with a
kernel feature sounds worthless. If algorithms have no locality at all,
and they spread 1 page per pmd that's their problem.

the easiest way to waste ram with bad code is to add this in the first
line of the main of a program:

	p = malloc(1G)
	bzero(p, 1G)

you don't need 1 page per pmd to waste ram. Should we also write a
kernel feature that checks if the page is zero and drops it so the above
won't swap etc..?

If you can come up with a real life example where the 1 page per pmd
scattered over 1T of address space (we're talking about the file here of
course, the on disk representation of the data) is the very best design
possible ever (without any concept of locality at all) and it speeds up
things of orders of magnitude not to have any locality at all,
especially for the huge seeking it will generate no matter what the
pagetable side is, you will then change my mind about it.

Andrea
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"aart@kvack.org"> aart@kvack.org </a>
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help