Re: [PATCH v6 6/8] KVM: Handle page fault for private memory

[PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Chao Peng <hidden> · 2022-05-19
[PATCH v6 7/8] KVM: Enable and expose KVM_MEM_PRIVATE · Chao Peng <hidden> · 2022-05-19
Re: [PATCH v6 7/8] KVM: Enable and expose KVM_MEM_PRIVATE · Michael Roth <hidden> · 2022-06-23
Re: [PATCH v6 7/8] KVM: Enable and expose KVM_MEM_PRIVATE · Chao Peng <hidden> · 2022-06-24
[PATCH v6 1/8] mm: Introduce memfile_notifier · Chao Peng <hidden> · 2022-05-19
[PATCH v6 3/8] mm/memfd: Introduce MFD_INACCESSIBLE flag · Chao Peng <hidden> · 2022-05-19
Re: [PATCH v6 3/8] mm/memfd: Introduce MFD_INACCESSIBLE flag · Vishal Annapurve <hidden> · 2022-05-31
Re: [PATCH v6 3/8] mm/memfd: Introduce MFD_INACCESSIBLE flag · Chao Peng <hidden> · 2022-06-01
Re: [PATCH v6 3/8] mm/memfd: Introduce MFD_INACCESSIBLE flag · Gupta, Pankaj <hidden> · 2022-06-01
Re: [PATCH v6 3/8] mm/memfd: Introduce MFD_INACCESSIBLE flag · Chao Peng <hidden> · 2022-06-02
Re: [PATCH v6 3/8] mm/memfd: Introduce MFD_INACCESSIBLE flag · Sean Christopherson <seanjc@google.com> · 2022-06-14
Re: [PATCH v6 3/8] mm/memfd: Introduce MFD_INACCESSIBLE flag · Chao Peng <hidden> · 2022-06-15
[PATCH v6 2/8] mm/shmem: Support memfile_notifier · Chao Peng <hidden> · 2022-05-19
[PATCH v6 5/8] KVM: Add KVM_EXIT_MEMORY_FAULT exit · Chao Peng <hidden> · 2022-05-19
[PATCH v6 6/8] KVM: Handle page fault for private memory · Chao Peng <hidden> · 2022-05-19
Re: [PATCH v6 6/8] KVM: Handle page fault for private memory · Sean Christopherson <seanjc@google.com> · 2022-06-17
Re: [PATCH v6 6/8] KVM: Handle page fault for private memory · Chao Peng <hidden> · 2022-06-20
Re: [PATCH v6 6/8] KVM: Handle page fault for private memory · Kirill A. Shutemov <hidden> · 2022-08-19
Re: [PATCH v6 6/8] KVM: Handle page fault for private memory · Sean Christopherson <seanjc@google.com> · 2022-08-25
Re: [PATCH v6 6/8] KVM: Handle page fault for private memory · Nikunj A. Dadhania <hidden> · 2022-06-24
Re: [PATCH v6 6/8] KVM: Handle page fault for private memory · Chao Peng <hidden> · 2022-06-24
Re: [PATCH v6 6/8] KVM: Handle page fault for private memory · Vishal Annapurve <hidden> · 2022-06-30
Re: [PATCH v6 6/8] KVM: Handle page fault for private memory · Michael Roth <hidden> · 2022-06-30
Re: [PATCH v6 6/8] KVM: Handle page fault for private memory · Xiaoyao Li <hidden> · 2022-07-01
Re: [PATCH v6 6/8] KVM: Handle page fault for private memory · Sean Christopherson <seanjc@google.com> · 2022-07-07
Re: [PATCH v6 6/8] KVM: Handle page fault for private memory · Xiaoyao Li <hidden> · 2022-07-08
Re: [PATCH v6 6/8] KVM: Handle page fault for private memory · Vishal Annapurve <hidden> · 2022-07-20
Re: [PATCH v6 6/8] KVM: Handle page fault for private memory · Chao Peng <hidden> · 2022-07-21
[PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Chao Peng <hidden> · 2022-05-19
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Andy Lutomirski <luto@kernel.org> · 2022-05-20
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Sean Christopherson <seanjc@google.com> · 2022-05-20
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · "Andy Lutomirski" <luto@kernel.org> · 2022-05-22
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Chao Peng <hidden> · 2022-05-23
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Sean Christopherson <seanjc@google.com> · 2022-05-23
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Chao Peng <hidden> · 2022-05-30
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Sean Christopherson <seanjc@google.com> · 2022-06-10
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Chao Peng <hidden> · 2022-06-14
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Michael Roth <hidden> · 2022-06-23
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Chao Peng <hidden> · 2022-06-24
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Michael Roth <hidden> · 2022-06-24
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Sean Christopherson <seanjc@google.com> · 2022-06-17
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Sean Christopherson <seanjc@google.com> · 2022-06-17
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Chao Peng <hidden> · 2022-06-20
Re: [PATCH v6 4/8] KVM: Extend the memslot to support fd-based private memory · Chao Peng <hidden> · 2022-06-20
[PATCH v6 8/8] memfd_create.2: Describe MFD_INACCESSIBLE flag · Chao Peng <hidden> · 2022-05-19
Re: [PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Vishal Annapurve <hidden> · 2022-06-06
Re: [PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Chao Peng <hidden> · 2022-06-07
Re: [PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Marc Orr <hidden> · 2022-06-08
Re: [PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Chao Peng <hidden> · 2022-06-08
Re: [PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Vishal Annapurve <hidden> · 2022-06-08
Re: [PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Sean Christopherson <seanjc@google.com> · 2022-06-09
Re: [PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Chao Peng <hidden> · 2022-06-14
Re: [PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Andy Lutomirski <luto@kernel.org> · 2022-06-14
Re: [PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Sean Christopherson <seanjc@google.com> · 2022-06-14
Re: [PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Andy Lutomirski <luto@kernel.org> · 2022-06-14
Re: [PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Chao Peng <hidden> · 2022-06-15
Re: [PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Sean Christopherson <seanjc@google.com> · 2022-06-15
Re: [PATCH v6 0/8] KVM: mm: fd-based approach for supporting KVM guest private memory · Marc Orr <hidden> · 2022-06-10

From: Xiaoyao Li <hidden>
Date: 2022-07-08 03:30:07
Also in: kvm, linux-doc, linux-fsdevel, linux-mm, lkml, qemu-devel

On 7/8/2022 4:08 AM, Sean Christopherson wrote:

On Fri, Jul 01, 2022, Xiaoyao Li wrote:

quoted

On 7/1/2022 6:21 AM, Michael Roth wrote:

quoted

On Thu, Jun 30, 2022 at 12:14:13PM -0700, Vishal Annapurve wrote:

quoted

With transparent_hugepages=always setting I see issues with the
current implementation.

...

quoted

Looks like with transparent huge pages enabled kvm tried to handle the
shared memory fault on 0x84d gfn by coalescing nearby 4K pages
to form a contiguous 2MB page mapping at gfn 0x800, since level 2 was
requested in kvm_mmu_spte_requested.
This caused the private memory contents from regions 0x800-0x84c and
0x86e-0xa00 to get unmapped from the guest leading to guest vm
shutdown.

Interesting... seems like that wouldn't be an issue for non-UPM SEV, since
the private pages would still be mapped as part of that 2M mapping, and
it's completely up to the guest as to whether it wants to access as
private or shared. But for UPM it makes sense this would cause issues.

quoted

Does getting the mapping level as per the fault access type help
address the above issue? Any such coalescing should not cross between
private to
shared or shared to private memory regions.

Doesn't seem like changing the check to fault->is_private would help in
your particular case, since the subsequent host_pfn_mapping_level() call
only seems to limit the mapping level to whatever the mapping level is
for the HVA in the host page table.

Seems like with UPM we need some additional handling here that also
checks that the entire 2M HVA range is backed by non-private memory.

Non-UPM SNP hypervisor patches already have a similar hook added to
host_pfn_mapping_level() which implements such a check via RMP table, so
UPM might need something similar:

    https://github.com/AMDESE/linux/commit/ae4475bc740eb0b9d031a76412b0117339794139

-Mike

For TDX, we try to track the page type (shared, private, mixed) of each gfn
at given level. Only when the type is shared/private, can it be mapped at
that level. When it's mixed, i.e., it contains both shared pages and private
pages at given level, it has to go to next smaller level.

https://github.com/intel/tdx/commit/ed97f4042eb69a210d9e972ccca6a84234028cad

Hmm, so a new slot->arch.page_attr array shouldn't be necessary, KVM can instead
update slot->arch.lpage_info on shared<->private conversions.  Detecting whether
a given range is partially mapped could get nasty if KVM defers tracking to the
backing store, but if KVM itself does the tracking as was previously suggested[*],
then updating lpage_info should be relatively straightfoward, e.g. use
xa_for_each_range() to see if a given 2mb/1gb range is completely covered (fully
shared) or not covered at all (fully private).

[*] https://lore.kernel.org/all/YofeZps9YXgtP3f1@google.com (local)

Yes, slot->arch.page_attr was introduced to help identify whether a page 
is completely shared/private at given level. It seems XARRAY can serve 
the same purpose, though I know nothing about it. Looking forward to 
seeing the patch of using XARRAY.

yes, update slot->arch.lpage_info is good to utilize the existing logic 
and Isaku has applied it to slot->arch.lpage_info for 2MB support patches.

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help