[PATCH v2 01/40] iommu: Introduce Shared Virtual Addressing API


From: christian.koenig@amd.com (Christian König)
Date: 2018-09-12 12:56:21
Also in: kvm, linux-acpi, linux-devicetree, linux-iommu, linux-mm, linux-pci

On 12.09.2018 at 14:40, Jean-Philippe Brucker wrote:

On 08/09/2018 08:29, Christian König wrote:

quoted

Yes, exactly. I just need a PASID which is never used by the OS for a
process and we can easily give that back when the last FD reference is
closed.

Alright, iommu-sva can get its PASID from this external allocator as
well, as long as it has an interface similar to idr. Where would it go,
drivers/base/, mm/, kernel/...?

Good question, my initial instinct was to put it under drivers/pci.

But AFAIKS now you are supporting SVA implementations which are not 
based on PCI.

So drivers/base sounds like a good place to me.
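To make the "interface similar to idr" idea concrete, here is a minimal userspace sketch of a shared PASID allocator. Everything in it is hypothetical (the names pasid_allocator, pasid_alloc and pasid_free, the owner tag, the tiny PASID_MAX range); it is not the interface proposed in this series, only an illustration of a driver reserving a PASID that nobody else can hand out until the driver gives it back.

```c
#include <stdbool.h>
#include <stddef.h>

#define PASID_MAX     16    /* tiny range for illustration; PCIe PASIDs are up to 20 bits */
#define PASID_INVALID (-1)

/* Hypothetical shared allocator: one slot per PASID, tagged with its owner. */
struct pasid_allocator {
    const void *owner[PASID_MAX];   /* NULL means the PASID is free */
};

/* Allocate the lowest free PASID in [min, max) for 'owner'. */
static int pasid_alloc(struct pasid_allocator *a, const void *owner,
                       int min, int max)
{
    if (!owner || min < 0 || max > PASID_MAX)
        return PASID_INVALID;

    for (int p = min; p < max; p++) {
        if (!a->owner[p]) {
            a->owner[p] = owner;
            return p;
        }
    }
    return PASID_INVALID;
}

/* Release a PASID. Only the owner that allocated it may free it. */
static bool pasid_free(struct pasid_allocator *a, const void *owner, int pasid)
{
    if (pasid < 0 || pasid >= PASID_MAX || a->owner[pasid] != owner)
        return false;

    a->owner[pasid] = NULL;
    return true;
}
```

In this sketch a GPU driver and iommu-sva would both call pasid_alloc() on the same allocator, so a PASID reserved by the driver is never handed to a process until the driver calls pasid_free(), for example when the last FD reference is closed.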

quoted

The process dies, iommu-sva is notified and calls the mm_exit()
function passed by the device driver to iommu_sva_device_init(). In
mm_exit() the device driver needs to clear any reference to the
PASID in hardware and in its own structures. When the device driver
returns from mm_exit(), it effectively tells the core that it has
finished using the PASID, and iommu-sva can reuse the PASID for
another process. mm_exit() is allowed to block, so the device
driver has time to clean up and flush the queues.

If the device driver finishes using the PASID before the process
exits, it just calls unbind().
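The two paths described above (process exit with mm_exit(), or an early unbind()) can be modelled roughly as follows. This is a userspace sketch under stated assumptions, not kernel code: struct sva_bond, sva_bind(), sva_process_exit(), sva_unbind() and demo_mm_exit() are hypothetical names, and the real mm_exit() callback and iommu_sva_device_init() have different signatures.

```c
#include <stdbool.h>
#include <stddef.h>

struct sva_bond;
typedef void (*mm_exit_fn)(struct sva_bond *bond);

/* Hypothetical model of one PASID <-> process binding. */
struct sva_bond {
    int pasid;
    bool in_use;          /* the PASID must not be reallocated while true */
    bool hw_programmed;   /* the driver still has the PASID in hardware */
    mm_exit_fn mm_exit;   /* callback supplied by the device driver */
};

/* Driver callback: stop DMA and drop every reference to the PASID.
 * In the design under discussion this is allowed to block. */
static void demo_mm_exit(struct sva_bond *bond)
{
    bond->hw_programmed = false;
}

static void sva_bind(struct sva_bond *bond, int pasid, mm_exit_fn cb)
{
    bond->pasid = pasid;
    bond->in_use = true;
    bond->hw_programmed = true;
    bond->mm_exit = cb;
}

/* Process exit: the core calls the driver; once the callback returns,
 * the PASID is considered free for another process. */
static void sva_process_exit(struct sva_bond *bond)
{
    if (!bond->in_use)
        return;
    if (bond->mm_exit)
        bond->mm_exit(bond);
    bond->in_use = false;
}

/* The driver finished with the PASID before the process exited. */
static void sva_unbind(struct sva_bond *bond)
{
    bond->hw_programmed = false;
    bond->in_use = false;
}
```

The point the sketch tries to capture is that returning from the callback is the only signal the core gets: once demo_mm_exit() returns, in_use is cleared and the PASID can be reused.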

That is exactly what Michal Hocko is probably not going to like at all.

Can we have a different approach where each driver is informed by the
mm_exit(), but needs to explicitly call unbind() before a PASID is
reused?

It's awful from the IOMMU driver perspective. In addition to "enabled"
and "disabled" PASID states, you add "disabled but DMA still running
normally". Between that new state and "disabled", the IOMMU will be
flooded by translation faults (non-recoverable ones), which it needs to
ignore instead of reporting to the kernel. Not all IOMMUs can deal with
this in hardware (SMMU and VT-d can quiesce translation faults
per-PASID, but I don't think AMD IOMMU can.) Some drivers will have to
filter fault events themselves, depending on the PASID state.
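As a rough illustration of the extra state, the software filtering that some drivers would need could look like the sketch below. The enum and function names are hypothetical, and the choice to report faults in the fully disabled state is an assumption made for the example, not something stated in the thread.

```c
#include <stdbool.h>

/* Hypothetical PASID states, including the extra one discussed above. */
enum pasid_state {
    PASID_ENABLED,    /* bound to a live mm; faults are handled normally */
    PASID_DRAINING,   /* "disabled but DMA still running normally" */
    PASID_DISABLED,   /* driver has called unbind(); no DMA expected */
};

/* Decide whether a translation fault on a PASID is reported to the kernel.
 * While draining, the expected flood of non-recoverable faults is dropped. */
static bool should_report_fault(enum pasid_state state)
{
    switch (state) {
    case PASID_DRAINING:
        return false;   /* expected noise until the driver unbinds */
    case PASID_ENABLED:
    case PASID_DISABLED:
        return true;    /* assumption: a fault here is worth reporting */
    }
    return true;
}
```

An IOMMU that can quiesce faults per PASID in hardware would not need this check; the sketch only covers the case where the driver has to filter fault events itself.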

Puh, yeah that is probably true.

Ok, let us skip that for a moment. We just need to invest more work in 
killing DMA operations quickly when the process address space is torn 
down.

quoted

During that teardown transition it would be ideal if that PASID only
points to a dummy root page directory with only invalid entries.

I guess this can be vendor specific. In VT-d I plan to mark the PASID
entry not present and disable fault reporting while draining the
remaining activity.

Sounds good to me.

Point is at least in the case where the process was killed by the OOM
killer we should not block in mm_exit().

Instead, operations issued by the process to a device driver which uses
SVA need to be terminated as soon as possible to make sure that the OOM
killer can advance.

I don't see how we're preventing the OOM killer from advancing, so I'm
looking for a stronger argument that justifies adding this complexity to
IOMMU drivers. Time limit of the release MMU notifier, locking
requirement, a concrete example where things break, even a comment
somewhere in mm/ would do...

In my tests I can't manage to disturb the OOM killer, but I could be
missing some special case. Even if the mm_exit() callback
(unrealistically) sleeps for 60 seconds,

Well, you are *COMPLETELY* underestimating this. A compute task with a 
huge wave launch can take multiple minutes to tear down.

That's why I'm so concerned about that, but to be honest I think that 
just the hardware needs to become better and we need to be able to block 
dead tasks from spawning threads again.

Regards,
Christian.

  the OOM killer isn't affected
because oom_reap_task_mm() wipes the victim's address space in another
thread, either before or while the release notifier is running.

Thanks,
Jean
