Re: [RFC PATCH kernel 0/5] powerpc/P9/vfio: Pass through NVIDIA Tesla V100

From: Alex Williamson <hidden>
Date: 2018-06-08 00:34:21
Also in: kvm

On Fri, 08 Jun 2018 09:20:30 +1000
Benjamin Herrenschmidt [off-list ref] wrote:

On Thu, 2018-06-07 at 16:15 -0600, Alex Williamson wrote:

On Fri, 08 Jun 2018 07:54:02 +1000
Benjamin Herrenschmidt [off-list ref] wrote:

On Thu, 2018-06-07 at 11:04 -0600, Alex Williamson wrote:

Can we back up and discuss whether the IOMMU grouping of NVLink
connected devices makes sense?  AIUI we have a PCI view of these
devices and from that perspective they're isolated.  That's the view of
the device used to generate the grouping.  However, not visible to us,
these devices are interconnected via NVLink.  What isolation properties
does NVLink provide given that its entire purpose for existing seems to
be to provide a high performance link for p2p between devices?

Not entire. On POWER chips, we also have an nvlink between the device
and the CPU which is running significantly faster than PCIe.

But yes, there are cross-links and those should probably be accounted
for in the grouping.

Then after we fix the grouping, can we just let the host driver manage
this coherent memory range and expose vGPUs to guests?  The use case of
assigning all 6 GPUs to one VM seems pretty limited.  (Might need to
convince NVIDIA to support more than a single vGPU per VM though)
Thanks,

I don't know about "vGPUs" and what nVidia may be cooking in that area.

The patches from Alexey allow for passing through the full thing, but
they aren't trivial (there are additional issues, I'm not sure how
covered they are, as we need to play with the mapping attributes of
portions of the GPU memory on the host side...).

Note: The cross-links are only per-socket so that would be 2 groups of
3.

We *can* allow individual GPUs to be passed through, either if somebody
designs a system without cross links, or if the user is ok with the
security risk as the guest driver will not enable them if it doesn't
"find" both sides of them.

If GPUs are not isolated and we cannot prevent them from probing each
other via these links, then I think we have an obligation to configure
grouping in a way that doesn't rely on a benevolent userspace.  Thanks,

Alex
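
To illustrate the grouping argument in this thread, here is a minimal sketch. It is not code from the patch series and does not use any kernel API. It assumes a hypothetical topology of 6 GPUs, 3 per socket, with NVLink cross-links only between GPUs on the same socket, and it treats any two devices that can reach each other over a link as belonging to the same group. The names `merge_groups`, `gpus`, and `nvlink_peers` are made up for the example.

```python
def merge_groups(devices, links):
    """Union-find: devices joined by any link end up in the same group."""
    parent = {d: d for d in devices}

    def find(d):
        # Walk up to the group representative, halving the path as we go.
        while parent[d] != d:
            parent[d] = parent[parent[d]]
            d = parent[d]
        return d

    for a, b in links:
        parent[find(a)] = find(b)

    groups = {}
    for d in devices:
        groups.setdefault(find(d), []).append(d)
    return sorted(groups.values())


# Hypothetical topology (an assumption, not taken from real hardware data):
# 6 GPUs, 3 per socket, cross-linked within a socket only.
gpus = ["gpu0", "gpu1", "gpu2", "gpu3", "gpu4", "gpu5"]
nvlink_peers = [
    ("gpu0", "gpu1"), ("gpu1", "gpu2"), ("gpu0", "gpu2"),  # socket 0
    ("gpu3", "gpu4"), ("gpu4", "gpu5"), ("gpu3", "gpu5"),  # socket 1
]

pci_only = merge_groups(gpus, [])               # PCI view: no links known
with_nvlink = merge_groups(gpus, nvlink_peers)  # cross-links accounted for
```

With no links the six GPUs fall into six separate groups, which matches the PCI-only view described above. Once the per-socket cross-links are accounted for, they collapse into two groups of three.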
