Re: IXGBE, IOMMU DMAR DRHD handling fault issue
From: Burakov, Anatoly <hidden>
Date: 2018-01-31 09:59:29
On 29-Jan-18 10:35 PM, Ravi Kerur wrote:
Hi Burakov, When using vfio-pci on host both VF and PF interfaces works fine with dpdk i.e. I don't see DMAR fault messages anymore. However, when I attach a VF interface to a VM and start DPDK with vfio-pci inside VM I still see DMAR fault messages on host. Both host and VM are booted with 'intel-iommu=on' on GRUB. Ping from VM with DPDK/vfio-pci doesn't work (I think it's expected because of DMAR faults), however, when VF interface uses ixgbevf driver ping works. Following are some details /*****************On VM***************/ dpdk-devbind -s Network devices using DPDK-compatible driver ============================================ 0000:00:07.0 '82599 Ethernet Controller Virtual Function' drv=vfio-pci unused=ixgbevf Network devices using kernel driver =================================== 0000:03:00.0 'Device 1041' if=eth0 drv=virtio-pci unused=vfio-pci *Active* 0000:04:00.0 'Device 1041' if=eth1 drv=virtio-pci unused=vfio-pci 0000:05:00.0 'Device 1041' if=eth2 drv=virtio-pci unused=vfio-pci Other network devices ===================== <none> Crypto devices using DPDK-compatible driver =========================================== <none> Crypto devices using kernel driver ================================== <none> Other crypto devices ==================== <none> 00:07.0 Ethernet controller: Intel Corporation 82599 Ethernet Controller Virtual Function (rev 01) Subsystem: Intel Corporation 82599 Ethernet Controller Virtual Function Control: I/O- Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Region 0: Memory at fda00000 (64-bit, prefetchable) [size=16K] Region 3: Memory at fda04000 (64-bit, prefetchable) [size=16K] Capabilities: [70] MSI-X: Enable+ Count=3 Masked- Vector table: BAR=3 offset=00000000 PBA: BAR=3 offset=00002000 Capabilities: [a0] Express (v1) Root Complex Integrated Endpoint, MSI 00 DevCap: MaxPayload 128 bytes, PhantFunc 0 ExtTag- RBE- DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported- RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop- MaxPayload 128 bytes, MaxReadReq 128 bytes DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend- Capabilities: [100 v1] Advanced Error Reporting UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol- UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol- UESvrt: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol- CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr- CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr- AERCap: First Error Pointer: 00, GenCap- CGenEn- ChkCap- ChkEn- Kernel driver in use: vfio-pci Kernel modules: ixgbevf /***************on Host*************/ dmesg | grep DMAR ... [ 978.268143] DMAR: DRHD: handling fault status reg 2 [ 978.268147] DMAR: [DMA Read] *Request device [04:10.0]* fault addr 33a128000 [fault reason 06] PTE Read access is not set [ 1286.677726] DMAR: DRHD: handling fault status reg 102 [ 1286.677730] DMAR: [DMA Read] Request device [04:10.0] fault addr fb663000 [fault reason 06] PTE Read access is not set [ 1676.436145] DMAR: DRHD: handling fault status reg 202 [ 1676.436149] DMAR: [DMA Read] Request device [04:10.0] fault addr 33a128000 [fault reason 06] PTE Read access is not set [ 1734.433649] DMAR: DRHD: handling fault status reg 302 [ 1734.433652] DMAR: [DMA Read] Request device [04:10.0] fault addr 33a128000 [fault reason 06] PTE Read access is not set [ 2324.428938] DMAR: DRHD: handling fault status reg 402 [ 2324.428942] DMAR: [DMA Read] Request device [04:10.0] fault addr 7770c000 [fault reason 06] PTE Read access is not set [ 2388.553640] DMAR: DRHD: handling fault status reg 502 [ 2388.553643] DMAR: [DMA Read] *Request device [04:10.0]* fault addr 33a128000 [fault reason 06] PTE Read access is not set VM is started with qemu-system-x86_64 -enable-kvm -M q35,accel=kvm,kernel-irqchip=split -object iothread,id=iothread0 -device intel-iommu,intremap=on,device-iotlb=on,caching-mode=on -cpu host -daemonize -m 16G -smp 14 -uuid 0fc91c66-f0b1-11e7-acf4-525400123456 -name 212748-sriov-ravi-smac-alpha-SMAC10 -device ioh3420,id=root.1,chassis=1 -device ioh3420,id=root.2,chassis=2 -netdev tap,vhost=on,queues=2,ifname=vn-vn2_1_,downscript=no,id=vn-vn2_1_,script=no -device ioh3420,id=root.3,chassis=3 -device virtio-net-pci,netdev=vn-vn2_1_,bus=root.3,ats=on,mq=on,vectors=6,mac=DE:AD:02:88:10:37,id=vn-vn2_1__dev -netdev tap,vhost=on,queues=2,ifname=vn-vn92_1_,downscript=no,id=vn-vn92_1_,script=no -device ioh3420,id=root.4,chassis=4 -device virtio-net-pci,mac=DE:AD:02:88:10:38,netdev=vn-vn92_1_,bus=root.4,ats=on,mq=on,vectors=6,id=vn-vn92_1__dev -netdev tap,vhost=on,queues=2,ifname=vn-vn93_1_,downscript=no,id=vn-vn93_1_,script=no -device ioh3420,id=root.5,chassis=5 -device virtio-net-pci,mac=DE:AD:02:88:10:39,netdev=vn-vn93_1_,bus=root.5,ats=on,mq=on,vectors=6,id=vn-vn93_1__dev -vnc :16,websocket=15916 -qmp tcp:127.0.0.1:12001 <http://127.0.0.1:12001>,server,nowait -chardev socket,id=charmonitor,path=/tmp/mon.12001,server,nowait -mon chardev=charmonitor,id=monitor -cdrom /var/venom/cloud_init/0fc91c66-f0b1-11e7-acf4-525400123456.iso -*device vfio-pci,host=04:10.0* -drive file=/var/venom/instance_repo/test.img,if=none,id=drive-virtio-disk0,format=raw,aio=native,cache=none -balloon none -device virtio-blk-pci,scsi=off,iothread=iothread0,drive=drive-virtio-disk0,id=virtio-disk0,bus=root.1,ats=on,bootindex=1 Thanks. On Thu, Jan 25, 2018 at 2:49 AM, Burakov, Anatoly <anatoly.burakov@intel.com <mailto:anatoly.burakov@intel.com>> wrote: On 24-Jan-18 7:13 PM, Ravi Kerur wrote: Hi Burakov, Thank you. I will try with vfio-pci driver. I am assuming it will work for both PF and VF interfaces since I am using both in my setup? Thanks. Yes, it should work for both PF and VF devices. On Wed, Jan 24, 2018 at 2:31 AM, Burakov, Anatoly <anatoly.burakov@intel.com <mailto:anatoly.burakov@intel.com> <mailto:anatoly.burakov@intel.com <mailto:anatoly.burakov@intel.com>>> wrote: On 23-Jan-18 5:25 PM, Ravi Kerur wrote: Hi, I am running into an issue when DPDK is started with iommu on via GRUB command. Problem is not seen with regular kernel driver, error messages show when DPDK is started and happens for both PF and VF interfaces. I am using DPDK 17.05 so the patch proposed in the following link is available http://dpdk.org/ml/archives/dev/2017-February/057048.html <http://dpdk.org/ml/archives/dev/2017-February/057048.html> <http://dpdk.org/ml/archives/dev/2017-February/057048.html <http://dpdk.org/ml/archives/dev/2017-February/057048.html>> Workaround is to use "iommu=pt" but I want iommu enabled in my setup. I checked BIOS for reserved memory(DMA RMRR for IXGBE) didn't get any details on it. Kindly let me know how to resolve this issue. Following are the details (1) Linux kernel 4.9 (2) DPDK 17.05 (3) IXGBE details ethtool -i enp4s0f0 (PF driver) driver: ixgbe version: 5.3.3 firmware-version: 0x800007b8, 1.1018.0 bus-info: 0000:04:00.0 supports-statistics: yes supports-test: yes supports-eeprom-access: yes supports-register-dump: yes supports-priv-flags: yes ethtool -i enp4s16f2 (VF driver) driver: ixgbevf version: 4.3.2 firmware-version: bus-info: 0000:04:10.2 supports-statistics: yes supports-test: yes supports-eeprom-access: no supports-register-dump: yes supports-priv-flags: no Bus info Device Class Description ========================================================= pci@0000:01:00.0 ens11f0 network 82599ES 10-Gigabit SFI/SFP+ Network Connection pci@0000:01:00.1 ens11f1 network 82599ES 10-Gigabit SFI/SFP+ Network Connection pci@0000:04:00.0 enp4s0f0 network 82599ES 10-Gigabit SFI/SFP+ Network Connection pci@0000:04:00.1 enp4s0f1 network 82599ES 10-Gigabit SFI/SFP+ Network Connection pci@0000:04:10.0 enp4s16 network Illegal Vendor ID pci@0000:04:10.2 enp4s16f2 network Illegal Vendor ID (4) DPDK bind interfaces # dpdk-devbind -s Network devices using DPDK-compatible driver ============================================ 0000:01:00.0 '82599ES 10-Gigabit SFI/SFP+ Network Connection 10fb' drv=igb_uio unused=vfio-pci 0000:04:10.2 '82599 Ethernet Controller Virtual Function 10ed' drv=igb_uio unused=vfio-pci Network devices using kernel driver =================================== 0000:01:00.1 '82599ES 10-Gigabit SFI/SFP+ Network Connection 10fb' if=ens11f1 drv=ixgbe unused=igb_uio,vfio-pci 0000:04:00.0 '82599ES 10-Gigabit SFI/SFP+ Network Connection 10fb' if=enp4s0f0 drv=ixgbe unused=igb_uio,vfio-pci 0000:04:00.1 '82599ES 10-Gigabit SFI/SFP+ Network Connection 10fb' if=enp4s0f1 drv=ixgbe unused=igb_uio,vfio-pci 0000:04:10.0 '82599 Ethernet Controller Virtual Function 10ed' if=enp4s16 drv=ixgbevf unused=igb_uio,vfio-pci 0000:06:00.0 'I210 Gigabit Network Connection 1533' if=eno1 drv=igb unused=igb_uio,vfio-pci *Active* Other Network devices ===================== <none> ... (5) Kernel dmesg # dmesg | grep -e DMAR [ 0.000000] ACPI: DMAR 0x000000007999BAD0 0000E0 (v01 ALASKA A M I 00000001 INTL 20091013) [ 0.000000] DMAR: IOMMU enabled [ 0.518747] DMAR: Host address width 46 [ 0.526616] DMAR: DRHD base: 0x000000fbffc000 flags: 0x0 [ 0.537447] DMAR: dmar0: reg_base_addr fbffc000 ver 1:0 cap d2078c106f0466 ecap f020df [ 0.553620] DMAR: DRHD base: 0x000000c7ffc000 flags: 0x1 [ 0.564445] DMAR: dmar1: reg_base_addr c7ffc000 ver 1:0 cap d2078c106f0466 ecap f020df [ 0.580611] DMAR: RMRR base: 0x0000007bbc6000 end: 0x0000007bbd4fff [ 0.593344] DMAR: ATSR flags: 0x0 [ 0.600178] DMAR: RHSA base: 0x000000c7ffc000 proximity domain: 0x0 [ 0.612905] DMAR: RHSA base: 0x000000fbffc000 proximity domain: 0x1 [ 0.625632] DMAR-IR: IOAPIC id 3 under DRHD base 0xfbffc000 IOMMU 0 [ 0.638522] DMAR-IR: IOAPIC id 1 under DRHD base 0xc7ffc000 IOMMU 1 [ 0.651426] DMAR-IR: IOAPIC id 2 under DRHD base 0xc7ffc000 IOMMU 1 [ 0.664324] DMAR-IR: HPET id 0 under DRHD base 0xc7ffc000 [ 0.675326] DMAR-IR: Queued invalidation will be enabled to support x2apic and Intr-remapping. [ 0.693805] DMAR-IR: Enabled IRQ remapping in x2apic mode [ 9.395170] DMAR: dmar1: Using Queued invalidation [ 9.405011] DMAR: Setting RMRR: [ 9.412006] DMAR: Setting identity map for device 0000:00:1d.0 [0x7bbc6000 - 0x7bbd4fff] [ 9.428569] DMAR: Prepare 0-16MiB unity mapping for LPC [ 9.439712] DMAR: Setting identity map for device 0000:00:1f.0 [0x0 - 0xffffff] [ 9.454684] DMAR: Intel(R) Virtualization Technology for Directed I/O [ 287.023068] DMAR: DRHD: handling fault status reg 2 [ 287.023073] DMAR: [DMA Read] Request device [01:00.0] fault addr 18a260a000 [fault reason 06] PTE Read access is not set [ 287.023180] DMAR: DRHD: handling fault status reg 102 [ 287.023183] DMAR: [DMA Read] Request device [01:00.0] fault addr 18a3010000 [fault reason 06] PTE Read access is not set [ 287.038250] DMAR: DRHD: handling fault status reg 202 [ 287.038252] DMAR: [DMA Read] Request device [01:00.0] fault addr 18a3010000 [fault reason 06] PTE Read access is not set [ 288.170165] DMAR: DRHD: handling fault status reg 302 [ 288.170170] DMAR: [DMA Read] Request device [01:00.0] fault addr 1890754000 [fault reason 06] PTE Read access is not set [ 288.694496] DMAR: DRHD: handling fault status reg 402 [ 288.694499] DMAR: [DMA Read] Request device [04:10.2] fault addr 189069c000 [fault reason 06] PTE Read access is not set [ 289.927113] DMAR: DRHD: handling fault status reg 502 [ 289.927116] DMAR: [DMA Read] Request device [01:00.0] fault addr 1890754000 [fault reason 06] PTE Read access is not set [ 290.174275] DMAR: DRHD: handling fault status reg 602 [ 290.174279] DMAR: [DMA Read] Request device [01:00.0] fault addr 1890754000 [fault reason 06] PTE Read access is not set [ 292.174247] DMAR: DRHD: handling fault status reg 702 [ 292.174251] DMAR: [DMA Read] Request device [01:00.0] fault addr 1890754000 [fault reason 06] PTE Read access is not set [ 294.174227] DMAR: DRHD: handling fault status reg 2 [ 294.174230] DMAR: [DMA Read] Request device [01:00.0] fault addr 1890754000 [fault reason 06] PTE Read access is not set [ 296.174216] DMAR: DRHD: handling fault status reg 102 [ 296.174219] DMAR: [DMA Read] Request device [01:00.0] fault addr 1890754000 [fault reason 06] PTE Read access is not set [root@infradev-comp006.naw02.infradev.viasat.io <mailto:root@infradev-comp006.naw02.infradev.viasat.io> <mailto:root@infradev-comp006.naw02.infradev.viasat.io <mailto:root@infradev-comp006.naw02.infradev.viasat.io>> ~] # Thanks. Hi Ravi, The "iommu=pt" workaround applies only when you want to use igb_uio driver. VFIO driver is able to fully utilize IOMMU without the need for pass-through mode. From your log i can see that some devices are bound to igb_uio while others are bound to vfio-pci. Just bind all of the devices you want to use with DPDK to vfio-pci and these errors should go away. -- Thanks, Anatoly -- Thanks, Anatoly
Hi Ravi, Using vfio-pci in IOMMU mode will only work if your VM provides IOMMU emulation (it's a fairly recent development, so your QEMU must be of an appropriate version - can't recall which one off the top of my head). Otherwise you'd have to treat your VM as if it was a machine without IOMMU, i.e. use noiommu mode for VFIO, or igb_uio driver. -- Thanks, Anatoly