[PATCH 5.4 16/33] net: ipv6: Validate GSO SKB before finish IPv6 processing

From: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Date: 2021-01-22 18:25:16
Also in: lkml

From: Aya Levin <redacted>

[ Upstream commit b210de4f8c97d57de051e805686248ec4c6cfc52 ]

There are cases where GSO segment's length exceeds the egress MTU:
 - Forwarding of a TCP GRO skb, when DF flag is not set.
 - Forwarding of an skb that arrived on a virtualisation interface
   (virtio-net/vhost/tap) with TSO/GSO size set by other network
   stack.
 - Local GSO skb transmitted on an NETIF_F_TSO tunnel stacked over an
   interface with a smaller MTU.
 - Arriving GRO skb (or GSO skb in a virtualised environment) that is
   bridged to a NETIF_F_TSO tunnel stacked over an interface with an
   insufficient MTU.

If so:
 - Consume the SKB and its segments.
 - Issue an ICMP packet with 'Packet Too Big' message containing the
   MTU, allowing the source host to reduce its Path MTU appropriately.

Note: These cases are handled in the same manner in IPv4 output finish.
This patch aligns the IPv6 behavior with that of IPv4.

Fixes: 9e50849054a4 ("netfilter: ipv6: move POSTROUTING invocation before fragmentation")
Signed-off-by: Aya Levin <redacted>
Reviewed-by: Tariq Toukan <tariqt@nvidia.com>
Link: https://lore.kernel.org/r/1610027418-30438-1-git-send-email-ayal@nvidia.com
Signed-off-by: Jakub Kicinski <kuba@kernel.org>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
---
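
Note (illustration only, not part of the patch): the ordering of the new
checks in __ip6_finish_output() can be modelled in userspace roughly as
below. The struct pkt_model type, its fields, and the helper names are
hypothetical stand-ins; in particular, seg_network_len() only approximates
what skb_gso_validate_network_len() compares against the MTU, assuming a
single network header plus a transport header in front of each gso_size
payload. The dst_allfrag() and frag_max_size conditions are left out for
brevity.

```c
#include <stdbool.h>

/* Hypothetical, simplified stand-in for the skb fields the checks consult. */
struct pkt_model {
	unsigned int len;         /* total packet length in bytes */
	unsigned int gso_size;    /* payload bytes per segment; 0 means not GSO */
	unsigned int net_hdr_len; /* network header length, e.g. 40 for bare IPv6 */
	unsigned int l4_hdr_len;  /* transport header length, e.g. 20 for TCP */
};

enum out_path {
	PATH_DIRECT,       /* hand the packet straight to the output routine */
	PATH_FRAGMENT,     /* non-GSO packet larger than the MTU */
	PATH_GSO_SLOWPATH  /* GSO packet whose segments would exceed the MTU */
};

static bool is_gso(const struct pkt_model *p)
{
	return p->gso_size != 0;
}

/* Assumed on-wire length of one segment at the network layer. */
static unsigned int seg_network_len(const struct pkt_model *p)
{
	return p->net_hdr_len + p->l4_hdr_len + p->gso_size;
}

/* Mirrors the order of the checks: GSO validation first, then plain length. */
static enum out_path choose_path(const struct pkt_model *p, unsigned int mtu)
{
	if (is_gso(p) && seg_network_len(p) > mtu)
		return PATH_GSO_SLOWPATH;
	if (p->len > mtu && !is_gso(p))
		return PATH_FRAGMENT;
	return PATH_DIRECT;
}
```

With an MTU of 1500, a model packet with gso_size 1460 behind 40 + 20 bytes
of headers yields 1520-byte segments and takes the slow path, while
gso_size 1440 fits exactly and goes out directly.
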
 net/ipv6/ip6_output.c |   41 ++++++++++++++++++++++++++++++++++++++++-
 1 file changed, 40 insertions(+), 1 deletion(-)

--- a/net/ipv6/ip6_output.c
+++ b/net/ipv6/ip6_output.c

@@ -124,8 +124,43 @@ static int ip6_finish_output2(struct net
 	return -EINVAL;
 }
 
+static int
+ip6_finish_output_gso_slowpath_drop(struct net *net, struct sock *sk,
+				    struct sk_buff *skb, unsigned int mtu)
+{
+	struct sk_buff *segs, *nskb;
+	netdev_features_t features;
+	int ret = 0;
+
+	/* Please see corresponding comment in ip_finish_output_gso
+	 * describing the cases where GSO segment length exceeds the
+	 * egress MTU.
+	 */
+	features = netif_skb_features(skb);
+	segs = skb_gso_segment(skb, features & ~NETIF_F_GSO_MASK);
+	if (IS_ERR_OR_NULL(segs)) {
+		kfree_skb(skb);
+		return -ENOMEM;
+	}
+
+	consume_skb(skb);
+
+	skb_list_walk_safe(segs, segs, nskb) {
+		int err;
+
+		skb_mark_not_on_list(segs);
+		err = ip6_fragment(net, sk, segs, ip6_finish_output2);
+		if (err && ret == 0)
+			ret = err;
+	}
+
+	return ret;
+}
+
 static int __ip6_finish_output(struct net *net, struct sock *sk, struct sk_buff *skb)
 {
+	unsigned int mtu;
+
 #if defined(CONFIG_NETFILTER) && defined(CONFIG_XFRM)
 	/* Policy lookup after SNAT yielded a new policy */
 	if (skb_dst(skb)->xfrm) {

@@ -134,7 +169,11 @@ static int __ip6_finish_output(struct ne
 	}
 #endif
 
-	if ((skb->len > ip6_skb_dst_mtu(skb) && !skb_is_gso(skb)) ||
+	mtu = ip6_skb_dst_mtu(skb);
+	if (skb_is_gso(skb) && !skb_gso_validate_network_len(skb, mtu))
+		return ip6_finish_output_gso_slowpath_drop(net, sk, skb, mtu);
+
+	if ((skb->len > mtu && !skb_is_gso(skb)) ||
 	    dst_allfrag(skb_dst(skb)) ||
 	    (IP6CB(skb)->frag_max_size && skb->len > IP6CB(skb)->frag_max_size))
 		return ip6_fragment(net, sk, skb, ip6_finish_output2);
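
Note (illustration only, not part of the patch): the loop in
ip6_finish_output_gso_slowpath_drop() detaches each segment before handing
it on, keeps walking after a failure, and reports only the first error. A
minimal userspace sketch of that pattern follows. struct seg, send_all()
and demo_handler() are hypothetical names; the macro has the same shape as
the kernel's skb_list_walk_safe(), and clearing next stands in for
skb_mark_not_on_list().

```c
#include <stddef.h>

/* Hypothetical stand-in for one segment on a singly linked list. */
struct seg {
	struct seg *next;
	int id;
};

/* Save the successor before the body runs, so the body may detach cur. */
#define seg_list_walk_safe(first, cur, nxt)                          \
	for ((cur) = (first), (nxt) = (cur) ? (cur)->next : NULL;    \
	     (cur);                                                  \
	     (cur) = (nxt), (nxt) = (cur) ? (cur)->next : NULL)

typedef int (*seg_handler)(struct seg *s);

/* Number of segments the demo handler has seen. */
static int handled;

/* Hypothetical handler: fails with -id for odd ids, succeeds otherwise. */
static int demo_handler(struct seg *s)
{
	handled++;
	return (s->id % 2) ? -s->id : 0;
}

/* Pass every segment to the handler; return the first error, or 0. */
static int send_all(struct seg *segs, seg_handler handler)
{
	struct seg *next;
	int ret = 0;

	seg_list_walk_safe(segs, segs, next) {
		int err;

		segs->next = NULL; /* detach the segment from the list */
		err = handler(segs);
		if (err && ret == 0)
			ret = err; /* remember only the first failure */
	}

	return ret;
}
```

Every segment is still offered to the handler after an earlier one fails,
which matches the loop in the patch: a failure on one segment does not stop
the remaining segments from being processed.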
