Re: [PATCH] Periodically flow expire from flow offload tables
From: Pablo Neira Ayuso <pablo@netfilter.org>
Date: 2022-10-26 10:50:42
Also in:
linux-doc, lkml, netfilter-devel
Hi, On Tue, Oct 25, 2022 at 03:32:51PM +0200, Michael Lilja wrote:
Hi, Thanks for the optimisation suggestions, my nft is a rough conversion from iptables, I will look into using maps. The ingress chain will work fine for SW OFFLOAD but HW OFFLOAD is not solved by this, at least what I see is that once a flow is offloaded to HW the driver doesn’t see the packets? If I use the ingress chain I guess I don’t have access to ‘ct mark’ yet? I could think of a use-case where schedules should only some ‘flow type’: meta mask != 0x12340000/16 meta day “Tuesday" meta hourquoted
= "06:00" meta hour < "07:00" dropI have more advanced rules that check the ct mark and will need to drop if mark == something. These mark == something rules are applied ‘runtime’ and flowables doesn’t seem to be flushed on nft load, which is also a reason for my ‘flow retire’ from the tables.
It should be also possible to notify the flowtable that the ruleset has been updated. That won't cover the meta day, hour, time scenario though. I think both mechanism (the 'retire' feature you propose) and ruleset update notifications are complementary each other and they would be good to have.
So my overall goal is to receive packets, mark them with a value depending on 'flow type' and then for the flows that are allowed to be forwarded offload them to the ingress flow table for either HW or SW offload. Once in a while I will change the verdict of a ‘flow type’ and will need that to apply for all existing flows and future flows, besides the fixed schedules, and it should work both for SW OFFLOAD and HW OFFLOAD. I only have the M7621 device to play with for HW OFFLOAD, but it works fine with my patch.
Thanks for explaining.
My suggestions are:
- Add support for this in the flowtable netlink interface (instead of
sysctl), I'm going to post a patch to add support for setting the
flowtable size, it can be used as reference to expose this new
'retire' feature.
- flow_offload_teardown() already unsets the IPS_OFFLOAD bit, so
probably your patch can follow that path too (instead of clearing
IPS_OFFLOAD_BIT from flow_offload_del).
static void nf_flow_offload_gc_step(struct nf_flowtable *flow_table,
struct flow_offload *flow, void *data)
{
if (nf_flow_has_expired(flow) ||
nf_ct_is_dying(flow->ct))
flow_offload_teardown(flow);