Re: [PATCH V2 2/2] xfs: Fix "use after free" of intent items
From: Dave Chinner <david@fromorbit.com>
Date: 2018-02-21 04:01:47
On Wed, Feb 21, 2018 at 07:58:26AM +0530, Chandan Rajendra wrote:
generic/388 can cause the following "use after free" error to occur,
=============================================================================
BUG xfs_efi_item (Not tainted): Poison overwritten
-----------------------------------------------------------------------------
Disabling lock debugging due to kernel taint
INFO: 0x00000000292c4bd4-0x00000000292c4bd4. First byte 0x6a instead of 0x6b
INFO: Allocated in .kmem_zone_alloc+0xcc/0x190 age=79 cpu=0 pid=12436
.__slab_alloc+0x54/0x80
.kmem_cache_alloc+0x124/0x350
.kmem_zone_alloc+0xcc/0x190
.xfs_efi_init+0x48/0xf0
.xfs_extent_free_create_intent+0x40/0x130
.xfs_defer_intake_work+0x74/0x1e0
.xfs_defer_finish+0xac/0x5c0
.xfs_itruncate_extents+0x170/0x590
.xfs_inactive_truncate+0xcc/0x170
.xfs_inactive+0x1d8/0x2f0
.xfs_fs_destroy_inode+0xe4/0x3d0
.destroy_inode+0x68/0xb0
.do_unlinkat+0x1e8/0x390
system_call+0x58/0x6c
INFO: Freed in .xfs_efi_item_free+0x44/0x80 age=79 cpu=0 pid=12436
.kmem_cache_free+0x120/0x2b0
.xfs_efi_item_free+0x44/0x80
.xfs_trans_free_items+0xd4/0x130
.__xfs_trans_commit+0xd0/0x350
.xfs_trans_roll+0x4c/0x90
.xfs_defer_trans_roll+0xa4/0x2b0
.xfs_defer_finish+0xb8/0x5c0
.xfs_itruncate_extents+0x170/0x590
.xfs_inactive_truncate+0xcc/0x170
.xfs_inactive+0x1d8/0x2f0
.xfs_fs_destroy_inode+0xe4/0x3d0
.destroy_inode+0x68/0xb0
.do_unlinkat+0x1e8/0x390
system_call+0x58/0x6c
This happens due to the following interaction,
1. xfs_defer_finish() creates "extent free" intent item and adds it to the
per-transction list of log items.
2. xfs_defer_trans_roll() invokes __xfs_trans_commit(). Here, if the
XFS_MOUNT_FS_SHUTDOWN flag is set, we invoke io_unlock() operation
for each of the log items in the per-transction list. For "extent
free" log items xfs_efi_item_unlock() gets invoked which then frees
the xfs_efi_log_item.Isn't this the problem? i.e. it's freeing the EFI item because the abort flag is set, but there are still other references to it? Isn't this wrong now that we have a cleanup function that will free the EFI is the commit fails? i.e: this ....
3. xfs_defer_trans_roll() then invokes xfs_defer_trans_abort(). Since the xfs_defer_pending->dfp_intent is still set to the "extent free" intent item, we invoke xfs_extent_free_abort_intent(). This accesses the previously freed xfs_efi_log_item to decrement the ref count.
... will free the EFI correctly on transaction commit failure and so we should be able to remove the "free immediately" hack from the iop_unlock() code that we used to need because there was no external reference that could free the EFI on commit failure....
This commit fixes the bug by invoking xfs_defer_trans_abort() only when the log items in the per-transaction list have been committed to the CIL. The log item "committed" status is being tracked by xfs_defer_ops->dop_committed. This was the behaviour prior to commit 3ab78df2a59a485f479d26852a060acfd8c4ecd7 (xfs: rework xfs_bmap_free callers to use xfs_defer_ops).
Freeing in ->iop_unlock was always troublesome. We should get rid of it, not go back to it. Cheers, Dave. -- Dave Chinner david@fromorbit.com