Thread (19 messages) 19 messages, 4 authors, 2026-01-16

Re: Slow git pack-refs --all

From: Jeff King <hidden>
Date: 2026-01-06 10:38:05

On Mon, Jan 05, 2026 at 11:45:41PM +0000, Martin Fick wrote:
By repacking to get one used, and one cruft pack only, and no loose 
objects, I have confirmed that pack-refs it is still slow. This rules out the 
idea that the loose object, or pack file counts were making things slow.
OK, that is interesting. I'd still expect opening the objects to be the
dominating factor, but now the load would be on jumping around the
mmap'd packfile rather than open/read/close calls.
OK, after discovering the strace -r and -T options, I have determined that
the 29K writes were all very fast in themselves. However, most of the
writes seem to follow each other with no other system calls in between.
This explains why it looks like the writes are slow, even though they aren't.

If I tally up the time between the previous system call, and each write(),
it adds up to the bulk of the time (4mins out of 4m15s) that it takes to
pack refs. This tells me that no visible I/O or system calls are the problem,
but rather that the program itself is taking a long time between writes.
I very much doubt that this is heavy CPU time, but rather I am going to 
guess that this is hidden system time spent accessing mmaped memory.
That would be consistent with reading object data from the packfile.
We'll jump around within the packfile to get that data.
Could it be really slow reading the packed-refs file? I can see the 
packed-refs file is mmaped() before the writes start, and then 
munmapped after the writes are completed. If I had to guess, that likely
means that the packed-refs file is being read in small increments by the 
kernel via mmap, and that is what is making things very slow over NFS.
The packed-refs file is mmap'd, but we'll be reading it sequentially. I
guess whether or not there is good read-ahead there may depend on the
NFS implementation.
My alternative theory, is that each ref is being looked up via a binary 
search, but I don't think git does this?
Git does binary search within the packed-refs file, but it shouldn't be
doing so here. The write-out phase of packing refs is a straight merge
between two lists: the existing packed-refs entries and the new entries
we are adding.

I'd second Patrick's suggestion to use perf or similar to try to see
where the time is going.

You might also try building Git with NO_MMAP. That might make the I/O
costs more apparent via strace, because they'll be coming via pread().

-Peff
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help