Re: do we have too much fsync() configuration in 'next'? (was: [PATCH v7]... | git

[PATCH 0/2] A design for future-proofing fsync() configuration · Neeraj K. Singh via GitGitGadget <hidden> · 2021-12-04
[PATCH 1/2] fsync: add writeout-only mode for fsyncing repo data · Neeraj Singh via GitGitGadget <hidden> · 2021-12-04
Re: [PATCH 1/2] fsync: add writeout-only mode for fsyncing repo data · Neeraj Singh <hidden> · 2021-12-06
[PATCH 2/2] core.fsync: introduce granular fsync control · Neeraj Singh via GitGitGadget <hidden> · 2021-12-04
[PATCH v2 0/3] A design for future-proofing fsync() configuration · Neeraj K. Singh via GitGitGadget <hidden> · 2021-12-07
[PATCH v2 1/3] core.fsyncmethod: add writeout-only mode · Neeraj Singh via GitGitGadget <hidden> · 2021-12-07
Re: [PATCH v2 1/3] core.fsyncmethod: add writeout-only mode · Patrick Steinhardt <hidden> · 2021-12-07
Re: [PATCH v2 1/3] core.fsyncmethod: add writeout-only mode · Ævar Arnfjörð Bjarmason <hidden> · 2021-12-07
Re: [PATCH v2 1/3] core.fsyncmethod: add writeout-only mode · Neeraj Singh <hidden> · 2021-12-07
Re: [PATCH v2 1/3] core.fsyncmethod: add writeout-only mode · Ævar Arnfjörð Bjarmason <hidden> · 2021-12-07
Re: [PATCH v2 1/3] core.fsyncmethod: add writeout-only mode · Neeraj Singh <hidden> · 2021-12-07
[PATCH v2 2/3] core.fsync: introduce granular fsync control · Neeraj Singh via GitGitGadget <hidden> · 2021-12-07
Re: [PATCH v2 2/3] core.fsync: introduce granular fsync control · Patrick Steinhardt <hidden> · 2021-12-07
Re: [PATCH v2 2/3] core.fsync: introduce granular fsync control · Neeraj Singh <hidden> · 2021-12-07
Re: [PATCH v2 2/3] core.fsync: introduce granular fsync control · Ævar Arnfjörð Bjarmason <hidden> · 2021-12-07
Re: [PATCH v2 2/3] core.fsync: introduce granular fsync control · Neeraj Singh <hidden> · 2021-12-07
Re: [PATCH v2 2/3] core.fsync: introduce granular fsync control · Ævar Arnfjörð Bjarmason <hidden> · 2021-12-08
Re: [PATCH v2 2/3] core.fsync: introduce granular fsync control · Neeraj Singh <hidden> · 2021-12-09
Re: [PATCH v2 2/3] core.fsync: introduce granular fsync control · Ævar Arnfjörð Bjarmason <hidden> · 2021-12-09
Re: [PATCH v2 2/3] core.fsync: introduce granular fsync control · Neeraj Singh <hidden> · 2021-12-09
Re: [PATCH v2 2/3] core.fsync: introduce granular fsync control · Neeraj Singh <hidden> · 2022-01-18
Re: [PATCH v2 2/3] core.fsync: introduce granular fsync control · Ævar Arnfjörð Bjarmason <hidden> · 2022-01-19
Re: [PATCH v2 2/3] core.fsync: introduce granular fsync control · Ævar Arnfjörð Bjarmason <hidden> · 2022-01-19
Re: [PATCH v2 2/3] core.fsync: introduce granular fsync control · Neeraj Singh <hidden> · 2022-01-28
[PATCH v2 3/3] core.fsync: new option to harden the index · Neeraj Singh via GitGitGadget <hidden> · 2021-12-07
Re: [PATCH v2 0/3] A design for future-proofing fsync() configuration · Patrick Steinhardt <hidden> · 2021-12-07
Re: [PATCH v2 0/3] A design for future-proofing fsync() configuration · Neeraj Singh <hidden> · 2021-12-08
[PATCH v3 0/4] A design for future-proofing fsync() configuration · Neeraj K. Singh via GitGitGadget <hidden> · 2021-12-09
[PATCH v3 1/4] core.fsyncmethod: add writeout-only mode · Neeraj Singh via GitGitGadget <hidden> · 2021-12-09
[PATCH v3 2/4] core.fsync: introduce granular fsync control · Neeraj Singh via GitGitGadget <hidden> · 2021-12-09
[PATCH v3 3/4] core.fsync: new option to harden the index · Neeraj Singh via GitGitGadget <hidden> · 2021-12-09
[PATCH v3 4/4] core.fsync: add a `derived-metadata` aggregate option · Neeraj Singh via GitGitGadget <hidden> · 2021-12-09
Re: [PATCH v3 0/4] A design for future-proofing fsync() configuration · Neeraj Singh <hidden> · 2022-01-08
RE: [PATCH v3 0/4] A design for future-proofing fsync() configuration · <hidden> · 2022-01-09
Re: [PATCH v3 0/4] A design for future-proofing fsync() configuration · Neeraj Singh <hidden> · 2022-01-10
[PATCH v4 0/4] A design for future-proofing fsync() configuration · Neeraj K. Singh via GitGitGadget <hidden> · 2022-02-01
[PATCH v4 1/4] core.fsyncmethod: add writeout-only mode · Neeraj Singh via GitGitGadget <hidden> · 2022-02-01
[PATCH v4 2/4] core.fsync: introduce granular fsync control · Neeraj Singh via GitGitGadget <hidden> · 2022-02-01
[PATCH v4 3/4] core.fsync: new option to harden the index · Neeraj Singh via GitGitGadget <hidden> · 2022-02-01
[PATCH v4 4/4] core.fsync: add a `derived-metadata` aggregate option · Neeraj Singh via GitGitGadget <hidden> · 2022-02-01
[PATCH v5 0/5] A design for future-proofing fsync() configuration · Neeraj K. Singh via GitGitGadget <hidden> · 2022-03-09
[PATCH v5 1/5] wrapper: move inclusion of CSPRNG headers the wrapper.c file · Neeraj Singh via GitGitGadget <hidden> · 2022-03-09
Re: [PATCH v5 1/5] wrapper: move inclusion of CSPRNG headers the wrapper.c file · brian m. carlson <hidden> · 2022-03-10
Re: [PATCH v5 1/5] wrapper: move inclusion of CSPRNG headers the wrapper.c file · Neeraj Singh <hidden> · 2022-03-10
[PATCH v5 2/5] core.fsyncmethod: add writeout-only mode · Neeraj Singh via GitGitGadget <hidden> · 2022-03-09
[PATCH v5 3/5] core.fsync: introduce granular fsync control · Neeraj Singh via GitGitGadget <hidden> · 2022-03-09
[PATCH v5 4/5] core.fsync: new option to harden the index · Neeraj Singh via GitGitGadget <hidden> · 2022-03-09
[PATCH v5 5/5] core.fsync: documentation and user-friendly aggregate options · Neeraj Singh via GitGitGadget <hidden> · 2022-03-09
Future-proofed syncing of refs · Patrick Steinhardt <hidden> · 2022-03-10
[PATCH 6/8] core.fsync: add `fsync_component()` wrapper which doesn't die · Patrick Steinhardt <hidden> · 2022-03-10
[PATCH 7/8] core.fsync: new option to harden loose references · Patrick Steinhardt <hidden> · 2022-03-10
Re: [PATCH 7/8] core.fsync: new option to harden loose references · Neeraj Singh <hidden> · 2022-03-10
[PATCH 8/8] core.fsync: new option to harden packed references · Patrick Steinhardt <hidden> · 2022-03-10
[PATCH v6 0/6] A design for future-proofing fsync() configuration · Neeraj K. Singh via GitGitGadget <hidden> · 2022-03-10
[PATCH v6 1/6] wrapper: make inclusion of Windows csprng header tightly scoped · Neeraj Singh via GitGitGadget <hidden> · 2022-03-10
[PATCH v6 3/6] core.fsync: introduce granular fsync control infrastructure · Neeraj Singh via GitGitGadget <hidden> · 2022-03-10
[PATCH v6 4/6] core.fsync: add configuration parsing · Neeraj Singh via GitGitGadget <hidden> · 2022-03-10
Re: [PATCH v6 4/6] core.fsync: add configuration parsing · Jiang Xin <hidden> · 2022-03-28
Re: [PATCH v6 4/6] core.fsync: add configuration parsing · Neeraj Singh <hidden> · 2022-03-28
[PATCH v6 5/6] core.fsync: new option to harden the index · Neeraj Singh via GitGitGadget <hidden> · 2022-03-10
[PATCH v6 2/6] core.fsyncmethod: add writeout-only mode · Neeraj Singh via GitGitGadget <hidden> · 2022-03-10
[PATCH v6 6/6] core.fsync: documentation and user-friendly aggregate options · Neeraj Singh via GitGitGadget <hidden> · 2022-03-10
[PATCH v7] core.fsync: documentation and user-friendly aggregate options · Neeraj Singh <hidden> · 2022-03-15
do we have too much fsync() configuration in 'next'? (was: [PATCH v7] core.fsync: documentation and user-friendly aggregate options) · Ævar Arnfjörð Bjarmason <hidden> · 2022-03-23
Re: do we have too much fsync() configuration in 'next'? (was: [PATCH v7] core.fsync: documentation and user-friendly aggregate options) · Neeraj Singh <hidden> · 2022-03-25
Re: do we have too much fsync() configuration in 'next'? (was: [PATCH v7] core.fsync: documentation and user-friendly aggregate options) · Ævar Arnfjörð Bjarmason <hidden> · 2022-03-26
Re: do we have too much fsync() configuration in 'next'? (was: [PATCH v7] core.fsync: documentation and user-friendly aggregate options) · Neeraj Singh <hidden> · 2022-03-26
Re: do we have too much fsync() configuration in 'next'? (was: [PATCH v7] core.fsync: documentation and user-friendly aggregate options) · Ævar Arnfjörð Bjarmason <hidden> · 2022-03-26
Re: do we have too much fsync() configuration in 'next'? (was: [PATCH v7] core.fsync: documentation and user-friendly aggregate options) · Neeraj Singh <hidden> · 2022-03-27
Re: do we have too much fsync() configuration in 'next'? (was: [PATCH v7] core.fsync: documentation and user-friendly aggregate options) · Ævar Arnfjörð Bjarmason <hidden> · 2022-03-27
Re: do we have too much fsync() configuration in 'next'? (was: [PATCH v7] core.fsync: documentation and user-friendly aggregate options) · Patrick Steinhardt <hidden> · 2022-03-28
Re: do we have too much fsync() configuration in 'next'? (was: [PATCH v7] core.fsync: documentation and user-friendly aggregate options) · Ævar Arnfjörð Bjarmason <hidden> · 2022-03-28
Re: do we have too much fsync() configuration in 'next'? (was: [PATCH v7] core.fsync: documentation and user-friendly aggregate options) · Neeraj Singh <hidden> · 2022-03-28
Re: do we have too much fsync() configuration in 'next'? (was: [PATCH v7] core.fsync: documentation and user-friendly aggregate options) · Neeraj Singh <hidden> · 2022-03-30
[PATCH v2] core.fsync: new option to harden references · Patrick Steinhardt <hidden> · 2022-03-11
Re: [PATCH v2] core.fsync: new option to harden references · SZEDER Gábor <hidden> · 2022-03-25

Re: do we have too much fsync() configuration in 'next'? (was: [PATCH v7] core.fsync: documentation and user-friendly aggregate options)

From: Ævar Arnfjörð Bjarmason <hidden>
Date: 2022-03-28 12:15:58

On Mon, Mar 28 2022, Patrick Steinhardt wrote:

[[PGP Signed Part:Undecided]]
On Sun, Mar 27, 2022 at 02:43:48PM +0200, Ævar Arnfjörð Bjarmason wrote:

quoted

On Sat, Mar 26 2022, Neeraj Singh wrote:

quoted

On Sat, Mar 26, 2022 at 8:34 AM Ævar Arnfjörð Bjarmason
[off-list ref] wrote:

quoted


On Fri, Mar 25 2022, Neeraj Singh wrote:

quoted

On Fri, Mar 25, 2022 at 5:33 PM Ævar Arnfjörð Bjarmason
[off-list ref] wrote:

[...]

quoted

I want to make a comment about the Index here.  Syncing the index is
strictly required for the "added" level of consistency, so that we
don't lose stuff that leaves the work tree but is staged.  But my
Windows enlistment has an index that's 266MB, which would be painful
to sync even with all the optimizations.  Maybe with split-index, this
wouldn't be so bad, but I just wanted to call out that some advanced
users may really care about the configurability.

So for that use-case you'd like to fsync the loose objects (if any), but
not the index? So the FS will "flush" up to the index, and then queue
the index for later syncing to platter?


But even in that case don't the settings need to be tied to one another,
because in the method=bulk sync=index && sync=!loose case wouldn't we be
syncing "loose" in any case?

quoted

As Git's various database implementations improve, the fsync stuff
will hopefully be more optimal and self-tuning.  But as that happens,
Git could just start ignoring settings that lose meaning without tying
anyones hands.

Yeah that would alleviate most of my concerns here, but the docs aren't
saying anything like that. Since you added them & they just landed, do
you mind doing a small follow-up where we e.g. say that these new
settings are "EXPERIMENTAL" or whatever, and subject to drastic change?

The doc is already pretty prescriptive.  It has this line at the end
of the first  paragraph:
"Unless you
have special requirements, it is recommended that you leave
this option empty or pick one of `committed`, `added`,
or `all`."

Those values are already designed to change as Git changes.

I'm referring to the documentation as it stands not being marked as
experimental in the sense that we might decide to re-do this to a large
extent, i.e. something like the diff I suggested upthread in
https://lore.kernel.org/git/220323.86fsn8ohg8.gmgdl@evledraar.gmail.com/ (local)

So yes, I agree that it e.g. clearly states that you can add a new
core.git=foobar or whatever down the line, but it clearly doesn't
suggest that e.g. core.fsync might have boolean semantics in some later
version, or that the rest might simply be ignored, even if that
e.g. means that we wouldn't sync loose objects on
core.fsync=loose-object, as we'd just warn with a "we don't provide this
anymore".

Or do you disagree with that? IOW I mean that we'd do something like
this, either in docs or code:

diff --git a/config.c b/config.c
index 3c9b6b589ab..94548566073 100644
--- a/config.c
+++ b/config.c

@@ -1675,6 +1675,9 @@ static int git_default_core_config(const char *var, const char *value, void *cb)
 	}
 
 	if (!strcmp(var, "core.fsync")) {
+		if (!the_repository->settings.feature_experimental)
+			warning(_("the '%s' configuration option is EXPERIMENTAL. opt-in to use it with feature.experimental=true"),
+				var);
 		if (!value)
 			return config_error_nonbool(var);
 		fsync_components = parse_fsync_components(var, value);

@@ -1682,6 +1685,9 @@ static int git_default_core_config(const char *var, const char *value, void *cb)
 	}
 
 	if (!strcmp(var, "core.fsyncmethod")) {
+		if (!the_repository->settings.feature_experimental)
+			warning(_("the '%s' configuration option is EXPERIMENTAL. opt-in to use it with feature.experimental=true"),
+				var);
 		if (!value)
 			return config_error_nonbool(var);
 		if (!strcmp(value, "fsync"))

Let's please not tie this to `feature.experimental=true`. Setting that
option has unintended sideeffects and will also change defaults which we
may not want to have in production. I don't mind adding a warning in the
docs though that the specific items which can be configured may be
subject to change in the future.

Yes, that was a bad (throwaway) idea. I think probably any sort of
warning is over-doing it, but having the same in the docs would be good,
as in:

    git gre EXPERIMENTAL -- Documentation

At GitLab, we've got a three-step plan:

    1. We need to migrate to `core.fsync` in the first place. In order
       to not migrate and change behaviour at the same point in time we
       already benefit from the fine-grainedness of this config because
       we can simply say `core.fsync=loose-objects` and have the same
       behaviour as before with `core.fsyncLooseObjects=true`.

*nod*.

    2. We'll want to enable syncing of packfiles, which I think wasn't
       previously covered by `core.fsyncLooseobjects`.

We've always fsynced packfiles and other things that use the
finalize_hashfile() API. I.e. the pack metadata (idx,midx,bitmap etc.),
commit-graph etc.

Which is one thing I find a bit uncomfortable about the proposed config
schema, i.e. it's allowing *granular* unsafe behavior we didn't allow
before.

I think we *should* allow disabling fsync() entirely via:

    core.fsync=false

Per Eric Wong's [added to CC] proposal here:
https://lore.kernel.org/git/20211028002102.19384-1-e@80x24.org/ (local); I think
that's useful for e.g. running git in CI, one off scripted mass-imports
where you run "sync(1)" after (or not...).

But I don't really see the use-case for turning off say "index" or
"pack-metadata", but otherwise keeping the default fsync().

    3. We'll add `refs` to also sync loose refs to disk.

Maybe I'm reading this wrong, but AFAICT if you upgrade from pre-v2.36.0
to v2.36.0 you'll have no way to do fsync()-ing as it was done before
with your bc22d845c43 (core.fsync: new option to harden references,
2022-03-11).

I.e. I first thought you meant to start with:

    core.fsync=-loose-objects
    core.fsync=-reference

And then remove that "core.fsync=-reference" line to get the behavior
that'll be new in v2.36.0, but that won't do that. The new "reference"
category doesn't just affect loose refs, but all ref updates.

So we don't have any way to move to v2.36.0 and get exactly the fsync()
behavior we did before, or have I misread the code?

Now, I don't think we need it to be configurable at all.

I.e. I think your is a bc22d845c43 good change, but it feels weird to
make it and leave the default of core.fsyncLooseObjects=false on the
table. I.e. what you're summarizing there is true, but it's also true of
the loose objects.

To the extent that we've had any reason at all not to sync those by
default (which has really mostly been "we didn't re-visit it for a
while") it's been performance.

And both loose refs & loose objects will suffer from the same
degradation in performance from many fsync()'s.

IOW I think it's perfectly fine not to add a config knob for it other
than core.fsync=false, and VERY SOON turn on
core.fsyncLooseobjects=true, especially if we can get most of the
performance benefits with the "bulk" mode.

But why half-way with bc22d845c43? I mean, *that change* should be
narrow, but in terms of where we go next what do you think of the above?

So while the end result will be the same as `committed`, having this
level of control helps us to assess the impact in a nicer way by being
able to do this step by step with feature flags.

*Nod*, although leaving aside the new syncing of loose refs the plan you
 outlined above could be done with the proposal of the simpler:

    core.fsync=true
    core.fsyncLooseObjects=false

On the other hand, many of the other parts we don't really care about.
Auxiliary metadata like the commit-graph or pack indices are data that
can in the worst case be regenerated by us, so it's not clear to me
whether it makes to also enable fsyncing those in production.

I'm not familiar with all of those in detail, i.e. how we behave
specifically in the face of them being corrupt.

The commit-graph I am, we *should* recover "gracefully" there, but you
might have an incident shortly there after due to "for-each-ref
--contains" slowdowns by 1000x or whatever.

For e.g. *.idx we'd be hosed until manual recovery.

So I think those area all in the same bucket as core.fsync=false, and
that we don't need the granularity.

So altogether, I agree with Neeraj: having the fine-grainedness greatly
helps us to roll out changes like this and be able to pick what we deem
to be important. Personally I would be fine with explicitly pointing out
that there are two groups of this config in our docs though:

Yes, that's fair. But pending replies to the above I think the main
point & proposal of us having too much config stands. I.e. depending on
what you want to do with loose object refs we'd just need this:

    core.fsync=[<bool>] # true by default
    core.fsyncLooseObjects=[<bool>] # false by default
    core.fsyncLooseRefs=[<bool>] # true by default?

And then the "bulk" config, which would be orthagonal to this.

I.e. do we really have a use-case for the rest of the kitchen sink?

    1. The "porcelain" group: "committed", "added", "all", "none". These
       are abstract groups whose behaviour should adapt as we change
       implementations, and are those that should typically be set by a
       user, if intended.

    2. The "plumbing" or "expert" group: these are fine-grained options
       which shouldn't typically be used by Git users. They still have
       merit though in hosting environments, where requirements are
       typically a lot more specific.

We may also provide different guarantees for both groups. The first one
should definitely be stable, but we might state that the second group is
subject to change in the future.

I hope we can work something out :)

Overall: I think you've left one of the the main things I brought up[1]
unaddressed, i.e. that the core.fsync config schema in its current form
assumes that we can sync A or B, and configure those separately.

Which AFAIKT is because Neeraj's initial implementation & the discussion
was focused on finishing A or B with a per-"group" "cookie" to flush the
files.

But as [2] shows it's more performant for us to simply defer the fsync
of A until the committing of B.

Which is the main reason I think we should be re-visiting this. Sure, if
we were just syncing A, B or C having per-[ABC] config options might be
a bit overdoing it, but would be relatively simple.

But once we start using a more optimized version of the "bulk" mode the
config schema will be making promises about individual steps in a
transaction that I think we'll want to leave opaque, and only promise
that when git returns it will have synced all the relevant assets as
efficiently as possible.

1. https://lore.kernel.org/git/220323.86fsn8ohg8.gmgdl@evledraar.gmail.com/ (local)
2. https://lore.kernel.org/git/RFC-patch-v2-4.7-61f4f3d7ef4-20220323T140753Z-avarab@gmail.com/ (local)

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help