Thread (3 messages) 3 messages, 2 authors, 2026-01-30

Re: [PATCH v2 1/1] Allow reworking with a file after deciding on all its hunks

From: Samuel Abraham <hidden>
Date: 2026-01-28 11:26:48

On Tue, Jan 27, 2026 at 9:48 PM Junio C Hamano [off-list ref] wrote:
Abraham Samuel Adekunle [off-list ref] writes:
quoted
diff --git a/add-patch.c b/add-patch.c
index 173a53241e..edb2fab3fd 100644
--- a/add-patch.c
+++ b/add-patch.c
@@ -1418,6 +1418,8 @@ N_("j - go to the next undecided hunk, roll over at the bottom\n"
    "e - manually edit the current hunk\n"
    "p - print the current hunk\n"
    "P - print the current hunk using the pager\n"
+   "> - go to the next file\n"
+   "< - go to the previous file\n"
    "? - print help\n");
As I said earlier, these may have to be optional.  It may give
existing users a jarring experience to be given a prompt after
deciding on all the hunks in a file, when they expect to be on
the next file already.
Yes I agree.
I will work on making it an optional feature.
quoted
@@ -1441,6 +1443,17 @@ static bool get_first_undecided(const struct file_diff *file_diff, size_t *idx)
      return false;
 }

+static size_t get_file_diff_index(struct add_p_state *s, struct file_diff *file_diff) {
+     size_t idx = 0;
+     for (size_t i = 0; i < s->file_diff_nr; i++) {
+             if (s->file_diff + i == file_diff) {
+                     idx = i;
+                     break;
+             }
+     }
+     return idx;
+}
Yuck.  Can't we lose the need for this function if we change the
interface into patch_update_file so that it takes the index of the
file (i.e., instead of "&s.file_diff[i]", pass "i")?  There is only
one caller to patch_update_file() which is run_add_p(), so such a
clean-up should be trivial.
Ah yes this is definitely a sweet and better option.
quoted
 static int patch_update_file(struct add_p_state *s,
                           struct file_diff *file_diff)
 {
@@ -1448,9 +1461,10 @@ static int patch_update_file(struct add_p_state *s,
      ssize_t i, undecided_previous, undecided_next, rendered_hunk_index = -1;
      struct hunk *hunk;
      char ch;
-     struct child_process cp = CHILD_PROCESS_INIT;
This is related to the hoisting of the actual patch application to
the caller, but it is not explained why such a change is needed, and
it byitself, even without the "jump to the next file before deciding
on all the hunks" feature.  What problem is it solving???
I explained this below
If it is necessary to move the code to run "git apply" to the
caller, would it make sense to split this patch into at least two
patches, one to do such a move, possibly another patch to change the
function signature of patch_update_file() so that it takes the file
index instead of file_diff struct, and finally another patch to
allow jumping around the files?
Okay yes it would make much sense.
quoted
      int colored = !!s->colored.len, quit = 0, use_pager = 0;
      enum prompt_mode_type prompt_mode_type;
+     size_t file_diff_index = get_file_diff_index(s, file_diff);
+     int all_decided = 0;

      /* Empty added files have no hunks */
      if (!file_diff->hunk_nr && !file_diff->added)
@@ -1467,7 +1481,9 @@ static int patch_update_file(struct add_p_state *s,
                      ALLOW_GOTO_NEXT_UNDECIDED_HUNK = 1 << 3,
                      ALLOW_SEARCH_AND_GOTO = 1 << 4,
                      ALLOW_SPLIT = 1 << 5,
-                     ALLOW_EDIT = 1 << 6
+                     ALLOW_EDIT = 1 << 6,
+                     ALLOW_GOTO_PREVIOUS_FILE = 1 << 7,
+                     ALLOW_GOTO_NEXT_FILE = 1 << 8
              } permitted = 0;

              if (hunk_index >= file_diff->hunk_nr)
@@ -1499,8 +1515,7 @@ static int patch_update_file(struct add_p_state *s,
              /* Everything decided? */
              if (undecided_previous < 0 && undecided_next < 0 &&
                  hunk->use != UNDECIDED_HUNK)
-                     break;
-
+                             all_decided = 1;
              strbuf_reset(&s->buf);
              if (file_diff->hunk_nr) {
                      if (rendered_hunk_index != hunk_index) {
@@ -1548,6 +1563,16 @@ static int patch_update_file(struct add_p_state *s,
                              permitted |= ALLOW_EDIT;
                              strbuf_addstr(&s->buf, ",e");
                      }
+                     if (file_diff_index >= 0 &&
+                             file_diff_index < s->file_diff_nr - 1) {
+                             permitted |= ALLOW_GOTO_NEXT_FILE;
+                             strbuf_addstr(&s->buf, ",>");
+                     }
+                     if (file_diff_index > 0 &&
+                             file_diff_index <= s->file_diff_nr - 1) {
+                             permitted |= ALLOW_GOTO_PREVIOUS_FILE;
+                             strbuf_addstr(&s->buf, ",<");
+                     }
As can be seen in what patch_update_file() does when the user says
'J' or 'K', hunks in a file are treated as a ring, and these
commands are enabled as long as there are more than one hunks.

Perhaps that is more familiar than "when we hit the floor, we cannot
sink deeper, and when we hit the ceiling, we cannot float more",
which seems to be what the above implements.
Yes I understand this now.
It does make sense this way.
quoted
                      strbuf_addstr(&s->buf, ",p,P");
              }
              if (file_diff->deleted)
@@ -1566,6 +1591,9 @@ static int patch_update_file(struct add_p_state *s,
                                              : 1));
              printf(_(s->mode->prompt_mode[prompt_mode_type]),
                     s->buf.buf);
+             if (all_decided)
+                     printf(_("\n%s All hunks decided. What now? "),
+                             s->s.prompt_color);
              if (*s->s.reset_color_interactive)
                      fputs(s->s.reset_color_interactive, stdout);
              fflush(stdout);
@@ -1618,7 +1646,24 @@ static int patch_update_file(struct add_p_state *s,
              } else if (ch == 'q') {
                      quit = 1;
                      break;
-             } else if (s->answer.buf[0] == 'K') {
+             } else if (s->answer.buf[0] == '>') {
+                     if (permitted & ALLOW_GOTO_NEXT_FILE) {
+                             quit = 0;
+                             break;
+                     } else {
+                             err(s, _("No next file"));
+                             continue;
+                     }
+             } else if (s->answer.buf[0] == '<') {
+                     if (permitted & ALLOW_GOTO_PREVIOUS_FILE) {
+                             quit = 2;
+                             break;
What's the magic number "2"?  Should "quit" become an enum with
elements that are more meaningfully named?
Okay, yes an enum would be better.
quoted
+                     } else {
+                             err(s, _("No previous file"));
+                             continue;
+                     }
+             }
+             else if (s->answer.buf[0] == 'K') {
                      if (permitted & ALLOW_GOTO_PREVIOUS_HUNK)
                              hunk_index = dec_mod(hunk_index,
                                                   file_diff->hunk_nr);
@@ -1775,33 +1820,6 @@ static int patch_update_file(struct add_p_state *s,
              }
      }

-     /* Any hunk to be used? */
-     for (i = 0; i < file_diff->hunk_nr; i++)
-             if (file_diff->hunk[i].use == USE_HUNK)
-                     break;
-
-     if (i < file_diff->hunk_nr ||
-         (!file_diff->hunk_nr && file_diff->head.use == USE_HUNK)) {
-             /* At least one hunk selected: apply */
-             strbuf_reset(&s->buf);
-             reassemble_patch(s, file_diff, 0, &s->buf);
-
-             discard_index(s->s.r->index);
-             if (s->mode->apply_for_checkout)
-                     apply_for_checkout(s, &s->buf,
-                                        s->mode->is_reverse);
-             else {
-                     setup_child_process(s, &cp, "apply", NULL);
-                     strvec_pushv(&cp.args, s->mode->apply_args);
-                     if (pipe_command(&cp, s->buf.buf, s->buf.len,
-                                      NULL, 0, NULL, 0))
-                             error(_("'git apply' failed"));
-             }
-             if (repo_read_index(s->s.r) >= 0)
-                     repo_refresh_and_write_index(s->s.r, REFRESH_QUIET, 0,
-                                                  1, NULL, NULL, NULL);
-     }
It is not obvious why the above code needs to be hoisted to the
caller.
I explained this below.
quoted
      putchar('\n');
      return quit;
 }
@@ -1813,7 +1831,9 @@ int run_add_p(struct repository *r, enum add_p_mode mode,
      struct add_p_state s = {
              { r }, STRBUF_INIT, STRBUF_INIT, STRBUF_INIT, STRBUF_INIT
      };
-     size_t i, binary_count = 0;
+     size_t i, j, binary_count = 0;
+     size_t patch_update_result = 0;
Hmph, I think patch_update_file() returns "int quit".  Why do we
want overly wide type to store the result, which cannot even express
negative number to potentially signal a failure?
Sorry, this is a mistake on my part
quoted
+     struct child_process cp = CHILD_PROCESS_INIT;

      init_add_i_state(&s.s, r, o);
@@ -1852,11 +1872,56 @@ int run_add_p(struct repository *r, enum add_p_mode mode,
              return -1;
      }

-     for (i = 0; i < s.file_diff_nr; i++)
-             if (s.file_diff[i].binary && !s.file_diff[i].hunk_nr)
+     for (i = 0; i < s.file_diff_nr;) {
+             if (s.file_diff[i].binary && !s.file_diff[i].hunk_nr) {
                      binary_count++;
-             else if (patch_update_file(&s, s.file_diff + i))
-                     break;
+                     i++;
+                     continue;
+             }
+             else {
+                     patch_update_result = patch_update_file(&s, s.file_diff + i);
+                     if (patch_update_result == 0) {
+                             i++;
+                             continue;
+                     }
+                     if (patch_update_result == 1)
+                             break;
+                     if (patch_update_result == 2) {
+                             i--;
+                             continue;
+                     }
+             }
+     }
+     for (i = 0; i < s.file_diff_nr; i++) {
+
+                     /* Any hunk to be used? */
+             for (j = 0; j < s.file_diff[i].hunk_nr; j++)
+                     if (s.file_diff[i].hunk[j].use == USE_HUNK)
+                             break;
+
+             if (j < s.file_diff[i].hunk_nr ||
+         (!s.file_diff[i].hunk_nr && s.file_diff[i].head.use == USE_HUNK)) {
+                     /* At least one hunk selected: apply */
+                     strbuf_reset(&s.buf);
+                     reassemble_patch(&s, s.file_diff + i, 0, &s.buf);
+
+                     discard_index(s.s.r->index);
+                     if (s.mode->apply_for_checkout)
+                             apply_for_checkout(&s, &s.buf,
+                                             s.mode->is_reverse);
+                     else {
+                             setup_child_process(&s, &cp, "apply", NULL);
+                             strvec_pushv(&cp.args, s.mode->apply_args);
+                             if (pipe_command(&cp, s.buf.buf, s.buf.len,
+                                             NULL, 0, NULL, 0))
+                                     error(_("'git apply' failed"));
+                     }
+                     if (repo_read_index(s.s.r) >= 0)
+                             repo_refresh_and_write_index(s.s.r, REFRESH_QUIET, 0,
+                                                             1, NULL, NULL, NULL);
+             }
+
+     }
One upside of having "git apply" at the end of patch_update_file()
is that you can "^C" out of "git add -p" or your terminal connection
can be cut off, after dealing with hunks in a few early files, and
these early part of your work that you have already done are already
reflected to the working tree files.  By hoisting the logic to the
caller, this is making the update all-or-none, which is good in
transactional systems, but can make a horrible experience for an
interactive use where you make progress while thinking.

So I am not yet convinced if this change makes sense---it could be
because of the lack of justification for this change.
What I observed after adding the '>' and '<' options is that if a user chooses
to use a hunk A in file 1, and then goes to file 2 with '>', comes back to
file 1 with '<', and decides on hunk A to skip it instead, because
patch_update_file() has
applied the file with the hunk the user initially decided to use
before proceeding to file
2 with '>', coming back to redecide and say skip does not apply the
latest decision
and when you check the index, the file with the hunks which the user
initially decided to
use but changed to skip is present in the index.

But if the user initially decided to skip a hunk in a file, goes to
the next file with '>'
and back to the first file, changes the decision on the hunk to use,
it applies the patch
with the hunk because the hunk was not initially selected when the
patch was applied.
But if he now goes away and comes back to the file a third time and
chooses to skip the
hunk, then quits with 'q', because he had selected to use the hunk the
second time,
choosing skip again will not work.

So basically, initially choosing to use a hunk in a file, going to
another file and coming
back to this file then choosing to skip it does not register the
latest skip decision
on that hunk.

That was why I decided to do it this way.
I will appreciate a better suggestion from you

Thanks
Abraham.
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help