Skip to content
  • Elijah Newren's avatar
    read-cache: fix directory/file conflict handling in read_index_unmerged() · ad376204
    Elijah Newren authored and Junio C Hamano's avatar Junio C Hamano committed
    read_index_unmerged() has two intended purposes:
      * return 1 if there are any unmerged entries, 0 otherwise
      * drops any higher-stage entries down to stage #0
    
    There are several callers of read_index_unmerged() that check the return
    value to see if it is non-zero, all of which then die() if that condition
    is met.  For these callers, dropping higher-stage entries down to stage #0
    is a waste of resources, and returning immediately on first unmerged entry
    would be better.  But it's probably only a very minor difference and isn't
    the focus of this series.
    
    The remaining callers ignore the return value and call this function for
    the side effect of dropping higher-stage entries down to stage #0.  As
    mentioned in commit e11d7b59 ("'reset --merge': fix unmerged case",
    2009-12-31),
    
        The _only_ reason we want to keep a previously unmerged entry in the
        index at stage #0 is so that we don't forget the fact that we have
        corresponding file in the work tree in order to be able to remove it
        when the tree we are resetting to does not have the path.
    
    In fact, prior to commit d1a43f2a
    
     ("reset --hard/read-tree --reset -u:
    remove unmerged new paths", 2008-10-15), read_index_unmerged() did just
    remove unmerged entries from the cache immediately but that had the
    unwanted effect of leaving around new untracked files in the tree from
    aborted merges.
    
    So, that's the intended purpose of this function.  The problem is that
    when directory/files conflicts are present, trying to add the file to the
    index at stage 0 fails (because there is still a directory in the way),
    and the function returns early with a -1 return code to signify the error.
    As noted above, none of the callers who want the drop-to-stage-0 behavior
    check the return status, though, so this means all remaining unmerged
    entries remain in the index and the callers proceed assuming otherwise.
    Users then see errors of the form:
    
        error: 'DIR-OR-FILE' appears as both a file and as a directory
        error: DIR-OR-FILE: cannot drop to stage #0
    
    and potentially also messages about other unmerged entries which came
    lexicographically later than whatever pathname was both a file and a
    directory.  Google finds a few hits searching for those messages,
    suggesting there were probably a couple people who hit this besides me.
    Luckily, calling `git reset --hard` multiple times would workaround
    this bug.
    
    Since the whole purpose here is to just put the entry *temporarily* into
    the index so that any associated file in the working copy can be removed,
    we can just skip the DFCHECK and allow both the file and directory to
    appear in the index.  The temporary simultaneous appearance of the
    directory and file entries in the index will be removed by the callers
    by calling unpack_trees(), which excludes these unmerged entries marked
    with CE_CONFLICTED flag from the resulting index, before they attempt to
    write the index anywhere.
    
    Signed-off-by: default avatarElijah Newren <newren@gmail.com>
    Signed-off-by: default avatarJunio C Hamano <gitster@pobox.com>
    ad376204