storage: Add methods to compute directories/revisions diff

Description

This commit implements an efficient algorithm that compares two directory
trees and computes the list of file changes between them: additions,
deletions, modifications, and renames.
The implementation lives in the diff module of the new swh.storage.algos
namespace.

That algorithm is used to extend the storage API with the following methods:

  • diff_directories: compute diff between two arbitrary directories
  • diff_revisions: compute diff between two arbitrary revisions
  • diff_revision: compute diff between a revision and its first parent
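
To illustrate the kind of comparison described above, here is a minimal sketch in Python. It is not the code from swh.storage.algos: the nested-dict tree model, the function names flatten and diff_trees, and the change labels are all assumptions made for this example. For clarity it flattens both trees and does not skip unchanged subtrees, so it makes no claim to the efficiency of the actual algorithm.

```python
from typing import Dict, List, Optional, Tuple, Union

# Hypothetical in-memory model (not the swh.storage schema): a directory maps
# entry names to either a content hash (str, a file) or a nested directory.
Tree = Dict[str, Union[str, "Tree"]]
Change = Tuple[str, Optional[str], Optional[str]]


def flatten(tree: Tree, prefix: str = "") -> Dict[str, str]:
    """Return {path: content_hash} for every file found under tree."""
    files: Dict[str, str] = {}
    for name, entry in tree.items():
        path = prefix + name
        if isinstance(entry, dict):
            files.update(flatten(entry, path + "/"))
        else:
            files[path] = entry
    return files


def diff_trees(old: Tree, new: Tree) -> List[Change]:
    """Return sorted (change_type, from_path, to_path) tuples."""
    old_files, new_files = flatten(old), flatten(new)
    changes: List[Change] = []
    deleted: Dict[str, List[str]] = {}  # content hash -> paths removed

    for path, content_hash in old_files.items():
        if path not in new_files:
            deleted.setdefault(content_hash, []).append(path)
        elif new_files[path] != content_hash:
            changes.append(("modify", path, path))

    for path, content_hash in new_files.items():
        if path in old_files:
            continue
        if deleted.get(content_hash):
            # Same content vanished elsewhere: report it as a rename.
            changes.append(("rename", deleted[content_hash].pop(0), path))
        else:
            changes.append(("insert", None, path))

    # Removed paths that were not matched to a new path are true deletions.
    for paths in deleted.values():
        for path in paths:
            changes.append(("delete", path, None))

    return sorted(changes, key=lambda c: (c[0], c[1] or "", c[2] or ""))


old_tree = {"a.txt": "h1", "src": {"b.py": "h2", "c.py": "h3"}, "old.md": "h4"}
new_tree = {"a.txt": "h1x", "src": {"b.py": "h2", "d.py": "h3"}, "new.md": "h5"}
result = diff_trees(old_tree, new_tree)
```

In this sketch a rename is detected only when a removed path and a new path carry the same content hash; a file that is both moved and edited would show up as a delete plus an insert.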

Related T921

Closes D295

Details

Provenance
anlambert authored on Jan 26 2018, 4:52 PM
anlambert pushed on Feb 20 2018, 1:19 PM
Parents
rDSTO6b80cff86bec: Add a new table for "bucketed" object counts
References
tag: v0.0.99