Page MenuHomeSoftware Heritage

Check all objects in the production storage/journal have a correct hash
Closed, DuplicatePublic

Description

If not, it means there was a data corruption / bug somewhere in the ingestion process or database, and it means to be fixed.

Incorrect hashes would also prevent replay on storage instances with the validating storage proxy enabled.

Event Timeline

vlorentz triaged this task as Normal priority.Feb 1 2021, 12:38 PM
vlorentz created this task.

This is a duplicate of T75, the history of which would probably be useful to take into account (I suspect it can be closed).

There's also a few known bugs in imported data, e.g. zero-padded directories https://github.com/pallets/flask/issues/2029.