content integrity checker
Closed, Migrated · Edits Locked

Description

No matter how many backup copies we have (see T239), each object contained in each backup copy should be periodically checked for integrity, for protection against bit flips and other sources of corruption.

While bit flip protection can be offered by low-level mechanisms at the disk and/or file system level, we might still want to periodically check all objects (e.g., with swh.storage.ObjStorage.check) to protect against human- or application-level errors.
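For illustration, here is a minimal sketch of such a periodic sweep, assuming an objstorage-like client that is iterable over object ids and exposes a check(obj_id) method raising an exception on corrupted or missing content (modeled on the swh.objstorage API; the names are assumptions, not the actual implementation):

  import logging

  def sweep(objstorage):
      """Check every object currently in the storage once."""
      for obj_id in objstorage:          # iterate over all object ids
          try:
              objstorage.check(obj_id)   # recompute the hash, compare to the id
          except Exception as exc:       # corrupted or missing content
              logging.error("integrity failure for %s: %s", obj_id, exc)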

Event Timeline

olasd changed the visibility from "All Users" to "Public (No Login Required)". May 13 2016, 5:08 PM

I see two options for the checker: it can run on each backup server, or only on the master storage, which would manage check scheduling and ordering.

  • The first option means we won't increase the master's load, but it requires the slaves to run some extra software (celery, cron?)
  • The second has a bigger impact on the master's load, but would allow the slave storages to remain storage-oriented, with only the objstorage api.server module running. It also allows the master storage to get the result and, if needed, reschedule the archival of a corrupted content. However, it also means the check request is sent over HTTP (though the check itself runs entirely on the backup); see the sketch after this list.
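As a rough illustration of the second option, the master could drive checks through each backup's objstorage HTTP API along these lines (remote_storage and reschedule_archival are hypothetical stand-ins, not actual swh modules):

  def check_on_backup(remote_storage, obj_id, reschedule_archival):
      try:
          # The check itself runs on the backup; only the request
          # and the result travel over HTTP.
          remote_storage.check(obj_id)
      except Exception:
          # Corrupted or missing on that backup: ask the archiver
          # to re-create a copy from another backup.
          reschedule_archival(obj_id)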

Check order

To be sure all the content gets checked, one idea is to start with the content whose last check is oldest: every X <amount of time>, the checker checks the first Y contents sorted by oldest last check time (see the sketch below).
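A sketch of that stateful ordering, assuming a hypothetical content table with a last_check column in a PostgreSQL database (table and column names are illustrative):

  import logging
  import time

  BATCH_SIZE = 1000   # Y
  PERIOD = 3600       # X, expressed in seconds

  def run(db, objstorage):
      while True:
          cur = db.cursor()
          # Never-checked contents (NULL last_check) come first.
          cur.execute("""SELECT id FROM content
                         ORDER BY last_check ASC NULLS FIRST
                         LIMIT %s""", (BATCH_SIZE,))
          for (obj_id,) in cur.fetchall():
              try:
                  objstorage.check(obj_id)
              except Exception:
                  logging.error("corrupted content: %s", obj_id)
              cur.execute("UPDATE content SET last_check = now()"
                          " WHERE id = %s", (obj_id,))
          db.commit()
          time.sleep(PERIOD)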

A content is checked when archived, so there would be no need to catch up on checks after the archival completes.

In T304#6389, @qcampos wrote:

I see two options for the checker: it can run on each backup server, or only on the master storage, which would manage check scheduling and ordering.

  • The first option means we won't increase the master's load, but it requires the slaves to run some extra software (celery, cron?)

I think that in the beginning this option is preferable, because it makes the archive copies more independent and more resilient (e.g., a single failure in the checking routine will not necessarily impact other copies).
I also like the fact that, upon corruption detection, individual copies can take care of restoring from other copies without central coordination.

In terms of deployment, we should aim at an independent checker daemon that does not need external scheduling logic (à la celery). It can either be something that runs in the background forever and checks at its own pace, or something that is run externally by cron. Given the amount of data, I suspect it will just be permanently busy, so running independently, without even needing cron, is probably better and easier (see the sketch below).
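A minimal shape for such a daemon, with batch selection and corruption handling passed in as callables (all names are illustrative, not the deployed module):

  import time

  def checker_daemon(objstorage, pick_batch, handle_corruption, pause=0.0):
      # Runs forever at its own pace; no celery or cron involved.
      while True:
          for obj_id in pick_batch():
              try:
                  objstorage.check(obj_id)
              except Exception:
                  handle_corruption(obj_id)
          time.sleep(pause)   # with this much data, likely permanently busy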

Check order

To be sure all the content gets checked, one idea is to start with the content whose last check is oldest: every X <amount of time>, the checker checks the first Y contents sorted by oldest last check time.

I wasn't thinking of stateful checks that keep track of when objects have been checked (e.g., in a DB table), but rather of a stateless probabilistic approach that only guarantees that, in the long run, every object gets a chance of being checked. My rationale is that a bit flip might occur the moment after you've checked an object, so no matter what, what we're doing here only gives guarantees up to a given failure probability. Hence the simplest approach that could possibly work is probably also the best one.
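A sketch of that stateless approach: sample objects uniformly at random and keep no record of what was checked when (all_obj_ids, taken here as a list of ids, and handle_corruption are assumptions for illustration):

  import random

  def probabilistic_check(objstorage, all_obj_ids, handle_corruption,
                          sample_size=1000):
      # No persistent state: over enough rounds, every object gets
      # checked with probability approaching 1, which is all this
      # scheme promises.
      while True:
          for obj_id in random.sample(all_obj_ids, sample_size):
              try:
                  objstorage.check(obj_id)
              except Exception:
                  handle_corruption(obj_id)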

A content is checked when archived, so there would be no need to catch up on checks after the archival completes.

Unfortunately no, for the reason above (bit flips appear at random).

An update on this: there are 3 checker implementations (all of them puppetized independently; a rough shape is sketched after the list):

  • LogChecker, which simply logs corrupted contents
  • RepairChecker (inherits LogChecker) - It fixes any corrupted content it encounters by politely asking the backup storages for their copies, iterating over the backups until one sends the copy. If none is found, it logs it.
  • ArchiveNotifierChecker (inherits LogChecker) - It updates the archiver's db with that content's new 'corrupted' or 'missing' status. It is then up to the archiver to do its job.
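For reference, the class hierarchy roughly looks like this (method names, the add/get calls, and the archiver db interface are paraphrased from the description above, not the actual swh source):

  import logging

  class LogChecker:
      def handle_corruption(self, obj_id):
          logging.error("corrupted content: %s", obj_id)

  class RepairChecker(LogChecker):
      def __init__(self, objstorage, backups):
          self.objstorage = objstorage
          self.backups = backups

      def handle_corruption(self, obj_id):
          # Ask each backup in turn for its copy; restore from the
          # first one that answers, fall back to logging otherwise.
          for backup in self.backups:
              try:
                  content = backup.get(obj_id)
              except Exception:
                  continue
              self.objstorage.add(content, obj_id)  # overwrite the bad copy
              return
          super().handle_corruption(obj_id)

  class ArchiveNotifierChecker(LogChecker):
      def __init__(self, archiver_db):
          self.archiver_db = archiver_db

      def handle_corruption(self, obj_id, status='corrupted'):
          # Record the new 'corrupted' or 'missing' status; the
          # archiver then takes care of re-archiving the content.
          self.archiver_db.update_status(obj_id, status)  # hypothetical API
          super().handle_corruption(obj_id)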

I think this can be closed.

@qcampos Am I missing something?

That's accurate, @ardumont. Nothing to add. Thanks for the puppet stuff!

This closes T304; we just need to deploy the checker and it should be running.