Paths

Table of Contentst

Diffusion Storage manager 850a7553b6d5

Add support for a redis-based reporting for invalid mirrorred objects
850a7553b6d5
Actions

Tags

None

Subscribers

None

Description

Add support for a redis-based reporting for invalid mirrorred objects

The idea is that we check the BaseModel validity at journal
deserialization time so that we still have access to the raw object from
kafka for complete reporting (object id plus raw message from kafka).

This uses a new ModelObjectDeserializer class that is responsible for
deserializing the kafka message (still using kafka_to_value) then
immediately create the BaseModel object from that dict. Its convert
method is then passed as value_deserializer argument of the
JournalClient.

Then, for each deserialized object from kafka, if it's a HashableObject,
check its validity by comparing the computed hash with its id.

If it's invalid, report the error in logs, and if configured, register the
invalid object in via the reporter callback.

In the cli code, a Redis.set() is used a such a callback (if configured).
So it simply stores invalid objects using the object id a key (typically its
swhid), and the raw kafka message value as value.

Related to T3693.

Details

Provenance

douardda	Authored on Oct 27 2021, 5:31 PM
douardda	Pushed on Nov 9 2021, 5:28 PM

Differential Revision

D6571: Add support for a redis-based reporting for invalid mirrorred objects

Parents

rDSTO04bd15a0bca8: Refactor fixer.fix_objects() to extract the inner object_fixers dict

Branches

Unknown

Tags

Unknown

References

Tasks

T3693: Provide a mecanism to report (with persistence) objects that fails to get replayed (mirror)

Build Status

Buildable 24966
Build 39014: test-and-build	Jenkins console · Jenkins

Event Timeline

douardda committed rDSTO850a7553b6d5: Add support for a redis-based reporting for invalid mirrorred objects (authored by douardda).Nov 9 2021, 4:36 PM

douardda added a task: T3693: Provide a mecanism to report (with persistence) objects that fails to get replayed (mirror).Nov 9 2021, 5:28 PM

douardda added an edge: D6571: Add support for a redis-based reporting for invalid mirrorred objects.

Harbormaster completed building B24966: rDSTO850a7553b6d5: Add support for a redis-based reporting for invalid mirrorred objects.Nov 9 2021, 5:37 PM

swh-public-ci mentioned this in D6652: WIP: Add AsyncRemoteStorage.Nov 17 2021, 6:24 PM

swh-public-ci mentioned this in D6768: test_cassandra: Fix failing tests since swh-model update.Dec 7 2021, 1:46 PM

Changes (9)

Path

Size

requirements-swh-journal.txt

requirements-test.txt

requirements.txt

swh/

storage/

tests/

test_backfill.py

rDSTO850a7553b6d5

requirements-swh-journal.txt

Loading...

requirements-test.txt

Loading...

requirements.txt

Loading...

swh/storage/backfill.py

Loading...

swh/storage/cli.py

Loading...

swh/storage/replay.py

Loading...

swh/storage/tests/test_backfill.py

Loading...

swh/storage/tests/test_cli.py

Loading...

swh/storage/tests/test_replay.py

Loading...