HomeSoftware Heritage

cassandra: Make content_missing query in batches

This commit no longer exists in the repository. It may have been part of a branch which was deleted.

Description

cassandra: Make content_missing query in batches

Instead of calling content_find() for each object, which needs to make
two queries for each.

Given the latency of Cassandra queries, this should be a significant
speed-up (possibly up to 100 times faster, as this is the value of
PARTITION_KEY_RESTRICTION_MAX_SIZE).

This also changes the schema, because CQL does not allow doing IN
queries on compound partition keys.

Details

Commit No Longer Exists

This commit no longer exists in the repository.