Page MenuHomeSoftware Heritage

Provide stats on extracted metadata in the indexer storage api
Closed, ResolvedPublic

Description

Number of origins that were indexed, how many have a non-empty set of metadata, breakdown per metadata type.

Event Timeline

vlorentz triaged this task as Normal priority.

Useful queries:

select count(*) from origin_intrinsic_metadata;
select count(*) from origin_intrinsic_metadata where metadata != '{"@context": "https://doi.org/10.5063/schema/codemeta-2.0"}';

(The latter is a hack, for a long-term solution, doing JSON operations to check if there is any key other than @context would be better.)

vlorentz renamed this task from Show stats on extracted metadata to Provide stats on extracted metadata.
vlorentz renamed this task from Provide stats on extracted metadata to Provide stats on extracted metadata in the indexer storage api.
vlorentz closed this task as Resolved.Feb 7 2019, 3:50 PM