Because the indexer will switch to another tool for indexing, the current bogus values won't be updated, so they should be cleaned up.
Description
Status | Assigned | Task
---|---|---
Migrated | gitlab-migration | T713 Index existing contents (mimetype, language, license)
Migrated | gitlab-migration | T817 analyze bogus mimetype values in content_mimetype table
Migrated | gitlab-migration | T854 clean up bogus mimetype values in content_mimetype table
Event Timeline
Bogus mimetype values are identified by the following query:
softwareheritage=> select count(*) from content_mimetype where mimetype LIKE '[%' or mimetype like '' and indexer_configuration_id=7;
 count
-------
 50733
(1 row)
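One point worth double-checking in this predicate: in SQL, AND binds tighter than OR, so without parentheses the indexer_configuration_id=7 filter applies only to the empty-string branch, not to the '[%' branch. The sketch below illustrates the difference on an in-memory SQLite table used as a stand-in for the real PostgreSQL database. The sample rows are made up for illustration, and whether the parenthesized form is the intended one is an assumption, not something the ticket states.

```python
import sqlite3

# In-memory SQLite stand-in for the real table; all rows below are made up.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE content_mimetype ("
    "id INTEGER, mimetype TEXT, indexer_configuration_id INTEGER)"
)
conn.executemany(
    "INSERT INTO content_mimetype VALUES (?, ?, ?)",
    [
        (1, "[bogus", 7),      # bogus value, configuration 7
        (2, "", 7),            # empty value, configuration 7
        (3, "text/plain", 7),  # valid value, configuration 7
        (4, "[bogus", 8),      # bogus value, another configuration
        (5, "", 8),            # empty value, another configuration
    ],
)

# Predicate as written in the ticket: parsed as A OR (B AND C).
AS_WRITTEN = (
    "SELECT count(*) FROM content_mimetype "
    "WHERE mimetype LIKE '[%' OR mimetype LIKE '' "
    "AND indexer_configuration_id = 7"
)

# Hypothetical variant that scopes both branches to configuration 7.
SCOPED = (
    "SELECT count(*) FROM content_mimetype "
    "WHERE (mimetype LIKE '[%' OR mimetype LIKE '') "
    "AND indexer_configuration_id = 7"
)

as_written_count = conn.execute(AS_WRITTEN).fetchone()[0]
scoped_count = conn.execute(SCOPED).fetchone()[0]
print(as_written_count, scoped_count)
```

On this sample data the as-written form also counts row 4, which belongs to another configuration. If no other configuration holds values starting with '[', both forms give the same result on the real table.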
As soon as the index is ready on prado (I did the analysis on somerset), the cleanup will be done with:
delete from content_mimetype where mimetype LIKE '[%' or mimetype like '' and indexer_configuration_id=7;
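A cautious way to run such a cleanup is to delete inside a transaction and commit only if the number of removed rows matches the count obtained beforehand. The following is a minimal sketch of that idea, again on an in-memory SQLite stand-in with made-up rows. The helper name `cleanup` is hypothetical, and the predicate is copied as written in the ticket.

```python
import sqlite3

# Predicate copied as written in the ticket.
BOGUS = "mimetype LIKE '[%' OR mimetype LIKE '' AND indexer_configuration_id = 7"


def cleanup(conn, expected):
    """Delete matching rows; roll back unless exactly `expected` rows went away."""
    # sqlite3 implicitly opens a transaction before the DELETE.
    cur = conn.execute(f"DELETE FROM content_mimetype WHERE {BOGUS}")
    if cur.rowcount != expected:
        conn.rollback()  # count mismatch: leave the table untouched
        return False
    conn.commit()
    return True


def remaining(conn):
    """Return the number of rows currently in the table."""
    return conn.execute("SELECT count(*) FROM content_mimetype").fetchone()[0]


# Made-up sample data standing in for the real table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE content_mimetype ("
    "id INTEGER, mimetype TEXT, indexer_configuration_id INTEGER)"
)
conn.executemany(
    "INSERT INTO content_mimetype VALUES (?, ?, ?)",
    [(1, "[bogus", 7), (2, "", 7), (3, "text/plain", 7), (4, "[bogus", 8)],
)
conn.commit()  # make sure a later rollback cannot undo the setup

rejected = cleanup(conn, expected=99)  # wrong expectation: rolled back
rows_after_reject = remaining(conn)
accepted = cleanup(conn, expected=3)   # matches the predicate: committed
rows_after_accept = remaining(conn)
print(rejected, rows_after_reject, accepted, rows_after_accept)
```

On the real database, the expected value would be the count reported by the select above (50733 at analysis time, on somerset), re-checked on prado right before the delete.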