Filter out sdist archives that are not of interest
Closed, Resolved · Public

Description

Some PyPI origins declare sdist archives in formats that swh.core.tarball.uncompress does not support,
for instance .deb, .egg, .rpm or .whl. Loading such origins fails with errors and generates a lot of noise in
Sentry reports.

As the content of those archives does not match the standard sdist format (there is no PKG-INFO file, for instance),
we should filter them out of the artifacts to process for a PyPI origin.
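
A minimal sketch of such a filter, for illustration only. It assumes each artifact is a dict with the "packagetype" and "filename" keys found in the PyPI JSON API; the helper names and the list of accepted suffixes are hypothetical and do not reflect the actual loader code or the exact formats swh.core.tarball.uncompress handles.

```python
from typing import Dict, Iterable, List

# Assumed set of archive suffixes considered valid for an sdist.
# This list is an illustration, not the loader's real configuration.
SDIST_SUFFIXES = (".tar.gz", ".tgz", ".tar.bz2", ".tar.xz", ".tar", ".zip")


def is_sdist_of_interest(filename: str) -> bool:
    """Return True if the filename ends with an accepted archive suffix."""
    return filename.lower().endswith(SDIST_SUFFIXES)


def filter_sdists(artifacts: Iterable[Dict[str, str]]) -> List[Dict[str, str]]:
    """Keep only artifacts declared as sdist whose filename looks extractable.

    Artifacts declared as sdist but shipped as .deb, .egg, .rpm or .whl
    are dropped, so the loader never tries to uncompress them.
    """
    return [
        artifact
        for artifact in artifacts
        if artifact.get("packagetype") == "sdist"
        and is_sdist_of_interest(artifact.get("filename", ""))
    ]
```

Filtering on the filename suffix before extraction keeps the check cheap: no archive has to be downloaded or opened to decide whether it should be skipped.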

Event Timeline

anlambert triaged this task as Normal priority. Sep 15 2021, 1:40 PM
anlambert created this task.
anlambert renamed this task from "Filter out sdist archives that cannot be extracted by swh.core.tarball.uncompress" to "Filter out sdist archives that are not of interest". Sep 15 2021, 2:52 PM
anlambert updated the task description.
anlambert claimed this task.

Fix has been deployed to production, closing this.