Page MenuHomeSoftware Heritage

PyPI loaderFolder
ActivePublic

Members

  • This project does not have any members.
  • View All

Watchers

  • This project does not have any watchers.
  • View All

Details

Description

Ingest source code archives available from the Python Package Index (PyPI)

Recent Activity

Jan 8 2023

gitlab-migration closed T4512: Strip first level directory when loading tarballs from PyPI? as Migrated.

This task has been migrated to GitLab.

Jan 8 2023, 5:05 PM · PyPI loader
gitlab-migration closed T3449: pypi / package loader: handle dangling branches as Migrated.

This task has been migrated to GitLab.

Jan 8 2023, 5:03 PM · Package Loader, Easy hack, PyPI loader
gitlab-migration changed the status of T3576: Add support for Python eggs file processing from Invalid to Migrated.

This task has been migrated to GitLab.

Jan 8 2023, 4:35 PM · PyPI loader
gitlab-migration changed the status of T3575: Filter out sdist archives that are not of interest from Resolved to Migrated.

This task has been migrated to GitLab.

Jan 8 2023, 4:35 PM · PyPI loader
gitlab-migration changed the status of T2667: Decide what to do with PyPI snapshot metadata from Resolved to Migrated.

This task has been migrated to GitLab.

Jan 8 2023, 4:31 PM · Extrinsic metadata, PyPI loader
gitlab-migration changed the status of T1396: loader-pypi: Snapshot with null branch are badly handled by loader from Resolved to Migrated.

This task has been migrated to GitLab.

Jan 8 2023, 4:26 PM · PyPI loader
gitlab-migration changed the status of T1206: PyPI loader fails when the metadata is incomplete from Resolved to Migrated.

This task has been migrated to GitLab.

Jan 8 2023, 4:25 PM · PyPI loader
gitlab-migration changed the status of T421: PyPI loader from Resolved to Migrated.

This task has been migrated to GitLab.

Jan 8 2023, 4:19 PM · PyPI loader, Origin-Pypi

Oct 19 2022

gitlab-migration closed T3570: Upgrade python3-pkginfo debian package to latest upstream version as Migrated.

This task has been migrated to GitLab.

Oct 19 2022, 6:04 PM · System administration, PyPI loader

Sep 8 2022

vlorentz added a comment to T4512: Strip first level directory when loading tarballs from PyPI?.
In T4512#90697, @olasd wrote:

Diverging from the layout of the original tarball may make efforts to keep the metadata needed to efficiently rebuild original tarballs (via disarchive) harder.

Sep 8 2022, 2:02 PM · PyPI loader
anlambert added a comment to T4512: Strip first level directory when loading tarballs from PyPI?.

But clearly it's not great. I wonder if we could do something about this in swh-web instead

Sep 8 2022, 12:20 PM · PyPI loader
olasd added a comment to T4512: Strip first level directory when loading tarballs from PyPI?.

I agree that the UX of switching branches from a release to another on snapshots of PyPI origins is not good.

Sep 8 2022, 12:17 PM · PyPI loader
vlorentz added a comment to T4512: Strip first level directory when loading tarballs from PyPI?.

I don't think it would be appropriate to remove that directory; we try to reproduce tarball faithfully. And there might be other entries at the root (eg. when loading .jar, there would typically be only two directories at the root).

Sep 8 2022, 12:03 PM · PyPI loader
vlorentz triaged T4512: Strip first level directory when loading tarballs from PyPI? as Normal priority.
Sep 8 2022, 12:01 PM · PyPI loader
lunar created T4512: Strip first level directory when loading tarballs from PyPI?.
Sep 8 2022, 11:56 AM · PyPI loader

Oct 15 2021

anlambert closed T3575: Filter out sdist archives that are not of interest as Resolved.

Fix has been deployed to production, closing this.

Oct 15 2021, 5:30 PM · PyPI loader

Sep 15 2021

anlambert added a revision to T3575: Filter out sdist archives that are not of interest: D6270: pypi/loader: Filter out sdist archives not of interest.
Sep 15 2021, 3:39 PM · PyPI loader
anlambert renamed T3575: Filter out sdist archives that are not of interest from Filter out sdist archives that cannot be extracted by swh.core.tarball.uncompress to Filter out sdist archives that are not of interest.
Sep 15 2021, 2:52 PM · PyPI loader
anlambert closed T3576: Add support for Python eggs file processing as Invalid.

Apparently we decided not to archive them so better filtering those files out as proposed in T3575.

Sep 15 2021, 2:47 PM · PyPI loader
anlambert updated the task description for T3576: Add support for Python eggs file processing .
Sep 15 2021, 2:47 PM · PyPI loader
anlambert added a revision to T3576: Add support for Python eggs file processing : D6267: tarball: Add support for uncompressing egg files.
Sep 15 2021, 2:22 PM · PyPI loader
anlambert updated the task description for T3576: Add support for Python eggs file processing .
Sep 15 2021, 2:00 PM · PyPI loader
anlambert updated the task description for T3576: Add support for Python eggs file processing .
Sep 15 2021, 2:00 PM · PyPI loader
anlambert triaged T3576: Add support for Python eggs file processing as Normal priority.
Sep 15 2021, 1:58 PM · PyPI loader
anlambert triaged T3575: Filter out sdist archives that are not of interest as Normal priority.
Sep 15 2021, 1:40 PM · PyPI loader

Sep 13 2021

anlambert added a project to T3570: Upgrade python3-pkginfo debian package to latest upstream version: System administration.
Sep 13 2021, 5:27 PM · System administration, PyPI loader
anlambert triaged T3570: Upgrade python3-pkginfo debian package to latest upstream version as Normal priority.
Sep 13 2021, 4:29 PM · System administration, PyPI loader

Jul 29 2021

vlorentz triaged T3449: pypi / package loader: handle dangling branches as Normal priority.
Jul 29 2021, 4:03 PM · Package Loader, Easy hack, PyPI loader

Nov 2 2020

vlorentz closed T2667: Decide what to do with PyPI snapshot metadata as Resolved.
Nov 2 2020, 1:51 PM · Extrinsic metadata, PyPI loader

Oct 13 2020

vlorentz added a revision to T2667: Decide what to do with PyPI snapshot metadata: D4242: pypi: write metadata on revisions instead of snapshots..
Oct 13 2020, 11:04 AM · Extrinsic metadata, PyPI loader
vlorentz added a comment to T2667: Decide what to do with PyPI snapshot metadata.

We don't keep the binary indexes from Debian repositories, for instance.

Oct 13 2020, 10:22 AM · Extrinsic metadata, PyPI loader
olasd added a comment to T2667: Decide what to do with PyPI snapshot metadata.

So they're metadata specific to files that we don't archive at all because they're not source? That doesn't sound very useful to keep at all. We don't keep the binary indexes from Debian repositories, for instance.

Oct 13 2020, 10:18 AM · Extrinsic metadata, PyPI loader
vlorentz added a comment to T2667: Decide what to do with PyPI snapshot metadata.

They are metadata on the file itself (file name, checksums, has signature, upload time, file-specific comment (often empty), yank status), so they have nothing in common

Oct 13 2020, 10:16 AM · Extrinsic metadata, PyPI loader
olasd added a comment to T2667: Decide what to do with PyPI snapshot metadata.

In practice, is there many meaningful differences between the wheel metadata and the sdist metadata? If not then I think option 3 would be the most sensible.

Oct 13 2020, 9:59 AM · Extrinsic metadata, PyPI loader
vlorentz updated the task description for T2667: Decide what to do with PyPI snapshot metadata.
Oct 13 2020, 9:45 AM · Extrinsic metadata, PyPI loader

Oct 6 2020

vlorentz triaged T2667: Decide what to do with PyPI snapshot metadata as Normal priority.
Oct 6 2020, 10:19 AM · Extrinsic metadata, PyPI loader

Oct 1 2019

ardumont edited P320 loader errors per loader type: ~/.config/swh/kibana/group-by.yml.
Oct 1 2019, 10:06 AM · Git loader, Mercurial loader, PyPI loader

Sep 24 2019

ardumont added a comment to D2028: [wip] package-loader: Implement a common package loader mechanism.

roh, i did not want to close it...
I just wanted to update the diff... oh, well!

Sep 24 2019, 2:39 PM · Npm loader, PyPI loader, Origin-GNU, Core Loader
ardumont closed D2028: [wip] package-loader: Implement a common package loader mechanism.
Sep 24 2019, 2:37 PM · Npm loader, PyPI loader, Origin-GNU, Core Loader
ardumont updated the summary of D2028: [wip] package-loader: Implement a common package loader mechanism.
Sep 24 2019, 10:08 AM · Npm loader, PyPI loader, Origin-GNU, Core Loader

Nov 30 2018

ardumont closed T1396: loader-pypi: Snapshot with null branch are badly handled by loader as Resolved.

And deployed.

Nov 30 2018, 10:53 AM · PyPI loader

Nov 29 2018

ardumont renamed T1396: loader-pypi: Snapshot with null branch are badly handled by loader from loader-pypi: badly(?) formatted snapshot targets None releases to loader-pypi: Snapshot with null branch are badly handled by loader.
Nov 29 2018, 9:27 PM · PyPI loader
ardumont added a comment to T1396: loader-pypi: Snapshot with null branch are badly handled by loader.

Then I think these snapshots do look as expected, and the surrounding code should be adapted :)

Nov 29 2018, 5:19 PM · PyPI loader
olasd added a comment to T1396: loader-pypi: Snapshot with null branch are badly handled by loader.

Then I think these snapshots do look as expected, and the surrounding code should be adapted :)

Nov 29 2018, 5:17 PM · PyPI loader
anlambert added a comment to T1396: loader-pypi: Snapshot with null branch are badly handled by loader.

Looks related to source packages without the presence of the PKG-INFO file, see debug output of the loader below:

Nov 29 2018, 10:52 AM · PyPI loader
olasd added a comment to T1396: loader-pypi: Snapshot with null branch are badly handled by loader.

I'd expect branches will have a null target if that release only has binary distributions, but that's not the case for configpy. Needs to be investigated further

Nov 29 2018, 10:35 AM · PyPI loader
ardumont triaged T1396: loader-pypi: Snapshot with null branch are badly handled by loader as Normal priority.
Nov 29 2018, 10:05 AM · PyPI loader
ardumont created T1396: loader-pypi: Snapshot with null branch are badly handled by loader.
Nov 29 2018, 10:05 AM · PyPI loader

Oct 20 2018

ardumont edited P320 loader errors per loader type: ~/.config/swh/kibana/group-by.yml.
Oct 20 2018, 12:54 PM · Git loader, Mercurial loader, PyPI loader

Oct 19 2018

ardumont edited P320 loader errors per loader type: ~/.config/swh/kibana/group-by.yml.
Oct 19 2018, 10:20 AM · Git loader, Mercurial loader, PyPI loader