Jan 8 2023
gitlab-migration changed the status of T2023: Re-implement gnu loader with package loader, a subtask of T1389: Implement a base "package" loader for package managers, from Resolved to Migrated.
gitlab-migration changed the status of T2022: Re-implement npm loader with base loader, a subtask of T1389: Implement a base "package" loader for package managers, from Resolved to Migrated.
gitlab-migration changed the status of T2021: Re-implement pypi loader with package loader from Resolved to Migrated.
gitlab-migration changed the status of T2021: Re-implement pypi loader with package loader, a subtask of T1389: Implement a base "package" loader for package managers, from Resolved to Migrated.
gitlab-migration changed the status of T1389: Implement a base "package" loader for package managers from Resolved to Migrated.
gitlab-migration changed the status of T1181: pypi: Schedule ingestion, a subtask of T419: ingest PyPI into the Software Heritage archive (meta task), from Resolved to Migrated.
gitlab-migration changed the status of T420: mirror PyPI, a subtask of T419: ingest PyPI into the Software Heritage archive (meta task), from Wontfix to Migrated.
gitlab-migration changed the status of T422: PyPI lister, a subtask of T419: ingest PyPI into the Software Heritage archive (meta task), from Resolved to Migrated.
gitlab-migration changed the status of T419: ingest PyPI into the Software Heritage archive (meta task) from Resolved to Migrated.
gitlab-migration changed the status of T3399: Improve PyPI lister to pull last update information when running incrementally from Resolved to Migrated.
gitlab-migration changed the status of T2025: Re-implement debian loader with package loader mechanism, a subtask of T1389: Implement a base "package" loader for package managers, from Resolved to Migrated.
gitlab-migration changed the status of T2024: Re-implement deposit loader with package loader mechanism, a subtask of T1389: Implement a base "package" loader for package managers, from Resolved to Migrated.
gitlab-migration changed the status of T2017: package loader: Discuss revision metadata normalization, a subtask of T1389: Implement a base "package" loader for package managers, from Resolved to Migrated.
gitlab-migration changed the status of T421: PyPI loader, a subtask of T419: ingest PyPI into the Software Heritage archive (meta task), from Resolved to Migrated.
Jul 9 2021
ardumont closed T3399: Improve PyPI lister to pull last update information when running incrementally as Resolved.
Deployed and running, so closing.
ardumont added a comment to T3399: Improve PyPI lister to pull last update information when running incrementally.
Deployed in production as well and triggered a run:
ardumont added a comment to T3399: Improve PyPI lister to pull last update information when running incrementally.
From a quick test, it looks like the "Project and release activity details" feed can go back multiple years without any issue, allowing us to backfill the data for all known origins, before adding the incremental behavior to the lister.
ardumont added a comment to T3399: Improve PyPI lister to pull last update information when running incrementally.
Deployed in staging and triggered a run:
swh-public-ci added a comment to D5977: Make PyPI lister incremental and complete in regards to last_update.
Build is green
ardumont updated the diff for D5977: Make PyPI lister incremental and complete in regards to last_update.
- Adapt according to the last suggestion about the stub
- Following the IRC discussion, add a sleep call to avoid the initial throttling caused by initializing the XML-RPC client [1]
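The idea of pausing before the first request and backing off when throttled can be sketched as follows. This is only an illustration, not the lister's actual code: `Throttled`, `call_with_initial_delay`, and all parameter names are hypothetical, and the real change relies on the project's own throttling decorators.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class Throttled(Exception):
    """Hypothetical stand-in for whatever error signals server-side throttling."""


def call_with_initial_delay(
    call: Callable[[], T],
    initial_delay: float = 1.0,
    max_retries: int = 3,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Sleep once before the first call, then retry with doubled delays."""
    # Pause before the very first request, so that it does not immediately
    # follow the client initialization.
    sleep(initial_delay)
    delay = initial_delay
    for attempt in range(max_retries + 1):
        try:
            return call()
        except Throttled:
            if attempt == max_retries:
                raise  # retries exhausted: let the caller see the error
            delay *= 2
            sleep(delay)
    raise RuntimeError("max_retries must be >= 0")
```

The `sleep` parameter is injectable so the behavior can be checked without actually waiting.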
swh-public-ci added a comment to D5977: Make PyPI lister incremental and complete in regards to last_update.
Build is green
ardumont added inline comments to D5977: Make PyPI lister incremental and complete in regards to last_update.
ardumont added inline comments to D5977: Make PyPI lister incremental and complete in regards to last_update.
ardumont updated the diff for D5977: Make PyPI lister incremental and complete in regards to last_update.
Address most of the review comments (thanks)
olasd added a comment to D5977: Make PyPI lister incremental and complete in regards to last_update.
ardumont added a comment to D5977: Make PyPI lister incremental and complete in regards to last_update.
Looks good, thanks a lot!
ardumont updated the test plan for D5977: Make PyPI lister incremental and complete in regards to last_update.
ardumont updated the test plan for D5977: Make PyPI lister incremental and complete in regards to last_update.
ardumont updated the test plan for D5977: Make PyPI lister incremental and complete in regards to last_update.
ardumont updated the test plan for D5977: Make PyPI lister incremental and complete in regards to last_update.
ardumont updated the test plan for D5977: Make PyPI lister incremental and complete in regards to last_update.
swh-public-ci added a comment to D5977: Make PyPI lister incremental and complete in regards to last_update.
Build is green
ardumont updated the diff for D5977: Make PyPI lister incremental and complete in regards to last_update.
- Handle throttling appropriately (adapt throttling decorators [1])
- Rework the logic to still process results page by page, so that database writes occur incrementally
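As a rough illustration of the page-based approach, the sketch below splits a stream of origins into fixed-size pages and writes each page as soon as it is complete, instead of sending everything at the end of the run. The names `paginate`, `write_incrementally`, and `write_page` are hypothetical and do not come from the lister itself.

```python
from itertools import islice
from typing import Callable, Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def paginate(items: Iterable[T], page_size: int) -> Iterator[List[T]]:
    """Yield successive pages of at most page_size items."""
    if page_size <= 0:
        raise ValueError("page_size must be positive")
    iterator = iter(items)
    while True:
        page = list(islice(iterator, page_size))
        if not page:
            return
        yield page


def write_incrementally(
    origins: Iterable[T],
    page_size: int,
    write_page: Callable[[List[T]], None],
) -> int:
    """Write origins one page at a time; return the number of items written."""
    total = 0
    for page in paginate(origins, page_size):
        # Each page is flushed as soon as it is ready, so an interrupted
        # run still keeps the pages that were already written.
        write_page(page)
        total += len(page)
    return total
```

Because `paginate` consumes its input lazily, at most one page needs to be held in memory at a time.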
ardumont updated the test plan for D5977: Make PyPI lister incremental and complete in regards to last_update.
ardumont planned changes to D5977: Make PyPI lister incremental and complete in regards to last_update.
Running it through Docker, I actually need to change a few things:
- throttling needs to change (D5983)
- the current logic of sending all origins in one go won't work for the first run (nothing is flushed before the run actually finishes...), so I'll need to rework this (in progress) [1]
swh-public-ci added a comment to D5977: Make PyPI lister incremental and complete in regards to last_update.
Build is green
ardumont updated the diff for D5977: Make PyPI lister incremental and complete in regards to last_update.
Rebase
swh-public-ci added a comment to D5977: Make PyPI lister incremental and complete in regards to last_update.
Build is green
ardumont updated the diff for D5977: Make PyPI lister incremental and complete in regards to last_update.
- Rebase
- Refactor
swh-public-ci added a comment to D5977: Make PyPI lister incremental and complete in regards to last_update.
Build is green