Oct 6 2022

franckbret updated the diff for D8171: crates: Loader implements incremental mode.

Remove description from Release message, add raw extrinsic metadata

Oct 6 2022, 12:31 PM
franckbret updated the diff for D8171: crates: Loader implements incremental mode.

Make use of checksums

Oct 6 2022, 9:15 AM

Oct 5 2022

franckbret added a comment to D8454: Crates.io: Add last_update for each version of a crate.

LGTM, thanks !

Oct 5 2022, 5:24 PM
franckbret closed D8454: Crates.io: Add last_update for each version of a crate.
Oct 5 2022, 5:13 PM
franckbret committed rDLS4a09f660b35a: Crates.io: Add last_update for each version of a crate (authored by franckbret).
Crates.io: Add last_update for each version of a crate
Oct 5 2022, 5:13 PM
franckbret updated the diff for D8454: Crates.io: Add last_update for each version of a crate.

rebase

Oct 5 2022, 5:13 PM
franckbret updated the diff for D8171: crates: Loader implements incremental mode.

Switch back artifacts and crates_metadata to list

Oct 5 2022, 4:54 PM
franckbret updated the diff for D8454: Crates.io: Add last_update for each version of a crate.

Fix documentation and remove finalize cleanup and related test now that we use tempdir

Oct 5 2022, 4:36 PM
franckbret updated the diff for D8454: Crates.io: Add last_update for each version of a crate.

Remove useless code

Oct 5 2022, 3:02 PM
franckbret updated the diff for D8454: Crates.io: Add last_update for each version of a crate.

Merge get_db_dumb and parse_db_dumb into get_and_parse_db_dump

Oct 5 2022, 2:35 PM
franckbret added a comment to D8454: Crates.io: Add last_update for each version of a crate.

extra_loader_arguments "artifacts" and "crates_metadata" are now lists + some code improvements

Oct 5 2022, 12:38 PM
franckbret added inline comments to D8454: Crates.io: Add last_update for each version of a crate.
Oct 5 2022, 12:36 PM
franckbret updated the diff for D8454: Crates.io: Add last_update for each version of a crate.

extra_loader_arguments "artifacts" and "crates_metadata" are now lists + some code improvements

Oct 5 2022, 12:35 PM
franckbret updated the diff for D8379: Hackage: Loads Hackage Listed origins.

shorter code for author fallback + use packaging.version parse

Oct 5 2022, 8:04 AM
franckbret updated the diff for D8379: Hackage: Loads Hackage Listed origins.

Remove release description from message

Oct 5 2022, 7:50 AM

Oct 4 2022

franckbret updated the diff for D8566: Conda: Anaconda packages archive loader.

Make use of checksums after D8595 landed

Oct 4 2022, 8:36 AM
franckbret added inline comments to D8379: Hackage: Loads Hackage Listed origins.
Oct 4 2022, 8:16 AM
franckbret updated the diff for D8379: Hackage: Loads Hackage Listed origins.

Add some tests to check that extract_intrinsic_metadata works as expected

Oct 4 2022, 8:01 AM

Sep 30 2022

franckbret updated the diff for D8566: Conda: Anaconda packages archive loader.

rebase

Sep 30 2022, 5:49 PM
franckbret updated the diff for D8566: Conda: Anaconda packages archive loader.

Rebase

Sep 30 2022, 5:46 PM
franckbret updated the diff for D8379: Hackage: Loads Hackage Listed origins.

Manage empty description and simplify .cabal parsing

Sep 30 2022, 5:44 PM
franckbret updated the diff for D8566: Conda: Anaconda packages archive loader.

shorter code and empty description handling

Sep 30 2022, 4:53 PM
franckbret added a comment to D8379: Hackage: Loads Hackage Listed origins.

@franckbret, you updated the wrong diff (conda instead of hackage)

Sep 30 2022, 4:27 PM
franckbret updated the diff for D8379: Hackage: Loads Hackage Listed origins.

Restore previous state after I ran arc diff against the wrong differential number

Sep 30 2022, 4:20 PM
franckbret updated the diff for D8566: Conda: Anaconda packages archive loader.

artifacts are now list

Sep 30 2022, 4:18 PM
franckbret updated the task description for T4547: Create a Conda Lister.
Sep 30 2022, 4:09 PM · Conda lister
franckbret closed D8588: Conda: switch artifacts from dict to list.
Sep 30 2022, 4:08 PM
franckbret committed rDLS6f40d2c1a5da: Conda: switch artifacts from dict to list (authored by franckbret).
Conda: switch artifacts from dict to list
Sep 30 2022, 4:08 PM
franckbret updated the diff for D8379: Hackage: Loads Hackage Listed origins.

artifacts are now list

Sep 30 2022, 4:07 PM
franckbret requested review of D8588: Conda: switch artifacts from dict to list.
Sep 30 2022, 4:05 PM
franckbret added inline comments to D8379: Hackage: Loads Hackage Listed origins.
Sep 30 2022, 10:19 AM
franckbret updated the diff for D8379: Hackage: Loads Hackage Listed origins.

Trust p_info.version instead of intrinsic_metadata["version"]

Sep 30 2022, 10:09 AM

Sep 29 2022

franckbret updated the diff for D8379: Hackage: Loads Hackage Listed origins.

Add more fixtures and tests

Sep 29 2022, 6:27 PM
franckbret added a comment to D8379: Hackage: Loads Hackage Listed origins.

The problem here is that the cabal file isn't formatted the way the naïve parser expects. We expect each line to look like {k}: {v}\n, but in this case it is {k}: \n \t {v} \n, so we end up with an empty value.

We can leave the cabal parser in its current state for now; we will see how many packages have a cabal file with an unexpected format when testing the loader on staging.

I'm currently writing more fixtures and tests for each case, and will update this patch soon.

I guess the other errors are related to similarly tricky cases, and until we have a real cabal parser that handles all of them we won't be able to avoid these exceptions.

Or the info is simply missing from the cabal file; using intrinsic_metadata.get("<field_name>") and handling None values should help fix those issues.
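
For illustration only, here is a minimal sketch of the two ideas above: folding an indented continuation line into the preceding key, and reading fields with .get() so that a missing key yields None instead of a KeyError. The helper names parse_cabal_fields and get_field are hypothetical and are not taken from the actual loader; the sketch only handles top-level fields and is not a real cabal parser.

```python
from typing import Dict, Optional


def parse_cabal_fields(text: str) -> Dict[str, str]:
    """Hypothetical helper: collect top-level `key: value` fields.

    An indented line that follows a key is treated as a continuation of
    that key's value, so `key:\\n\\tvalue` no longer gives an empty value.
    """
    fields: Dict[str, str] = {}
    current: Optional[str] = None
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped or stripped.startswith("--"):
            continue  # skip blank lines and cabal comments
        if line[0] in " \t":
            # Indented line: append to the last top-level key, if any.
            if current is not None:
                fields[current] = (fields[current] + " " + stripped).strip()
            continue
        key, sep, value = line.partition(":")
        if sep:
            current = key.strip().lower()
            fields[current] = value.strip()
        else:
            # Section header such as `library`: ignore its indented body.
            current = None
    return fields


def get_field(fields: Dict[str, str], name: str) -> Optional[str]:
    """Return the field value, or None when it is missing or empty."""
    return fields.get(name) or None


if __name__ == "__main__":
    sample = "name: numeric-qq\nversion:\n\t0.1.0\nlibrary\n  build-depends: base\n"
    parsed = parse_cabal_fields(sample)
    print(get_field(parsed, "version"))
    print(get_field(parsed, "author"))
```

With such a helper, a missing "author" or "synopsis" comes back as None and the caller can decide on a fallback rather than failing the whole release.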

Sep 29 2022, 6:14 PM
franckbret added a comment to D8379: Hackage: Loads Hackage Listed origins.

Sorry for posting my last comment twice; I didn't see the first one after submitting it.

Sep 29 2022, 6:12 PM
franckbret updated the task description for T4580: Puppet Loader.
Sep 29 2022, 4:09 PM · Puppet loader
franckbret updated the task description for T4580: Puppet Loader.
Sep 29 2022, 4:09 PM · Puppet loader
franckbret closed D8575: Puppet: The puppet loader loads origins from https://forge.puppet.com.
Sep 29 2022, 4:07 PM
franckbret committed rDLDBASE6299c091ec85: Puppet: The puppet loader loads origins from https://forge.puppet.com (authored by franckbret).
Puppet: The puppet loader loads origins from https://forge.puppet.com
Sep 29 2022, 4:07 PM
franckbret updated the diff for D8575: Puppet: The puppet loader loads origins from https://forge.puppet.com.

Ensure filename ends with {version}.tar.gz before splitting filename to get a package name
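
As a rough sketch of that check (not the loader's actual code), the function below refuses to derive a name unless the filename really ends with {version}.tar.gz. The function name module_name_from_filename and the assumption that a hyphen separates the name from the version are illustrative only.

```python
def module_name_from_filename(filename: str, version: str) -> str:
    """Hypothetical helper: strip `{version}.tar.gz` to get the package name."""
    suffix = f"{version}.tar.gz"
    if not filename.endswith(suffix):
        raise ValueError(f"{filename!r} does not end with {suffix!r}")
    # Assumption: name and version are joined by a hyphen, which is dropped.
    return filename[: -len(suffix)].rstrip("-")


if __name__ == "__main__":
    print(module_name_from_filename("puppetlabs-stdlib-8.4.0.tar.gz", "8.4.0"))
```

Checking the suffix first avoids silently producing a wrong name when the version string also appears elsewhere in the filename.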

Sep 29 2022, 3:52 PM
franckbret closed D8557: Cpan: Cpan loader loads Perl modules from cpan.org.
Sep 29 2022, 3:17 PM
franckbret committed rDLDBASE2db1a754e368: Cpan: Cpan loader loads Perl modules from cpan.org (authored by franckbret).
Cpan: Cpan loader loads Perl modules from cpan.org
Sep 29 2022, 3:17 PM
franckbret updated the diff for D8557: Cpan: Cpan loader loads Perl modules from cpan.org.

Rebase

Sep 29 2022, 3:15 PM
franckbret updated the task description for T4519: Create a Puppet Forge lister.
Sep 29 2022, 2:46 PM · Puppet lister
franckbret updated the task description for T4519: Create a Puppet Forge lister.
Sep 29 2022, 2:46 PM · Puppet lister
franckbret updated the task description for T4520: Create a CPAN lister.
Sep 29 2022, 2:42 PM · CPAN lister
franckbret updated the task description for T4520: Create a CPAN lister.
Sep 29 2022, 2:41 PM · CPAN lister
franckbret updated the task description for T4547: Create a Conda Lister.
Sep 29 2022, 2:35 PM · Conda lister
franckbret updated the task description for T4547: Create a Conda Lister.
Sep 29 2022, 2:35 PM · Conda lister
franckbret updated the task description for T4547: Create a Conda Lister.
Sep 29 2022, 2:34 PM · Conda lister
franckbret updated the task description for T4547: Create a Conda Lister.
Sep 29 2022, 2:34 PM · Conda lister
franckbret updated the task description for T4579: Create Conda Loader.
Sep 29 2022, 2:33 PM · Conda loader
franckbret closed D8529: RubyGems: List origins from https://rubygems.org.
Sep 29 2022, 2:20 PM
franckbret committed rDLS52ccf49e1104: RubyGems: List origins from https://rubygems.org (authored by franckbret).
RubyGems: List origins from https://rubygems.org
Sep 29 2022, 2:20 PM
franckbret updated the diff for D8529: RubyGems: List origins from https://rubygems.org.

rebase

Sep 29 2022, 2:20 PM
franckbret added inline comments to D8575: Puppet: The puppet loader loads origins from https://forge.puppet.com.
Sep 29 2022, 1:49 PM
franckbret added inline comments to D8575: Puppet: The puppet loader loads origins from https://forge.puppet.com.
Sep 29 2022, 1:37 PM
franckbret added a comment to D8379: Hackage: Loads Hackage Listed origins.

That diff requires some changes as the api_info function got renamed (see inline comments).

Also while testing the loader in docker, I got a couple of errors on some packages, see below:

docker-swh-loader-1  | [2022-09-26 13:06:42,922: ERROR/ForkPoolWorker-8] Failed to load branch releases/0.1.0 for https://hackage.haskell.org/package/numeric-qq
docker-swh-loader-1  | Traceback (most recent call last):
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/loader.py", line 672, in load
docker-swh-loader-1  |     res = self._load_release(p_info, origin)
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/loader.py", line 851, in _load_release
docker-swh-loader-1  |     p_info, uncompressed_path, directory=directory.hash
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/hackage/loader.py", line 171, in build_release
docker-swh-loader-1  |     assert version == p_info.version
docker-swh-loader-1  | AssertionError
Sep 29 2022, 1:34 PM
franckbret updated subscribers of D8575: Puppet: The puppet loader loads origins from https://forge.puppet.com.
Sep 29 2022, 1:23 PM
franckbret added a comment to D8379: Hackage: Loads Hackage Listed origins.

That diff requires some changes as the api_info function got renamed (see inline comments).

Also while testing the loader in docker, I got a couple of errors on some packages, see below:

docker-swh-loader-1  | [2022-09-26 13:06:42,922: ERROR/ForkPoolWorker-8] Failed to load branch releases/0.1.0 for https://hackage.haskell.org/package/numeric-qq
docker-swh-loader-1  | Traceback (most recent call last):
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/loader.py", line 672, in load
docker-swh-loader-1  |     res = self._load_release(p_info, origin)
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/loader.py", line 851, in _load_release
docker-swh-loader-1  |     p_info, uncompressed_path, directory=directory.hash
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/hackage/loader.py", line 171, in build_release
docker-swh-loader-1  |     assert version == p_info.version
docker-swh-loader-1  | AssertionError
docker-swh-loader-1  | [2022-09-26 13:08:03,416: ERROR/ForkPoolWorker-11] Failed to load branch releases/1.0.0.0 for https://hackage.haskell.org/package/haskell2010
docker-swh-loader-1  | Traceback (most recent call last):
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/loader.py", line 672, in load
docker-swh-loader-1  |     res = self._load_release(p_info, origin)
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/loader.py", line 851, in _load_release
docker-swh-loader-1  |     p_info, uncompressed_path, directory=directory.hash
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/hackage/loader.py", line 172, in build_release
docker-swh-loader-1  |     author = Person.from_fullname(intrinsic_metadata["author"].encode())
docker-swh-loader-1  | KeyError: 'author'
docker-swh-loader-1  | [2022-09-26 13:21:31,790: ERROR/ForkPoolWorker-40] Failed to load branch releases/0.1.0.0 for https://hackage.haskell.org/package/hs-inspector
docker-swh-loader-1  | Traceback (most recent call last):
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/loader.py", line 672, in load
docker-swh-loader-1  |     res = self._load_release(p_info, origin)
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/loader.py", line 851, in _load_release
docker-swh-loader-1  |     p_info, uncompressed_path, directory=directory.hash
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/hackage/loader.py", line 173, in build_release
docker-swh-loader-1  |     description: str = intrinsic_metadata["synopsis"]
docker-swh-loader-1  | KeyError: 'synopsis'
docker-swh-loader-1  | Traceback (most recent call last):
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/loader.py", line 672, in load
docker-swh-loader-1  |     res = self._load_release(p_info, origin)
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/loader.py", line 851, in _load_release
docker-swh-loader-1  |     p_info, uncompressed_path, directory=directory.hash
docker-swh-loader-1  |   File "/src/swh-loader-core/swh/loader/package/hackage/loader.py", line 170, in build_release
docker-swh-loader-1  |     version: str = intrinsic_metadata["version"]
docker-swh-loader-1  | KeyError: 'version'
Sep 29 2022, 12:39 PM
franckbret requested review of D8575: Puppet: The puppet loader loads origins from https://forge.puppet.com.
Sep 29 2022, 10:21 AM
franckbret created T4580: Puppet Loader.
Sep 29 2022, 10:03 AM · Puppet loader

Sep 28 2022

franckbret updated the diff for D8557: Cpan: Cpan loader loads Perl modules from cpan.org.

Some improvements after review

Sep 28 2022, 5:26 PM
franckbret requested review of D8566: Conda: Anaconda packages archive loader.
Sep 28 2022, 4:48 PM
franckbret requested review of D8557: Cpan: Cpan loader loads Perl modules from cpan.org.
Sep 28 2022, 9:45 AM

Sep 27 2022

franckbret closed D8528: Nuget: Lister for NuGet the package manager for .NET.
Sep 27 2022, 4:28 PM
franckbret committed rDLS3928fc9ee9ff: Nuget: Lister for NuGet the package manager for .NET (authored by franckbret).
Nuget: Lister for NuGet the package manager for .NET
Sep 27 2022, 4:28 PM
franckbret updated the diff for D8528: Nuget: Lister for NuGet the package manager for .NET.

rebase

Sep 27 2022, 4:01 PM
franckbret added a comment to D8529: RubyGems: List origins from https://rubygems.org.

Is https://rubygems.org/versions documented somewhere?

Sep 27 2022, 3:59 PM
franckbret updated the diff for D8529: RubyGems: List origins from https://rubygems.org.

Make use of http_request after D8520, update documentation docker section.

Sep 27 2022, 3:23 PM
franckbret updated the diff for D8528: Nuget: Lister for NuGet the package manager for .NET.

Explain that the lister discovers origins on other forges because NuGet packages are binaries

Sep 27 2022, 2:59 PM
franckbret closed D8531: Puppet: Lister for Puppet modules.
Sep 27 2022, 2:46 PM
franckbret committed rDLScd596eb2b46d: Puppet: Lister for Puppet modules (authored by franckbret).
Puppet: Lister for Puppet modules
Sep 27 2022, 2:46 PM
franckbret updated the diff for D8531: Puppet: Lister for Puppet modules.

rebase

Sep 27 2022, 2:45 PM
franckbret added a comment to D8542: Cpan: List Perl module origins from cpan.org.

@franckbret, have you considered exploiting the https://fastapi.metacpan.org/v1/release/_search endpoint of the CPAN Elasticsearch?

It seems to list all CPAN releases with dates, links to tarballs and checksums. You could build a list of artifacts for each package, as in the crates loader, and pass them as loader arguments.

Thanks for the review.

Sep 27 2022, 2:42 PM
franckbret closed D8542: Cpan: List Perl module origins from cpan.org.
Sep 27 2022, 2:32 PM
franckbret committed rDLSa4aec3894e3c: Cpan: List Perl module origins from cpan.org (authored by franckbret).
Cpan: List Perl module origins from cpan.org
Sep 27 2022, 2:32 PM
franckbret updated the diff for D8542: Cpan: List Perl module origins from cpan.org.

rebase

Sep 27 2022, 2:31 PM
franckbret closed D8338: Hackage: List origins from hackage.haskell.org, The Haskell Package Repository.
Sep 27 2022, 2:25 PM
franckbret committed rDLS6696a8424ad1: Hackage: List origins from hackage.haskell.org, The Haskell Package Repository (authored by franckbret).
Hackage: List origins from hackage.haskell.org, The Haskell Package Repository
Sep 27 2022, 2:25 PM
franckbret updated the diff for D8338: Hackage: List origins from hackage.haskell.org, The Haskell Package Repository.

rebase

Sep 27 2022, 2:24 PM
franckbret closed D8517: Conda: List origins from anaconda.com, the Package, dependency and environment management for any language.
Sep 27 2022, 2:19 PM
franckbret committed rDLS8ff418fbc28b: Conda: List origins for Anaconda, the package manager that provides tooling for… (authored by franckbret).
Conda: List origins for Anaconda, the package manager that provides tooling for…
Sep 27 2022, 2:19 PM
franckbret updated the diff for D8379: Hackage: Loads Hackage Listed origins.

Replace api_info that has been renamed to get_url_body

Sep 27 2022, 10:55 AM
franckbret requested review of D8542: Cpan: List Perl module origins from cpan.org.
Sep 27 2022, 9:32 AM

Sep 26 2022

franckbret updated the diff for D8531: Puppet: Lister for Puppet modules.

Sphinx fix

Sep 26 2022, 6:53 PM
franckbret added inline comments to D8531: Puppet: Lister for Puppet modules.
Sep 26 2022, 6:47 PM
franckbret updated the diff for D8531: Puppet: Lister for Puppet modules.

typo

Sep 26 2022, 6:44 PM
franckbret updated the diff for D8531: Puppet: Lister for Puppet modules.

Make use of self.http_request as introduced by D8520

Sep 26 2022, 6:38 PM
franckbret updated the diff for D8517: Conda: List origins from anaconda.com, the Package, dependency and environment management for any language.

Make use of http_request after D8520

Sep 26 2022, 5:59 PM
franckbret updated the diff for D8528: Nuget: Lister for NuGet the package manager for .NET.

Update docker usage documentation section and remove some useless code

Sep 26 2022, 5:23 PM
franckbret added inline comments to D8338: Hackage: List origins from hackage.haskell.org, The Haskell Package Repository.
Sep 26 2022, 5:06 PM
franckbret updated the diff for D8338: Hackage: List origins from hackage.haskell.org, The Haskell Package Repository.

Improvements after review

Sep 26 2022, 5:04 PM
franckbret updated subscribers of D8528: Nuget: Lister for NuGet the package manager for .NET.
Sep 26 2022, 4:16 PM
franckbret updated the diff for D8528: Nuget: Lister for NuGet the package manager for .NET.

More complete tests and basic documentation

Sep 26 2022, 4:10 PM
franckbret updated the diff for D8528: Nuget: Lister for NuGet the package manager for .NET.

Make use of generic http_request method after D8520

Sep 26 2022, 2:38 PM
franckbret updated the diff for D8338: Hackage: List origins from hackage.haskell.org, The Haskell Package Repository.

Make use of http_retry instead of throttling_retry decorator after D8519

Sep 26 2022, 12:27 PM
franckbret updated the diff for D8517: Conda: List origins from anaconda.com, the Package, dependency and environment management for any language.

Make use of http_retry instead of throttling_retry decorator after D8519

Sep 26 2022, 12:17 PM
franckbret retitled D8528: Nuget: Lister for NuGet the package manager for .NET from [WIP] Nuget: Lister for NuGet the package manager for .NET to Nuget: Lister for NuGet the package manager for .NET.
Sep 26 2022, 12:14 PM
franckbret updated the diff for D8528: Nuget: Lister for NuGet the package manager for .NET.

Make use of http_retry instead of throttling_retry decorator after D8519

Sep 26 2022, 12:13 PM