Page MenuHomeSoftware Heritage
Feed All Stories

Jul 1 2022

douardda triaged T4370: Refactor the origin visit data model (aka get rid of the OriginVisit model object) as High priority.
Jul 1 2022, 4:35 PM · Storage manager, Data Model
anlambert triaged T4369: canonicalize gitlab urls in origin API as Normal priority.
Jul 1 2022, 3:24 PM · Web app
anlambert added a comment to T4369: canonicalize gitlab urls in origin API.

For the record, we recently disabled any origin URL processing in the Web API part (we were checking an origin URL with and without
trailing slash, but this should only be performed in the Web UI as the Web API should be as dumb as possible, see D7988).

Jul 1 2022, 3:23 PM · Web app
douardda requested review of D8066: Add the 'flavor' argument to db_utils.initialize_database_for_module().
Jul 1 2022, 3:03 PM
jayeshv closed D8065: Add type annotations for base_connection.
Jul 1 2022, 3:00 PM
jayeshv committed rDGQLe4d4a30b1f36: Add type annotations for base_connection (authored by jayeshv).
Add type annotations for base_connection
Jul 1 2022, 3:00 PM
anlambert accepted D8065: Add type annotations for base_connection.

Looks good to me.

Jul 1 2022, 2:58 PM
swh-public-ci added a comment to D8065: Add type annotations for base_connection.

Build is green

Jul 1 2022, 2:54 PM
jayeshv updated the diff for D8065: Add type annotations for base_connection.

Address review comments

Jul 1 2022, 2:52 PM
anlambert added inline comments to D8065: Add type annotations for base_connection.
Jul 1 2022, 2:13 PM
Harbormaster failed remote builds in B30196: Diff 29104 for D8047: Indexer for Packagist(composer.json)!
Jul 1 2022, 1:05 PM
swh-public-ci added a comment to D8047: Indexer for Packagist(composer.json).

Build was aborted

Jul 1 2022, 1:05 PM
VickyMerzOwn updated the diff for D8047: Indexer for Packagist(composer.json).

Excludes author.role from composer.py

Jul 1 2022, 12:54 PM
swh-public-ci added a comment to D8065: Add type annotations for base_connection.

Build is green

Jul 1 2022, 12:20 PM
jayeshv updated the diff for D8065: Add type annotations for base_connection.

Add type annotation on _get_paged_result functions

Jul 1 2022, 12:17 PM
jayeshv requested review of D8065: Add type annotations for base_connection.
Jul 1 2022, 11:55 AM
jayeshv added a revision to T4261: Swh-graphql Code cleanup: D8065: Add type annotations for base_connection.
Jul 1 2022, 11:53 AM · GraphQL API
rdicosmo added a comment to T4369: canonicalize gitlab urls in origin API.

Nice catch.... actually, the GitLab API uses the project slug (e.g.: checkscale-gitlab/git-wtf) without the .git, we should make sure we can handle this same slug

Jul 1 2022, 11:52 AM · Web app
bchauvet created T4369: canonicalize gitlab urls in origin API.
Jul 1 2022, 11:15 AM · Web app
douardda triaged T4368: Loosen "foreign key" validation in storages used as mirror ingestion endpoint as High priority.
Jul 1 2022, 11:13 AM · Storage manager
douardda created T4368: Loosen "foreign key" validation in storages used as mirror ingestion endpoint.
Jul 1 2022, 11:13 AM · Storage manager
jayeshv updated the task description for T4365: GraphQL: Use the right date time serialization.
Jul 1 2022, 10:54 AM · GraphQL API
jayeshv updated the task description for T4366: GraphQL: add revision log to the revision type.
Jul 1 2022, 10:54 AM · GraphQL API
douardda added a comment to T2309: Add support for other hash algo than sha1 in current objstorage implementation.

do you have in mind to make the actual hash used as primary key in an objstorage a configuration of said storage instance? e.g. create a pathslicer or s3 objstorage using sha256 is just a matter of configuration of the objstorage?

Jul 1 2022, 10:38 AM · Object storage
douardda added a comment to T2309: Add support for other hash algo than sha1 in current objstorage implementation.

do you have in mind to make the actual hash used as primary key in an objstorage a configuration of said storage instance? e.g. create a pathslicer or s3 objstorage using sha256 is just a matter of configuration of the objstorage?

Jul 1 2022, 10:34 AM · Object storage
douardda added inline comments to D8029: Start introducing composite ObjId in the interface.
Jul 1 2022, 10:22 AM
ardumont committed rCJSWH0613c0c9d3df: README.md: Improve gogs setup instructions (authored by ardumont).
README.md: Improve gogs setup instructions
Jul 1 2022, 10:02 AM
ardumont committed rCJSWHa8a26a77f28a: Ignore docker-compose.override.yml (authored by ardumont).
Ignore docker-compose.override.yml
Jul 1 2022, 10:02 AM
swh-public-ci added a comment to D8047: Indexer for Packagist(composer.json).

Build is green

Jul 1 2022, 9:06 AM
VickyMerzOwn updated the diff for D8047: Indexer for Packagist(composer.json).

Excludes codemeta-v1 column from composer.csv

Jul 1 2022, 9:01 AM
vlorentz added inline comments to D8047: Indexer for Packagist(composer.json).
Jul 1 2022, 8:39 AM

Jun 30 2022

vsellier committed rCJSWHb0e07c673ec0: wip - add a forge to host the local Changes (authored by vsellier).
wip - add a forge to host the local Changes
Jun 30 2022, 11:25 PM
swh-public-ci added a comment to D8047: Indexer for Packagist(composer.json).

Build is green

Jun 30 2022, 11:18 PM
VickyMerzOwn updated the diff for D8047: Indexer for Packagist(composer.json).

Corrects Copyright year

Jun 30 2022, 11:13 PM
swh-public-ci added a comment to D8047: Indexer for Packagist(composer.json).

Build is green

Jun 30 2022, 10:52 PM
VickyMerzOwn updated the diff for D8047: Indexer for Packagist(composer.json).

Adds license to composer.py
Excludes redundant columns from composer.csv

Jun 30 2022, 10:46 PM
VickyMerzOwn added inline comments to D8047: Indexer for Packagist(composer.json).
Jun 30 2022, 10:45 PM
vlorentz added inline comments to D8047: Indexer for Packagist(composer.json).
Jun 30 2022, 10:31 PM
swh-public-ci added a comment to D8047: Indexer for Packagist(composer.json).

Build is green

Jun 30 2022, 10:00 PM
VickyMerzOwn updated the diff for D8047: Indexer for Packagist(composer.json).

Undo changes to crosswalk.csv

Jun 30 2022, 9:54 PM
swh-public-ci added a comment to D8047: Indexer for Packagist(composer.json).

Build is green

Jun 30 2022, 9:40 PM
vlorentz added inline comments to D8047: Indexer for Packagist(composer.json).
Jun 30 2022, 9:38 PM
swh-public-ci added a comment to D8047: Indexer for Packagist(composer.json).

Build is green

Jun 30 2022, 9:38 PM
VickyMerzOwn updated the diff for D8047: Indexer for Packagist(composer.json).

Undo changes to crosswalk.csv

Jun 30 2022, 9:34 PM
VickyMerzOwn updated the diff for D8047: Indexer for Packagist(composer.json).

Composer mappings use a separate table

Jun 30 2022, 9:32 PM
seirl committed rDGRPH555e3b6d3e11: rpc_server: use shlex.quote() to print command (authored by seirl).
rpc_server: use shlex.quote() to print command
Jun 30 2022, 8:49 PM
seirl committed rDGRPHa55edc1f88d5: Remove shlex.join() call (3.8 only) (authored by seirl).
Remove shlex.join() call (3.8 only)
Jun 30 2022, 8:33 PM
Harbormaster failed to build B30184: rDGRPH441aacc6aeeb: Update sux4j to 5.4.0 for rDGRPH441aacc6aeeb: Update sux4j to 5.4.0!
Jun 30 2022, 8:15 PM
Harbormaster failed to build B30185: rDGRPH94418d4944a8: GRPC: use port 50091 by default for rDGRPH94418d4944a8: GRPC: use port 50091 by default!
Jun 30 2022, 8:15 PM
seirl committed rDGRPH94418d4944a8: GRPC: use port 50091 by default (authored by seirl).
GRPC: use port 50091 by default
Jun 30 2022, 8:07 PM
seirl committed rDGRPH441aacc6aeeb: Update sux4j to 5.4.0 (authored by seirl).
Update sux4j to 5.4.0
Jun 30 2022, 8:07 PM
VickyMerzOwn added inline comments to D8047: Indexer for Packagist(composer.json).
Jun 30 2022, 7:57 PM
vlorentz requested changes to D8047: Indexer for Packagist(composer.json).
Jun 30 2022, 7:43 PM
swh-public-ci added a comment to D8047: Indexer for Packagist(composer.json).

Build is green

Jun 30 2022, 7:15 PM
seirl requested review of D8064: swh-graph: rename services (now production-ready, no longer dev).
Jun 30 2022, 7:15 PM
VickyMerzOwn updated the diff for D8047: Indexer for Packagist(composer.json).

Adds composer mappings to metadata dictionary

Jun 30 2022, 7:09 PM
Harbormaster failed to build B30181: rDGRPH26e52b389fb1: rpc_server.py: terminate, then kill server on timeout for rDGRPH26e52b389fb1: rpc_server.py: terminate, then kill server on timeout!
Jun 30 2022, 7:06 PM
seirl committed rDGRPH26e52b389fb1: rpc_server.py: terminate, then kill server on timeout (authored by seirl).
rpc_server.py: terminate, then kill server on timeout
Jun 30 2022, 6:59 PM
seirl added a comment to T3259: Gracefully handle a client closing the connection in the middle of a response being streamed.

GRPC update: cancelling a GRPC stream works fine, but it doesn't seem like aiohttp is doing that when the HTTP stream is closed.

Jun 30 2022, 6:41 PM · Compressed graph service
ardumont committed rCJSWHb0872b4ec80b: wip: Start swh-apps/update-dependencies job (authored by ardumont).
wip: Start swh-apps/update-dependencies job
Jun 30 2022, 5:56 PM
ardumont committed rCJSWH6ce55e57218b: wip: Start swh-apps/update-dependencies job (authored by ardumont).
wip: Start swh-apps/update-dependencies job
Jun 30 2022, 5:46 PM
ardumont committed rCJSWHdfac84b3435f: wip: Start swh-apps/update-dependencies job (authored by ardumont).
wip: Start swh-apps/update-dependencies job
Jun 30 2022, 5:43 PM
ardumont committed rCJSWHe41c157273e8: wip: Start swh-apps/update-dependencies job (authored by ardumont).
wip: Start swh-apps/update-dependencies job
Jun 30 2022, 5:41 PM
ardumont committed rCJSWHd25fc9bb8b51: wip: Start swh-apps/update-dependencies job (authored by ardumont).
wip: Start swh-apps/update-dependencies job
Jun 30 2022, 5:37 PM
seirl closed T4316: Push of swh-graph to pypi is broken as Resolved.

We requested a larger quota here: https://github.com/pypa/pypi-support/issues/1998

Jun 30 2022, 5:37 PM · System administration, Compressed graph service
seirl committed rDGRPHf2f9e8ba69ef: DumpProperties: fix output for tag names (authored by seirl).
DumpProperties: fix output for tag names
Jun 30 2022, 5:18 PM
vsellier committed rCJSWH3d562f112c91: wip - poc the swh-apps pipeline (authored by vsellier).
wip - poc the swh-apps pipeline
Jun 30 2022, 5:13 PM
vsellier committed rCJSWH278f12b744b1: wip - poc the swh-apps pipeline (authored by vsellier).
wip - poc the swh-apps pipeline
Jun 30 2022, 5:10 PM
ardumont closed D8063: README.md: Adapt docker instructions.
Jun 30 2022, 5:06 PM
ardumont committed rCJSWHdf549fba24d0: README.md: Adapt docker instructions (authored by ardumont).
README.md: Adapt docker instructions
Jun 30 2022, 5:06 PM
ardumont requested review of D8063: README.md: Adapt docker instructions.
Jun 30 2022, 5:06 PM
seirl closed D8061: Re-add swh/graph/client.py for backward-compatibility.
Jun 30 2022, 4:48 PM
seirl committed rDGRPH3b9685923021: Re-add swh/graph/client.py for backward-compatibility (authored by seirl).
Re-add swh/graph/client.py for backward-compatibility
Jun 30 2022, 4:48 PM
vsellier closed D8062: fix a typo on the production objstorage vhost.
Jun 30 2022, 4:48 PM
vsellier committed rSPSITE57fb33253ca7: fix a typo on the production objstorage vhost (authored by vsellier).
fix a typo on the production objstorage vhost
Jun 30 2022, 4:47 PM
anlambert accepted D8061: Re-add swh/graph/client.py for backward-compatibility.

Great, thanks !

Jun 30 2022, 4:47 PM
seirl requested review of D8061: Re-add swh/graph/client.py for backward-compatibility.
Jun 30 2022, 4:44 PM
ardumont accepted D8062: fix a typo on the production objstorage vhost.

fix typo for fun and profit! ;)

Jun 30 2022, 4:42 PM
olasd accepted D8062: fix a typo on the production objstorage vhost.

lol

Jun 30 2022, 4:41 PM
vsellier requested review of D8062: fix a typo on the production objstorage vhost.
Jun 30 2022, 4:40 PM
jayeshv renamed T4367: GraphQL: Pagination fixes to comply with the Relay spec from GraphQL: Fix issue in pagination to GraphQL: Fix issue in pagination 'after' argument.
Jun 30 2022, 4:33 PM · GraphQL API
jayeshv triaged T4367: GraphQL: Pagination fixes to comply with the Relay spec as Low priority.
Jun 30 2022, 4:32 PM · GraphQL API
seirl closed T2100: Bootstrap Debian packaging for swh.graph, a subtask of T3168: Proper deployment of swh-graph with debian package, as Wontfix.
Jun 30 2022, 4:21 PM · Compressed graph service, Puppet recipes
seirl closed T2100: Bootstrap Debian packaging for swh.graph as Wontfix.

We are migrating away from Debian packages as a deployment tool, closing this as WONTFIX.

Jun 30 2022, 4:21 PM · Compressed graph service
seirl closed T3168: Proper deployment of swh-graph with debian package as Wontfix.

We are migrating away from Debian packages as a deployment tool, closing this as WONTFIX.

Jun 30 2022, 4:20 PM · Compressed graph service, Puppet recipes
seirl closed T2081: swh-graph: "Cannot open client FIFO" when answering HEAD requests as Resolved.

No longer happens after the GRPC migration:

Jun 30 2022, 4:19 PM · Compressed graph service
jayeshv triaged T4366: GraphQL: add revision log to the revision type as Normal priority.
Jun 30 2022, 4:17 PM · GraphQL API
seirl closed T2103: (Debian) package py4j, a subtask of T2100: Bootstrap Debian packaging for swh.graph, as Wontfix.
Jun 30 2022, 4:16 PM · Compressed graph service
seirl closed T2103: (Debian) package py4j as Wontfix.

We removed the Py4J dependency by migrating to GRPC.

Jun 30 2022, 4:16 PM · Compressed graph service
seirl closed T3301: graph: add test for the "algo" parameter of walk() as Wontfix.

We no longer support multiple algorithms for shortest path requests.

Jun 30 2022, 4:16 PM · Easy hack, Compressed graph service
jayeshv triaged T4365: GraphQL: Use the right date time serialization as Normal priority.
Jun 30 2022, 4:15 PM · GraphQL API
seirl closed T3623: Run swh-graph with gunicorn to support multiple/parallel requests as Resolved.

Obsoleted by the migration to GRPC. Now we use GRPC's threading model, with a threadpool configurable by passing --threads to the Java service. By default, nproc is used.

Jun 30 2022, 4:12 PM · Compressed graph service, System administration
seirl closed D8059: Generate .labelobl files to speed-up labelled graph loading.
Jun 30 2022, 4:11 PM
seirl committed rDGRPH32f2626a7d0a: Generate .labelobl files to speed-up labelled graph loading (authored by seirl).
Generate .labelobl files to speed-up labelled graph loading
Jun 30 2022, 4:11 PM
seirl closed T4113: Review border case of empty response for `visit_nodes` as Resolved.

Obsoleted by the migration to GRPC. We no longer create iterators on the decoded stream of a UNIX pipeline, we directly use GRPC stream iterators.

Jun 30 2022, 4:11 PM · Compressed graph service
swh-public-ci added a comment to D8059: Generate .labelobl files to speed-up labelled graph loading.

Build is green

Jun 30 2022, 4:10 PM
vsellier closed D8057: Add a docker environment to test the job-builder inside jenkins.
Jun 30 2022, 4:09 PM
swh-public-ci added a comment to D8059: Generate .labelobl files to speed-up labelled graph loading.

Build has FAILED

Jun 30 2022, 4:09 PM
vsellier committed rCJSWH21e47db56cb5: Add a docker environment to test the job-builder inside jenkins (authored by vsellier).
Add a docker environment to test the job-builder inside jenkins
Jun 30 2022, 4:09 PM
Harbormaster failed remote builds in B30170: Diff 29081 for D8059: Generate .labelobl files to speed-up labelled graph loading!
Jun 30 2022, 4:09 PM