
Sep 30 2021

vlorentz lowered the priority of T3623: Run swh-graph with gunicorn to support multiple/parallel requests from Normal to Low.

Hmm, actually this might be harder than just using gunicorn, because the Java subprocess needs to be shared between workers.

Sep 30 2021, 3:04 PM · Compressed graph service, System administration
vlorentz triaged T3624: Update swh-graph from 0.3.0 to 0.5.0 on granet as High priority.
Sep 30 2021, 3:04 PM · Compressed graph service, System administration
vlorentz updated the task description for T3623: Run swh-graph with gunicorn to support multiple/parallel requests.
Sep 30 2021, 2:32 PM · Compressed graph service, System administration
vlorentz updated the task description for T3623: Run swh-graph with gunicorn to support multiple/parallel requests.
Sep 30 2021, 2:31 PM · Compressed graph service, System administration
vlorentz updated the task description for T3623: Run swh-graph with gunicorn to support multiple/parallel requests.
Sep 30 2021, 2:30 PM · Compressed graph service, System administration
vlorentz updated the task description for T3623: Run swh-graph with gunicorn to support multiple/parallel requests.
Sep 30 2021, 2:30 PM · Compressed graph service, System administration
vlorentz triaged T3623: Run swh-graph with gunicorn to support multiple/parallel requests as Normal priority.
Sep 30 2021, 2:27 PM · Compressed graph service, System administration

Sep 23 2021

vlorentz added a parent task for T2807: document swh.graph.graph module: T3604: Document the architecture of all major packages/components.
Sep 23 2021, 2:52 PM · Documentation, Compressed graph service

Sep 9 2021

ardumont closed T3564: Puppetize graph service and add icinga alert, a subtask of T3168: Proper deployment of swh-graph with debian package, as Resolved.
Sep 9 2021, 1:15 PM · Compressed graph service, Puppet recipes
ardumont closed T3564: Puppetize graph service and add icinga alert as Resolved.
Sep 9 2021, 1:15 PM · System administration, Compressed graph service, Puppet recipes
ardumont added a comment to T3564: Puppetize graph service and add icinga alert.

Puppet was stuck on this machine.
It's now unstuck as well.

Sep 9 2021, 12:32 PM · System administration, Compressed graph service, Puppet recipes
ardumont moved T3564: Puppetize graph service and add icinga alert from in-progress to deployed/landed/monitoring on the System administration board.
Sep 9 2021, 12:31 PM · System administration, Compressed graph service, Puppet recipes
ardumont changed the status of T3564: Puppetize graph service and add icinga alert, a subtask of T3168: Proper deployment of swh-graph with debian package, from Open to Work in Progress.
Sep 9 2021, 12:31 PM · Compressed graph service, Puppet recipes
ardumont changed the status of T3564: Puppetize graph service and add icinga alert from Open to Work in Progress.
Sep 9 2021, 12:31 PM · System administration, Compressed graph service, Puppet recipes
ardumont added a project to T3564: Puppetize graph service and add icinga alert: System administration.
Sep 9 2021, 12:31 PM · System administration, Compressed graph service, Puppet recipes

Sep 8 2021

olasd placed T2100: Bootstrap Debian packaging for swh.graph up for grabs.
Sep 8 2021, 11:36 AM · Compressed graph service
olasd placed T2103: (Debian) package py4j up for grabs.
Sep 8 2021, 11:36 AM · Compressed graph service
olasd placed T3168: Proper deployment of swh-graph with debian package up for grabs.
Sep 8 2021, 11:36 AM · Compressed graph service, Puppet recipes

Sep 7 2021

ardumont updated the task description for T3564: Puppetize graph service and add icinga alert.
Sep 7 2021, 1:49 PM · System administration, Compressed graph service, Puppet recipes
ardumont added a revision to T3564: Puppetize graph service and add icinga alert: D6200: Add icinga checks around the graph service.
Sep 7 2021, 1:48 PM · System administration, Compressed graph service, Puppet recipes
ardumont added a revision to T3564: Puppetize graph service and add icinga alert: D6199: Install graph services as-is.
Sep 7 2021, 1:48 PM · System administration, Compressed graph service, Puppet recipes
ardumont updated the task description for T3564: Puppetize graph service and add icinga alert.
Sep 7 2021, 12:34 PM · System administration, Compressed graph service, Puppet recipes
ardumont updated the task description for T3564: Puppetize graph service and add icinga alert.
Sep 7 2021, 12:33 PM · System administration, Compressed graph service, Puppet recipes
ardumont updated the task description for T3168: Proper deployment of swh-graph with debian package.
Sep 7 2021, 12:33 PM · Compressed graph service, Puppet recipes
ardumont triaged T3564: Puppetize graph service and add icinga alert as High priority.
Sep 7 2021, 12:32 PM · System administration, Compressed graph service, Puppet recipes
ardumont added a revision to T3168: Proper deployment of swh-graph with debian package: D6200: Add icinga checks around the graph service.
Sep 7 2021, 12:19 PM · Compressed graph service, Puppet recipes
ardumont added a revision to T3168: Proper deployment of swh-graph with debian package: D6199: Install graph services as-is.
Sep 7 2021, 12:18 PM · Compressed graph service, Puppet recipes

Aug 31 2021

ardumont removed a parent task for T3168: Proper deployment of swh-graph with debian package: T3518: Enable vault cookers to access swh-graph.
Aug 31 2021, 5:21 PM · Compressed graph service, Puppet recipes
ardumont updated the task description for T3168: Proper deployment of swh-graph with debian package.
Aug 31 2021, 11:26 AM · Compressed graph service, Puppet recipes
ardumont closed T2102: Clean up Debian packaging branch bootstrapping scripts, a subtask of T2100: Bootstrap Debian packaging for swh.graph, as Resolved.
Aug 31 2021, 10:48 AM · Compressed graph service

Aug 30 2021

vlorentz added a parent task for T3168: Proper deployment of swh-graph with debian package: T3518: Enable vault cookers to access swh-graph.
Aug 30 2021, 9:36 AM · Compressed graph service, Puppet recipes

Jul 29 2021

vlorentz moved T2431: Document how to export the graph edge dataset from sys-admin (docs/sysadm) to developers (docs/devel/) on the Documentation board.
Jul 29 2021, 3:54 PM · Documentation, Compressed graph service, Datasets
vlorentz moved T2431: Document how to export the graph edge dataset from archive-users (docs/user-guides/) to sys-admin (docs/sysadm) on the Documentation board.
Jul 29 2021, 3:54 PM · Documentation, Compressed graph service, Datasets
vlorentz added a comment to T2431: Document how to export the graph edge dataset.

It is now somewhat documented here: https://forge.softwareheritage.org/source/swh-environment/browse/master/docker/services/swh-graph/entrypoint.sh

Jul 29 2021, 3:54 PM · Documentation, Compressed graph service, Datasets
vlorentz closed T1936: integrate swh-graph into the docker environment as Resolved.
Jul 29 2021, 3:51 PM · Docker environment, Compressed graph service

Jul 28 2021

vlorentz added revisions to T1936: integrate swh-graph into the docker environment: D6042: graph: don't run export+compression unless needed, and add documentation, D6038: journalprocessor: Fix deserialize_message raising EOFError on the last message of each assignment.
Jul 28 2021, 4:44 PM · Docker environment, Compressed graph service
ardumont added a comment to T1936: integrate swh-graph into the docker environment.

Plus that documents it a bit in an automatic manner, so *thumbs up*.

Jul 28 2021, 10:37 AM · Docker environment, Compressed graph service

Jul 27 2021

vlorentz added a comment to T1936: integrate swh-graph into the docker environment.

I decided to make the swh-graph container create the compressed graph itself before starting. That's the easiest way to use it AND to implement it IMO.

Jul 27 2021, 5:39 PM · Docker environment, Compressed graph service
vlorentz claimed T1936: integrate swh-graph into the docker environment.
Jul 27 2021, 5:37 PM · Docker environment, Compressed graph service
vlorentz added a revision to T1936: integrate swh-graph into the docker environment: D6028: journalprocessor: Fix freeze on empty offset ranges..
Jul 27 2021, 5:37 PM · Docker environment, Compressed graph service
vlorentz added revisions to T1936: integrate swh-graph into the docker environment: D6029: Add swh-graph to the docker environment, D6030: server: Define make_app_from_configfile so it can be ran by gunicorn.
Jul 27 2021, 5:37 PM · Docker environment, Compressed graph service

Jul 2 2021

ardumont added a subtask for T3168: Proper deployment of swh-graph with debian package: T2100: Bootstrap Debian packaging for swh.graph.
Jul 2 2021, 3:42 PM · Compressed graph service, Puppet recipes
ardumont added a parent task for T2100: Bootstrap Debian packaging for swh.graph: T3168: Proper deployment of swh-graph with debian package.
Jul 2 2021, 3:42 PM · Compressed graph service

May 4 2021

vlorentz triaged T3305: Add tests for limit/max_edges in swh/graph/tests/test_api_client.py as Normal priority.
May 4 2021, 12:43 PM · Easy hack, Compressed graph service

May 3 2021

zack renamed T3301: graph: add test for the "algo" parameter of walk() from swh-graph: No tests of the "algo" parameter of walk() to graph: add test for the "algo" parameter of walk().
May 3 2021, 6:55 PM · Easy hack, Compressed graph service
vlorentz triaged T3302: Write docstrings for each method in swh/graph/backend.py as Normal priority.
May 3 2021, 5:34 PM · Compressed graph service
vlorentz triaged T3301: graph: add test for the "algo" parameter of walk() as Normal priority.
May 3 2021, 5:16 PM · Easy hack, Compressed graph service

Apr 23 2021

vlorentz assigned T2220: swh-graph in production to zack.
Apr 23 2021, 4:50 PM · Roadmap 2022, meta-task, Roadmap 2021, Compressed graph service

Apr 16 2021

olasd triaged T3259: Gracefully handle a client closing the connection in the middle of a response being streamed as Normal priority.
Apr 16 2021, 6:34 PM · Compressed graph service

Apr 14 2021

zack closed T1968: existing graph endpoints should not return 404 upon missing arguments as Invalid.

Sure! My apologies @Hakimb, but it's thanks to your work that we have realized what the right fate for this task was.

Apr 14 2021, 5:10 PM · Easy hack, Compressed graph service
seirl added a comment to T2981: Graph API: add a (node type) result filters.

I just want to write something here that maybe isn't clear from the initial task description. This filtering must happen *after* the visit, not during. We can already change *how* the graph is visited using the edges parameter; the goal of this task is to filter the result post-visit.

Apr 14 2021, 4:28 PM · Compressed graph service
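
To illustrate the distinction seirl draws above for T2981, here is a minimal sketch, not swh-graph's actual code. The names `visit`, `filter_results`, `allowed_edges` and `return_types` are hypothetical, the graph is a toy adjacency dict, and the node IDs are shortened fake SWHIDs. Restricting edges changes which nodes are reached; a result filter only trims the output of an unrestricted visit.

```python
from typing import Dict, Iterable, Iterator, List, Optional, Set, Tuple


def node_type(swhid: str) -> str:
    """Return the type field of a SWHID such as 'swh:1:rev:...'."""
    return swhid.split(":")[2]


def visit(
    graph: Dict[str, List[str]],
    src: str,
    allowed_edges: Optional[Set[Tuple[str, str]]] = None,
) -> Iterator[str]:
    """Depth-first visit. allowed_edges (hypothetical stand-in for the
    edges parameter) restricts which edges are followed, so it changes
    the visit itself."""
    seen = {src}
    stack = [src]
    while stack:
        node = stack.pop()
        yield node
        for succ in graph.get(node, []):
            edge = (node_type(node), node_type(succ))
            if allowed_edges is not None and edge not in allowed_edges:
                continue  # edge not followed: succ may never be reached
            if succ not in seen:
                seen.add(succ)
                stack.append(succ)


def filter_results(nodes: Iterable[str], return_types: Set[str]) -> Iterator[str]:
    """Post-visit filter: the traversal is unchanged, only the output is trimmed."""
    return (n for n in nodes if node_type(n) in return_types)


# Toy graph with fake, shortened IDs: revision -> directory -> content.
toy = {"swh:1:rev:01": ["swh:1:dir:02"], "swh:1:dir:02": ["swh:1:cnt:03"]}

# Filtering after the visit still reaches the content node through rev and dir.
contents = list(filter_results(visit(toy, "swh:1:rev:01"), {"cnt"}))

# Restricting the visit to dir->cnt edges never leaves the starting revision.
restricted = list(visit(toy, "swh:1:rev:01", allowed_edges={("dir", "cnt")}))
```

In this toy case the post-visit filter returns the content node, while the edge restriction cannot be used to express the same request because the intermediate edges are never followed.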
seirl added a comment to T1968: existing graph endpoints should not return 404 upon missing arguments.

Right, I suppose we can close the task then?

Apr 14 2021, 4:25 PM · Easy hack, Compressed graph service
zack updated subscribers of T1968: existing graph endpoints should not return 404 upon missing arguments.

@seirl, @vlorentz: I see your point, and I agree. We should never have used /nested/paths for this API.
Maybe we should just reconsider this and, once @Hakimb is ready with a new traversal language proposal, we can map it to a better REST API that uses query parameters, and deal properly with 4xx return codes.

Apr 14 2021, 4:15 PM · Easy hack, Compressed graph service
zack added a comment to T2981: Graph API: add a (node type) result filters.
In T2981#63164, @Hakimb wrote:

questions:

1/ So for the "filter that applies to visits that return nodes one by one" part, we are talking about: neighbors, walk, visit/nodes only?

Apr 14 2021, 4:13 PM · Compressed graph service
Hakimb added a comment to T2981: Graph API: add a (node type) result filters.

1/ So for the "filter that applies to visits that return nodes one by one" part, we are talking about: neighbors, walk, visit/nodes only?
2/ the filter is a query parameter I guess?

Apr 14 2021, 12:27 PM · Compressed graph service

Apr 13 2021

seirl updated subscribers of T1968: existing graph endpoints should not return 404 upon missing arguments.

@zack We talked about this on IRC with @vlorentz, and I think this issue is invalid. We chose to have the source and destination nodes as part of the URI in the API. Semantically, it makes sense that accessing the path without these path fragments returns a 404: it's not a missing argument but an invalid path. If we had ?src= and &dst= arguments instead, then a 400 error would make sense, but with our path-based design a 400 would be semantically odd.

Apr 13 2021, 7:05 PM · Easy hack, Compressed graph service
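
The 404-versus-400 argument in seirl's comment above can be sketched with two toy status functions. This is an illustration only: the route `/graph/path` and both function names are hypothetical and do not reproduce the swh-graph server. With path fragments, a URL lacking them matches no route; with query parameters, the resource exists but the request is incomplete.

```python
from urllib.parse import parse_qs, urlsplit


def status_path_style(url: str) -> int:
    """Nodes are path fragments: /graph/path/<src>/<dst> (hypothetical route)."""
    parts = [p for p in urlsplit(url).path.split("/") if p]
    if len(parts) == 4 and parts[:2] == ["graph", "path"]:
        return 200
    return 404  # no route matches: an invalid path, not a missing argument


def status_query_style(url: str) -> int:
    """Nodes are query parameters: /graph/path?src=...&dst=... (hypothetical route)."""
    split = urlsplit(url)
    if split.path.rstrip("/") != "/graph/path":
        return 404  # unknown resource
    params = parse_qs(split.query)
    if "src" not in params or "dst" not in params:
        return 400  # the resource exists, but a required argument is missing
    return 200
```

Under these assumptions, `/graph/path` alone yields 404 in the path style and 400 in the query style, which is the mapping zack suggests revisiting once a query-parameter API exists.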
Hakimb added a revision to T3161: graph service: add anti-DoS limit on the number of edges traversed: D5501: add an anti-Dos limit for edges traversed as a query parameter.
Apr 13 2021, 5:36 PM · Compressed graph service

Apr 8 2021

zack added a comment to T3161: graph service: add anti-DoS limit on the number of edges traversed.

OK, so @Hakimb: go for no default value. If the query param is not passed, the visit will not stop before the end. If it's given, it will stop once the limit is reached. Call the query param ?max_edges. You will find that the Java code already keeps track of the number of edges traversed, so you should just need to compare with that.

Apr 8 2021, 2:44 PM · Compressed graph service
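
The semantics zack describes above for T3161 can be sketched as follows. This is a toy Python model under stated assumptions, not the Java implementation: `bounded_visit` and the adjacency-dict graph are hypothetical, and only the parameter name `max_edges` comes from the comment. With no value the visit runs to completion; with a value the traversal itself stops once that many edges have been followed.

```python
from collections import deque
from typing import Dict, Iterator, List, Optional


def bounded_visit(
    graph: Dict[str, List[str]], src: str, max_edges: Optional[int] = None
) -> Iterator[str]:
    """Breadth-first visit yielding nodes as they are discovered.

    max_edges=None means no limit (no default value); otherwise the visit
    stops as soon as max_edges edges have been traversed.
    """
    seen = {src}
    queue = deque([src])
    edges_traversed = 0
    yield src
    while queue:
        node = queue.popleft()
        for succ in graph.get(node, []):
            if max_edges is not None and edges_traversed >= max_edges:
                return  # stop the visit itself, not just the output
            edges_traversed += 1
            if succ not in seen:
                seen.add(succ)
                queue.append(succ)
                yield succ
```

Stopping inside the loop, rather than truncating the result list afterwards, reflects the point made in the thread that it is the continued traversal that loads the backend.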
vlorentz added a comment to T3161: graph service: add anti-DoS limit on the number of edges traversed.

Should there be a default value for it or not? (We want this to be consistent with swh-storage.)

Apr 8 2021, 2:41 PM · Compressed graph service
zack added a comment to T3161: graph service: add anti-DoS limit on the number of edges traversed.

To complement what @vlorentz mentioned, we should actually stop the visit once the maximum number of edges has been reached, because it is continuing the visit (no matter how many results are returned afterwards) that can DoS the swh-graph backend.

Apr 8 2021, 2:24 PM · Compressed graph service
vlorentz added a comment to T3161: graph service: add anti-DoS limit on the number of edges traversed.
  1. I don't think you need to reproduce a DoS, just make sure we don't return more results than we should
  2. yes, a query parameter would be good
Apr 8 2021, 2:09 PM · Compressed graph service
Hakimb added a comment to T3161: graph service: add anti-DoS limit on the number of edges traversed.

I have two questions to make sure I understand correctly:

Apr 8 2021, 1:17 PM · Compressed graph service

Apr 7 2021

seirl closed T3178: document how to export the graph dataset automatically, a subtask of T1847: fully automate export of the graph dataset, as Invalid.
Apr 7 2021, 3:03 PM · Compressed graph service, Datasets
seirl added a subtask for T1847: fully automate export of the graph dataset: T2431: Document how to export the graph edge dataset.
Apr 7 2021, 3:03 PM · Compressed graph service, Datasets
seirl added a parent task for T2431: Document how to export the graph edge dataset: T1847: fully automate export of the graph dataset.
Apr 7 2021, 3:03 PM · Documentation, Compressed graph service, Datasets

Apr 6 2021

Hakimb added a revision to T1968: existing graph endpoints should not return 404 upon missing arguments: D5411: return a 400 error when accessing endpoints without the arguments.
Apr 6 2021, 11:31 AM · Easy hack, Compressed graph service

Mar 26 2021

zack reopened T1847: fully automate export of the graph dataset as "Open".

Reopening, as ideally we'd like to run the entire ORC export to completion once before closing.

Mar 26 2021, 12:25 PM · Compressed graph service, Datasets
seirl closed T1847: fully automate export of the graph dataset as Resolved.

The ORC exporter is done, and it's likely that we won't provide CSV exports in the future, or we'll generate them from the ORC format.

Mar 26 2021, 12:04 PM · Compressed graph service, Datasets

Mar 23 2021

seirl assigned T3168: Proper deployment of swh-graph with debian package to olasd.
Mar 23 2021, 12:24 PM · Compressed graph service, Puppet recipes
zack updated the task description for T3168: Proper deployment of swh-graph with debian package.
Mar 23 2021, 12:24 PM · Compressed graph service, Puppet recipes
zack added a project to T3168: Proper deployment of swh-graph with debian package: Compressed graph service.
Mar 23 2021, 12:23 PM · Compressed graph service, Puppet recipes

Mar 22 2021

zack renamed T3161: graph service: add anti-DoS limit on the number of edges traversed from graph service: add limit on the number of edges traversed to graph service: add anti-DoS limit on the number of edges traversed.
Mar 22 2021, 9:43 AM · Compressed graph service
zack added a subtask for T2220: swh-graph in production: T3161: graph service: add anti-DoS limit on the number of edges traversed.
Mar 22 2021, 9:43 AM · Roadmap 2022, meta-task, Roadmap 2021, Compressed graph service
zack added a parent task for T3161: graph service: add anti-DoS limit on the number of edges traversed: T2220: swh-graph in production.
Mar 22 2021, 9:43 AM · Compressed graph service
vlorentz added a comment to T3161: graph service: add anti-DoS limit on the number of edges traversed.

To be consistent with swh-storage, swh-graph should take a limit as a query parameter, but not have a hardcoded upper bound for that limit. Instead, swh-web provides that upper bound.

Mar 22 2021, 9:14 AM · Compressed graph service
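
The division of responsibility vlorentz proposes above can be sketched like this. Everything here is hypothetical: `backend_list`, `frontend_list` and the value of `FRONTEND_MAX_LIMIT` are made up for illustration and are not taken from swh-graph, swh-storage or swh-web. The backend honours whatever limit it receives, and the public-facing layer clamps the requested limit to its own upper bound.

```python
from itertools import islice
from typing import Iterable, List, Optional

# Hypothetical upper bound owned by the public front end, not by the backend.
FRONTEND_MAX_LIMIT = 1000


def backend_list(results: Iterable[int], limit: Optional[int] = None) -> List[int]:
    """Backend side: apply the caller's limit, with no hardcoded cap."""
    return list(islice(results, limit))


def frontend_list(
    results: Iterable[int], requested_limit: Optional[int] = None
) -> List[int]:
    """Front-end side: clamp the requested limit before calling the backend."""
    if requested_limit is None:
        limit = FRONTEND_MAX_LIMIT
    else:
        limit = min(requested_limit, FRONTEND_MAX_LIMIT)
    return backend_list(results, limit)
```

Internal callers can then call the backend without a cap, while anonymous traffic passing through the front end is always bounded.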
zack triaged T3161: graph service: add anti-DoS limit on the number of edges traversed as Normal priority.
Mar 22 2021, 9:12 AM · Compressed graph service
zack closed T2113: swh-graph: add support to optionally resolve ori PIDs to origin URLs as Wontfix.

Now that this is (optionally) done by swh-web, I don't think we want to implement it in swh-graph too.

Mar 22 2021, 8:56 AM · Compressed graph service

Mar 19 2021

rdicosmo moved T2220: swh-graph in production from Backlog to Work in progress on the Roadmap 2021 board.
Mar 19 2021, 12:45 PM · Roadmap 2022, meta-task, Roadmap 2021, Compressed graph service

Mar 16 2021

Kaustuv942 added a comment to T1968: existing graph endpoints should not return 404 upon missing arguments.

Ohh, all right.

Mar 16 2021, 8:44 PM · Easy hack, Compressed graph service
zack placed T1968: existing graph endpoints should not return 404 upon missing arguments up for grabs.

Dear @Kaustuv942, sure, patches welcome. We do not use task claiming for non-regular contributors, though; just submit a patch when you have one.

Mar 16 2021, 6:59 PM · Easy hack, Compressed graph service
Kaustuv942 added a comment to T1968: existing graph endpoints should not return 404 upon missing arguments.

Hello @zack I want to complete this task.

Mar 16 2021, 5:59 PM · Easy hack, Compressed graph service
Kaustuv942 added a comment to T1968: existing graph endpoints should not return 404 upon missing arguments.
Mar 16 2021, 5:33 PM · Easy hack, Compressed graph service
Kaustuv942 claimed T1968: existing graph endpoints should not return 404 upon missing arguments.
Mar 16 2021, 5:30 PM · Easy hack, Compressed graph service

Mar 14 2021

zack added a parent task for T3125: add revision timestamp to the compression timeline: T3126: API: add endpoint to find the earliest revision referencing a dir/cnt node.
Mar 14 2021, 12:04 PM · Compressed graph service
zack added a subtask for T3126: API: add endpoint to find the earliest revision referencing a dir/cnt node: T3125: add revision timestamp to the compression timeline.
Mar 14 2021, 12:04 PM · Compressed graph service
zack triaged T3126: API: add endpoint to find the earliest revision referencing a dir/cnt node as Normal priority.
Mar 14 2021, 12:03 PM · Compressed graph service
zack triaged T3125: add revision timestamp to the compression timeline as Normal priority.
Mar 14 2021, 12:02 PM · Compressed graph service

Mar 8 2021

rdicosmo added a parent task for T2220: swh-graph in production: T2204: Full-text search on source code (prototype).
Mar 8 2021, 10:47 AM · Roadmap 2022, meta-task, Roadmap 2021, Compressed graph service
rdicosmo updated the task description for T2220: swh-graph in production.
Mar 8 2021, 10:12 AM · Roadmap 2022, meta-task, Roadmap 2021, Compressed graph service
rdicosmo added a parent task for T2220: swh-graph in production: T3096: Efficient and reliable download via the Vault.
Mar 8 2021, 10:08 AM · Roadmap 2022, meta-task, Roadmap 2021, Compressed graph service
rdicosmo removed a project from T2220: swh-graph in production: Roadmap 2020.
Mar 8 2021, 9:57 AM · Roadmap 2022, meta-task, Roadmap 2021, Compressed graph service
rdicosmo added a project to T2220: swh-graph in production: Roadmap 2021.
Mar 8 2021, 9:52 AM · Roadmap 2022, meta-task, Roadmap 2021, Compressed graph service

Mar 3 2021

zack added a project to T1968: existing graph endpoints should not return 404 upon missing arguments: Easy hack.
Mar 3 2021, 9:51 AM · Easy hack, Compressed graph service
zack updated the task description for T2981: Graph API: add a (node type) result filters.
Mar 3 2021, 9:50 AM · Compressed graph service

Mar 1 2021

vlorentz moved T2114: swh-graph API: add ?limit=N method variants to return first N results from In progress to Done on the Easy hack board.
Mar 1 2021, 1:39 PM · Easy hack, Compressed graph service

Jan 21 2021

zack triaged T2983: graph service: allow loading in memory only one direction of the graph as Low priority.
Jan 21 2021, 10:09 AM · Compressed graph service

Jan 20 2021

zack triaged T2981: Graph API: add a (node type) result filters as Normal priority.
Jan 20 2021, 3:25 PM · Compressed graph service

Jan 9 2021

zack added a revision to T2647: add LLP support to graph compression pipeline: D4821: Add LLP compression to the WebGraph pipeline.
Jan 9 2021, 4:06 PM · Compressed graph service

Jan 8 2021

anlambert added a comment to T2900: Public graph/ API does not handle streaming results from endpoints.

The fix is now deployed and proxied graph responses are now properly streamed \o/

Jan 8 2021, 3:54 PM · System administration, Compressed graph service, Web app
seirl closed T2595: Add a default configuration based on graph size (eg: batch_size) as Resolved by committing rDGRPH5a987aae6e93: config: sane default for batch_size using a heuristic on ram size.
Jan 8 2021, 3:27 PM · Compressed graph service