Page MenuHomeSoftware Heritage
Feed Advanced Search

Nov 18 2019

zack closed T2096: CNAME for graph service: graph.internal.softwareheritage.org (?) as Resolved by committing rSPSITEc54e88a14607: add CNAME graph -> granet.
Nov 18 2019, 3:05 PM · Compressed graph service, System administration
zack updated the diff for D2297: add CNAME graph -> granet.

Updating D2297: add CNAME graph -> granet

Nov 18 2019, 3:05 PM
zack added a reviewer for D2297: add CNAME graph -> granet: olasd.
Nov 18 2019, 2:54 PM
zack added a revision to T2096: CNAME for graph service: graph.internal.softwareheritage.org (?): D2297: add CNAME graph -> granet.
Nov 18 2019, 2:54 PM · Compressed graph service, System administration
zack created D2297: add CNAME graph -> granet.
Nov 18 2019, 2:54 PM
zack raised the priority of T1848: refresh graph dataset export from Low to Normal.
Nov 18 2019, 2:50 PM · Datasets
zack lowered the priority of T1847: fully automate export of the graph dataset from High to Normal.
Nov 18 2019, 2:50 PM · Compressed graph service, Datasets
zack added a project to T1847: fully automate export of the graph dataset: Compressed graph service.
Nov 18 2019, 2:48 PM · Compressed graph service, Datasets
zack raised the priority of T1868: refresh compressed representation of the archive from Low to Normal.
Nov 18 2019, 2:48 PM · Compressed graph service
zack committed rDGRPH5e46dc2452fb: REST API doc: cross-ref method variants with main methods (authored by zack).
REST API doc: cross-ref method variants with main methods
Nov 18 2019, 2:41 PM
zack committed rDGRPHe2f31c2b3e8f: test_api_client.py: do not test /walk parameter validation (authored by zack).
test_api_client.py: do not test /walk parameter validation
Nov 18 2019, 2:03 PM
zack committed rDGRPH4fd7d3250395: client.py: renamse check_status -> raise_for_status (authored by zack).
client.py: renamse check_status -> raise_for_status
Nov 18 2019, 2:03 PM
zack committed rDGRPHb1c8234fad06: test_api_client: check HTTP status codes for bad requests (authored by zack).
test_api_client: check HTTP status codes for bad requests
Nov 18 2019, 2:03 PM
zack committed rDGRPH3d5083ee11f4: add tests for parameter validation (authored by zack).
add tests for parameter validation
Nov 18 2019, 2:03 PM
zack closed D2265: add tests for parameter validation.
Nov 18 2019, 2:03 PM
zack updated the diff for D2265: add tests for parameter validation.
  • add tests for parameter validation
  • client.py: renamse check_status -> raise_for_status
  • test_api_client: check HTTP status codes for bad requests
  • test_api_client.py: do not test /walk parameter validation
Nov 18 2019, 2:01 PM
zack committed rDGRPH51d6b602c3e8: add /last sub-endpoint to only return destination in walks (authored by zack).
add /last sub-endpoint to only return destination in walks
Nov 18 2019, 1:56 PM
zack committed rDGRPH97e84765c276: REST API doc: document /last for all applicable methods (authored by zack).
REST API doc: document /last for all applicable methods
Nov 18 2019, 1:56 PM
zack closed T2084: swh-graph: add /last endpoint variants to the REST API as Resolved by committing rDGRPH51d6b602c3e8: add /last sub-endpoint to only return destination in walks.
Nov 18 2019, 1:56 PM · Compressed graph service
zack closed D2289: add /last sub-endpoint to only return destination in walks.
Nov 18 2019, 1:56 PM
zack updated the diff for D2289: add /last sub-endpoint to only return destination in walks.
  • add /last sub-endpoint to only return destination in walks
  • REST API doc: document /last for all applicable methods
Nov 18 2019, 1:53 PM
zack committed rDGRPH51cd3e1c263c: REST API tests: skip /walk (currently disabled) (authored by zack).
REST API tests: skip /walk (currently disabled)
Nov 18 2019, 1:52 PM
zack placed T1969: graph: reduce RAM usage for /walk up for grabs.
Nov 18 2019, 1:51 PM · Compressed graph service
zack reopened T1969: graph: reduce RAM usage for /walk as "Open".
Nov 18 2019, 1:51 PM · Compressed graph service
zack added inline comments to D2289: add /last sub-endpoint to only return destination in walks.
Nov 18 2019, 1:49 PM
zack updated the diff for D2289: add /last sub-endpoint to only return destination in walks.
  • add /last sub-endpoint to only return destination in walks
  • REST API doc: document /last for all applicable methods
Nov 18 2019, 1:49 PM
zack updated the diff for D2265: add tests for parameter validation.
  • add tests for parameter validation
  • client.py: renamse check_status -> raise_for_status
  • test_api_client: check HTTP status codes for bad requests
Nov 18 2019, 1:42 PM
zack closed T1969: graph: reduce RAM usage for /walk as Resolved by committing rDGRPH3c3f004352f8: REST API: disable /walk.
Nov 18 2019, 1:38 PM · Compressed graph service
zack committed rDGRPH3c3f004352f8: REST API: disable /walk (authored by zack).
REST API: disable /walk
Nov 18 2019, 1:38 PM
zack closed D2296: REST API: disable /walk.
Nov 18 2019, 1:38 PM
zack closed D2266: RPCClient: check HTTP status code for errors also when streaming.

merged in rDCOREd0e2f59a56bb

Nov 18 2019, 1:34 PM
zack committed rDCOREd0e2f59a56bb: RPCClient: add response attribute to RemoteException (authored by zack).
RPCClient: add response attribute to RemoteException
Nov 18 2019, 1:31 PM
zack committed rDCORE4b9e92acb8fa: RPCClent: rename and refactor check_status (now raise_for_status) (authored by zack).
RPCClent: rename and refactor check_status (now raise_for_status)
Nov 18 2019, 1:31 PM
zack committed rDCORE49a8a670d8f4: RPCClient: check HTTP status code for errors also when streaming (authored by zack).
RPCClient: check HTTP status code for errors also when streaming
Nov 18 2019, 1:31 PM
zack added a revision to T1969: graph: reduce RAM usage for /walk: D2296: REST API: disable /walk.
Nov 18 2019, 1:26 PM · Compressed graph service
zack created D2296: REST API: disable /walk.
Nov 18 2019, 1:26 PM

Nov 16 2019

zack updated the diff for D2289: add /last sub-endpoint to only return destination in walks.
  • add /last sub-endpoint to only return destination in walks
  • REST API doc: document /last for all applicable methods
Nov 16 2019, 8:02 PM
zack updated the diff for D2289: add /last sub-endpoint to only return destination in walks.
  • REST API doc: document /last for all applicable methods
Nov 16 2019, 7:31 PM
zack committed rDGRPHacb78fb9051a: REST API doc: clarify edge restriciton semantics for the backward graph (authored by zack).
REST API doc: clarify edge restriciton semantics for the backward graph
Nov 16 2019, 5:02 PM
zack closed T1888: graph API documentation: clarify the relationship between directory=backward and edges= as Resolved by committing rDGRPHacb78fb9051a: REST API doc: clarify edge restriciton semantics for the backward graph.
Nov 16 2019, 5:02 PM · Documentation, Compressed graph service
zack closed D2290: REST API doc: clarify edge restriciton semantics for the backward graph.
Nov 16 2019, 5:02 PM
zack changed the status of T1888: graph API documentation: clarify the relationship between directory=backward and edges= from Open to Work in Progress.
Nov 16 2019, 3:41 PM · Documentation, Compressed graph service
zack added a revision to T1888: graph API documentation: clarify the relationship between directory=backward and edges=: D2290: REST API doc: clarify edge restriciton semantics for the backward graph.
Nov 16 2019, 3:41 PM · Documentation, Compressed graph service
zack created D2290: REST API doc: clarify edge restriciton semantics for the backward graph.
Nov 16 2019, 3:41 PM
zack closed T2072: common configuration file for swh graph rpc-serve, compress, … as Resolved.

this has been merged into master with commit 9cece1f6722ce836cec9353b928bb4bb4b7b77e6

Nov 16 2019, 3:23 PM · Compressed graph service
zack closed T2077: add random walk endpoint with limited retries as Resolved.

this has been merged into master in commit 40daed1eaa06da82afd14652200b903c807c34ae

Nov 16 2019, 3:22 PM · Compressed graph service
zack triaged T2096: CNAME for graph service: graph.internal.softwareheritage.org (?) as Low priority.
Nov 16 2019, 3:21 PM · Compressed graph service, System administration
zack committed rDGRPHa76d1e045db2: REST API doc: fix minor markup error (authored by zack).
REST API doc: fix minor markup error
Nov 16 2019, 3:16 PM
zack changed the status of T2084: swh-graph: add /last endpoint variants to the REST API from Open to Work in Progress.

upon review, /last only makes sense for /walk and /randomwalk endpoints, for other endpoints it doesn't, as it will return arbitrary results (e.g., what's the point of knowing the last neighbor of one node, given they're in arbitrary order?)

Nov 16 2019, 3:15 PM · Compressed graph service
zack added a revision to T2084: swh-graph: add /last endpoint variants to the REST API: D2289: add /last sub-endpoint to only return destination in walks.
Nov 16 2019, 3:14 PM · Compressed graph service
zack created D2289: add /last sub-endpoint to only return destination in walks.
Nov 16 2019, 3:14 PM
zack triaged T2095: Cron <root@uffizi> /usr/local/bin/swh-dar-copy-remote-backup "munin0" "munin0" "/srv/backups" "/srv/remote-backups" as Normal priority.
Nov 16 2019, 1:22 PM · System administration
zack created T2095: Cron <root@uffizi> /usr/local/bin/swh-dar-copy-remote-backup "munin0" "munin0" "/srv/backups" "/srv/remote-backups".
Nov 16 2019, 1:22 PM · System administration

Nov 15 2019

zack added a comment to D2265: add tests for parameter validation.

That would be ok with me.

Nov 15 2019, 11:59 PM
zack updated the diff for D2265: add tests for parameter validation.
  • client.py: renamse check_status -> raise_for_status
  • test_api_client: check HTTP status codes for bad requests
Nov 15 2019, 11:58 PM
zack updated the diff for D2266: RPCClient: check HTTP status code for errors also when streaming.
  • RPCClent: rename and refactor check_status (now raise_for_status)
  • RPCClient: add response attribute to RemoteException
Nov 15 2019, 11:50 PM
zack triaged T2094: KeyError: 'content:add' in swh.loader.core.loader as High priority.
Nov 15 2019, 11:23 PM · Git loader
zack accepted D2288: graph backend: use global config.
Nov 15 2019, 7:58 PM

Nov 14 2019

zack committed rMSLDf9d3c9c9596d: check-in slides for UZH talk (authored by zack).
check-in slides for UZH talk
Nov 14 2019, 11:11 AM

Nov 13 2019

zack closed D2249: swh-graph: add random walk endpoint.

merged in 97e79c289aff6625ebdd9068a006c10aca28fc12

Nov 13 2019, 6:04 PM
zack updated the diff for D2249: swh-graph: add random walk endpoint.
  • add random walk endpoint
  • random walk: include starting node into returned path
  • random walk: use reservoir sampling to pick random successor
Nov 13 2019, 6:01 PM
zack triaged T2084: swh-graph: add /last endpoint variants to the REST API as Normal priority.
Nov 13 2019, 5:12 PM · Compressed graph service
zack committed rDGRPHf02a5a973dbe: docker: add helpers to build/run container and update doc accordingly (authored by zack).
docker: add helpers to build/run container and update doc accordingly
Nov 13 2019, 5:09 PM
zack committed rDGRPH093cebe77b90: Dockerfile: deps adjustment: add zstd, remove time (authored by zack).
Dockerfile: deps adjustment: add zstd, remove time
Nov 13 2019, 4:34 PM
zack committed rDGRPHa35a0c9a2a01: find_graph_jar(): also look under /usr/local/share (authored by zack).
find_graph_jar(): also look under /usr/local/share
Nov 13 2019, 4:09 PM
zack committed rDGRPHb866420e1118: Dockerfile: also install python3-wheel, needed for pip bdist_wheel (authored by zack).
Dockerfile: also install python3-wheel, needed for pip bdist_wheel
Nov 13 2019, 4:09 PM
zack renamed T1968: existing graph endpoints should not return 404 upon missing arguments from existing graph endpoints should not return 404 upon for missing arguments to existing graph endpoints should not return 404 upon missing arguments.
Nov 13 2019, 3:32 PM · Easy hack, Compressed graph service
zack triaged T2083: provide systemd service file for swh-graph as Low priority.
Nov 13 2019, 3:32 PM · Compressed graph service
zack added a comment to D2265: add tests for parameter validation.

Could you make the tests check for the error message? (with raises(RemoteException, match='regexp'):)

Nov 13 2019, 12:59 PM
zack committed rDGRPH0a39c45f3dd4: fat JAR: trim size down from 100 to 40 MB, by excluding mg4j (authored by zack).
fat JAR: trim size down from 100 to 40 MB, by excluding mg4j
Nov 13 2019, 12:01 PM
zack updated the summary of D2265: add tests for parameter validation.
Nov 13 2019, 9:38 AM
zack created D2266: RPCClient: check HTTP status code for errors also when streaming.
Nov 13 2019, 9:36 AM
zack created D2265: add tests for parameter validation.
Nov 13 2019, 9:35 AM
zack committed rDCOREada38193b0c4: test_rpc_client_server.py: fix typo in docstring (authored by zack).
test_rpc_client_server.py: fix typo in docstring
Nov 13 2019, 9:08 AM

Nov 12 2019

zack committed rDGRPHc77e8e6007b2: webgraph.py: improve logging by adding explicit start/end timings (authored by zack).
webgraph.py: improve logging by adding explicit start/end timings
Nov 12 2019, 10:28 PM
zack updated the diff for D2249: swh-graph: add random walk endpoint.
  • random walk: include starting node into returned path
  • random walk: use reservoir sampling to pick random successor
Nov 12 2019, 5:38 PM
zack assigned T2055: swh-graph CI hangs badly when py4j doesn't find needed files to seirl.

Another simple way to reproduce is just removing the *.jar file and running pytest on test_api_client.py.
This is not even a Java exception, but chances are fixing that case will fix at least a significant part of the general problem, if not all.

Nov 12 2019, 4:16 PM · Continuous Integration, Compressed graph service
zack triaged T2081: swh-graph: "Cannot open client FIFO" when answering HEAD requests as Low priority.
Nov 12 2019, 2:58 PM · Compressed graph service
zack updated the diff for D2249: swh-graph: add random walk endpoint.

swh-graph: add random walk endpoint

Nov 12 2019, 1:31 PM
zack committed rDGRPHfad1e71325df: app.py: fix wrong invocation of node_of_pid in /walk (authored by zack).
app.py: fix wrong invocation of node_of_pid in /walk
Nov 12 2019, 1:09 PM
zack closed D2247: REST server: support HEAD requests.

Closed in f9bdeef394329857c3500f0842e72a031aa996ff

Nov 12 2019, 12:29 PM
zack updated the diff for D2247: REST server: support HEAD requests.

Updating D2247: REST server: support HEAD requests

Nov 12 2019, 12:26 PM
zack closed D2246: REST server: add validation for PID parameters.

Closed in cc712446e8ef1e6aa7bf23488108778d5f847897

Nov 12 2019, 12:25 PM
zack updated the diff for D2246: REST server: add validation for PID parameters.
  • REST server: validate all query parameters and refactor validators
Nov 12 2019, 12:20 PM
zack updated the diff for D2247: REST server: support HEAD requests.

REST server: support HEAD requests

Nov 12 2019, 11:38 AM
zack updated the diff for D2249: swh-graph: add random walk endpoint.
  • randomwalk: do not bother randomizing singleton sets
Nov 12 2019, 10:58 AM
zack added a comment to T1970: Web API: make /origin/ return the swh:1:ori:... PID.

Related T2045

Nov 12 2019, 10:50 AM · Web app
zack updated the diff for D2246: REST server: add validation for PID parameters.

REST server: add validation for PID parameters

Nov 12 2019, 8:49 AM
zack updated the diff for D2249: swh-graph: add random walk endpoint.
  • skeleton for random walk endpoint
  • REST client: add binding for /randomwalk
  • test_api_client: add test case for /randomwalk
  • REST doc: update /randomwalk doc to match what is returned
  • access edge stats: also count skipped edges for /randomwalk
  • /randomwalk: use proper runtime errors instead of assertions
  • app.py: add TODO about making RANDOM_RETRIES configurable
  • app.py: refactor random v. non-random walk iteration logic
Nov 12 2019, 8:48 AM
zack added inline comments to D2249: swh-graph: add random walk endpoint.
Nov 12 2019, 8:47 AM
zack committed rDGRPHed9e0c00ae9b: REST API doc: document /count method variants (authored by zack).
REST API doc: document /count method variants
Nov 12 2019, 8:35 AM
zack committed rDGRPH779fc85fcb51: REST API doc: update to match current aiohttp implementation (authored by zack).
REST API doc: update to match current aiohttp implementation
Nov 12 2019, 8:35 AM
zack closed D2245: REST API doc: update to match current aiohttp implementation.
Nov 12 2019, 8:35 AM
zack updated the diff for D2247: REST server: support HEAD requests.

REST server: support HEAD requests

Nov 12 2019, 8:34 AM
zack updated the diff for D2245: REST API doc: update to match current aiohttp implementation.
  • REST API doc: update to match current aiohttp implementation
  • REST API doc: document /count method variants
Nov 12 2019, 8:32 AM
zack committed rDGRPH72d62a1a4602: app.py: inline MIME types, they are single use anyway (authored by zack).
app.py: inline MIME types, they are single use anyway
Nov 12 2019, 8:31 AM
zack committed rDGRPHe602cdd347d1: REST server: set content-type to text or ndjson where appropriate (authored by zack).
REST server: set content-type to text or ndjson where appropriate
Nov 12 2019, 8:31 AM
zack closed D2244: REST server: set content-type to text or ndjson where appropriate.
Nov 12 2019, 8:31 AM
zack updated the diff for D2244: REST server: set content-type to text or ndjson where appropriate.
  • REST server: set content-type to text or ndjson where appropriate
  • app.py: inline MIME types, they are single use anyway
Nov 12 2019, 8:28 AM
zack added inline comments to D2244: REST server: set content-type to text or ndjson where appropriate.
Nov 12 2019, 8:28 AM