Page MenuHomeSoftware Heritage

zack (Stefano Zacchiroli)
UserAdministrator

User Details

User Since
Sep 7 2015, 3:43 PM (350 w, 4 d)
Roles
Administrator

Recent Activity

Wed, May 18

zack added a comment to T3560: Polish the swh-search QL.

Hey @vlorentz @zack, I've been using sourcegraph.com for almost a year now and I feel that they have worked a lot on polishing their search query language. I think we can learn from them and adapt our language. Here are a few suggestions:

Wed, May 18, 1:32 PM · Archive search
zack resigned from D7839: Documentation overhaul.

Monumental documentation work, thanks!
I think this is generally great, and I've pointed out only some minor issues/suggestions here and there.

Wed, May 18, 1:21 PM · Compressed graph service

Sun, May 15

zack committed rMSLD8e903979a72a: latex: disable use of \rowcolors, broken with texlive >= 2022 (authored by zack).
latex: disable use of \rowcolors, broken with texlive >= 2022
Sun, May 15, 3:56 PM

Fri, May 13

zack resigned from D7814: Remove dead code from the Python interface.
Fri, May 13, 4:37 PM
zack added a comment to D7814: Remove dead code from the Python interface.
In D7814#203592, @seirl wrote:
In D7814#203336, @zack wrote:

I'm fine with this code cleanup, with one caveat: that we document/ship the systemd startup service (and its meaning, including some intuitions about the trade-offs you mention) somewhere, in replacement of the cachemount command.

It's already in puppet (swh-site/site-modules/profile/templates/swh/deploy/graph/swhgraphshm.service.erb).

Fri, May 13, 7:38 AM

Wed, May 11

zack requested changes to D7814: Remove dead code from the Python interface.

I'm fine with this code cleanup, with one caveat: that we document/ship the systemd startup service (and its meaning, including some intuitions about the trade-offs you mention) somewhere, in replacement of the cachemount command.

Wed, May 11, 10:40 PM

Fri, May 6

zack created P1359 (An Untitled Masterwork).
Fri, May 6, 4:50 PM

Fri, Apr 29

zack renamed T3652: Cannot ingest git repositories with (too) large packfiles from Ingest git loader origins with smaller packfiles to Cannot ingest git repositories with (too) large packfiles.
Fri, Apr 29, 4:00 PM · Git loader

Thu, Apr 28

zack created P1354 (An Untitled Masterwork).
Thu, Apr 28, 2:09 PM

Apr 26 2022

zack accepted D7686: docs: remove PostgreSQL local setup.
Apr 26 2022, 4:36 PM

Apr 22 2022

zack renamed T2833: cpan.loader - archive Perl modules from CPAN from cpan.loader - preserver Perl modules from CPAN to cpan.loader - archive Perl modules from CPAN.
Apr 22 2022, 11:26 AM · Archive coverage

Apr 5 2022

zack changed the status of T1743: create a nice landing web page for exported dataset, a subtask of T3085: Complete and updated copy of the archive on S3 (objects+graph), from Open to Work in Progress.
Apr 5 2022, 1:39 PM · Roadmap 2022, meta-task, Roadmap 2021, System administration, Object storage
zack changed the status of T1743: create a nice landing web page for exported dataset from Open to Work in Progress.
Apr 5 2022, 1:39 PM · Datasets
zack changed the status of T3329: document ORC format dataset availability from Open to Work in Progress.
Apr 5 2022, 1:38 PM · Datasets

Apr 1 2022

zack accepted D7487: Docs: update dataset list with recent datasets.

Just a nitpick: either add thousand separators to the node/edge counts, or summarized them with M/B suffixes.
Rationale: those numbers are so huge that are hard to read without that.

Apr 1 2022, 5:05 PM
zack committed rDDOC2d89a6533a58: roadmap 2022 intro: fix year and improve wording (authored by zack).
roadmap 2022 intro: fix year and improve wording
Apr 1 2022, 3:36 PM

Mar 30 2022

zack removed a watcher for Developers: zack.
Mar 30 2022, 3:39 PM
zack added a watcher for Team HR and management: zack.
Mar 30 2022, 1:43 PM
zack added a member for Datasets: seirl.
Mar 30 2022, 1:42 PM
zack added a watcher for Datasets: zack.
Mar 30 2022, 1:41 PM
zack added a watcher for Code scanner: zack.
Mar 30 2022, 1:41 PM
zack added a member for Software Heritage filesystem: zack.
Mar 30 2022, 1:41 PM
zack added a watcher for Compressed graph service: zack.
Mar 30 2022, 1:41 PM
zack added a watcher for Software Heritage filesystem: zack.
Mar 30 2022, 1:41 PM
zack renamed Compressed graph service from Graph service to Compressed graph service.
Mar 30 2022, 1:40 PM
zack added a member for Datasets: zack.
Mar 30 2022, 1:39 PM
zack added a member for Code scanner: zack.
Mar 30 2022, 1:39 PM

Mar 10 2022

zack triaged T4029: create vpn and unix account for Andrey to access granet as High priority.
Mar 10 2022, 11:13 AM · System administration

Mar 3 2022

zack added a member for Staff: bchauvet.
Mar 3 2022, 8:40 AM
zack removed a member for Staff: compay2k.
Mar 3 2022, 8:40 AM
zack added a member for Reviewers: bchauvet.
Mar 3 2022, 8:39 AM
zack removed a member for Reviewers: compay2k.
Mar 3 2022, 8:39 AM
zack removed a member for Developers: compay2k.
Mar 3 2022, 8:39 AM
zack added a member for Developers: bchauvet.
Mar 3 2022, 8:39 AM

Mar 2 2022

zack added a member for Team HR and management: bchauvet.
Mar 2 2022, 6:50 PM
zack accepted D7274: onboarding: Mention the creds needed for HTTP Basic auth for the intranet wiki.
Mar 2 2022, 10:45 AM

Mar 1 2022

zack added a member for Team HR and management: compay2k.
Mar 1 2022, 11:04 AM

Feb 25 2022

zack added a member for Staff: compay2k.
Feb 25 2022, 11:16 AM

Feb 22 2022

zack added a subtask for T3952: Make the search query language a first class citizen : T3560: Polish the swh-search QL.
Feb 22 2022, 6:46 PM · meta-task, Roadmap 2022, Archive search
zack added a parent task for T3560: Polish the swh-search QL: T3952: Make the search query language a first class citizen .
Feb 22 2022, 6:46 PM · Archive search

Feb 17 2022

zack added a comment to D7192: Route for fetching Git-encoded objects.

Sorry, that is a bit rambly and not very helpful. @anlambert @zack What do you think?

Feb 17 2022, 1:06 PM

Feb 10 2022

zack triaged T3923: Include submodules recursively when saving git repositories as Normal priority.
Feb 10 2022, 7:44 AM · Git loader, Save Code Now

Jan 27 2022

zack added a comment to T3887: Storing multiple authors in Revisions and Releases.

Then let's just go for it (insert here ref. to upcoming separate task :-)).

Jan 27 2022, 6:00 PM · SWORD deposit, Data Model, BZR loader
zack added a comment to T3887: Storing multiple authors in Revisions and Releases.
In T3887#77949, @olasd wrote:

Now that I've written it out loud, of course, Releases don't have extra_headers so the package loaders can't make use of this workaround/hack for now.

Jan 27 2022, 5:40 PM · SWORD deposit, Data Model, BZR loader
zack committed rMSLD4bf5d5d41819: check in recent presentations (authored by zack).
check in recent presentations
Jan 27 2022, 10:13 AM

Jan 25 2022

zack triaged T3885: Filter rows of size >32MB from dataset export as Normal priority.
Jan 25 2022, 1:32 PM · Datasets

Jan 10 2022

zack committed R183:6b876e2ac76b: add several entries about reproducibility, FOSS geography, and diversity (authored by zack).
add several entries about reproducibility, FOSS geography, and diversity
Jan 10 2022, 7:58 PM

Jan 4 2022

zack closed T3260: publish swh.dataset to pypi as Resolved.
Jan 4 2022, 1:42 PM · Continuous Integration, Datasets
zack changed the status of T3768: Read compression input from ORC instead of the edges file from Open to Work in Progress.
Jan 4 2022, 1:35 PM · Compressed graph service

Jan 3 2022

zack added a comment to T3822: Update the fundraising banner.

@marla.dasilva @anlambert: let's go for "Until Jan 30th" then. (I'll also ping you about this in the chat, just in case.)

Jan 3 2022, 3:43 PM · Unknown Object (Project)
zack triaged T3822: Update the fundraising banner as High priority.

Thanks Marla, I also planned to raise this.

Jan 3 2022, 11:13 AM · Unknown Object (Project)

Dec 16 2021

zack triaged T3811: archive.s.o: change Debian tooltip to include derivatives as Low priority.
Dec 16 2021, 10:40 AM · Web app
zack renamed T2400: Ingest current and historical Ubuntu releases from Ingest current and history Ubuntu releases to Ingest current and historical Ubuntu releases.
Dec 16 2021, 10:36 AM · System administration, Debian loader, Package Loader, Archive coverage

Dec 14 2021

zack raised the priority of T3161: graph service: add anti-DoS limit on the number of edges traversed from Normal to High.
Dec 14 2021, 1:31 PM · Compressed graph service

Dec 6 2021

zack accepted D4821: Add LLP compression to the WebGraph pipeline.

Just to be sure: test_pipeline() from test_cli.py is now run with all new passes as well, and as such it also testes the LLP step(s), correct?
It seems that way to me because test_pipeline() seems to be running all passes, but I'd like this to be double-checked before landing.

Dec 6 2021, 6:08 PM

Dec 4 2021

zack committed rMSLDfc9bffe30c07: check-in slides for tech presentation at #swh5years sponsors meeting (authored by zack).
check-in slides for tech presentation at #swh5years sponsors meeting
Dec 4 2021, 10:22 AM

Dec 1 2021

zack moved T2595: Add a default configuration based on graph size (eg: batch_size) from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 5:00 PM · Compressed graph service
zack changed the status of T2113: swh-graph: add support to optionally resolve ori PIDs to origin URLs from Wontfix to Resolved.
Dec 1 2021, 5:00 PM · Compressed graph service
zack moved T2112: make "swh graph map lookup" accept lists of identifiers from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 5:00 PM · Compressed graph service
zack moved T2096: CNAME for graph service: graph.internal.softwareheritage.org (?) from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 5:00 PM · Compressed graph service, System administration
zack moved T2083: provide systemd service file for swh-graph from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 5:00 PM · Compressed graph service
zack changed the status of T2056: fix swh-graph sphinx table of content from Invalid to Resolved.
Dec 1 2021, 5:00 PM · Documentation, Compressed graph service
zack moved T1933: bad invocation of o.s.graph.backend.Setup in docker doc from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 5:00 PM · Compressed graph service
zack moved T1898: swh-graph: refactor algo implementations to not forcibly memoize results from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 5:00 PM · Compressed graph service
zack moved T1888: graph API documentation: clarify the relationship between directory=backward and edges= from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 5:00 PM · Documentation, Compressed graph service
zack moved T1884: python bindings for compressed graph access from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 5:00 PM · Compressed graph service
zack moved T1937: nicer landing page for the swh-graph REST API from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 5:00 PM · Compressed graph service
zack changed the status of T1968: existing graph endpoints should not return 404 upon missing arguments from Invalid to Resolved.
Dec 1 2021, 5:00 PM · Easy hack, Compressed graph service
zack moved T1936: integrate swh-graph into the docker environment from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:38 PM · Docker environment, Compressed graph service
zack changed the status of T1930: swh-graph: ship swh-graph.jar in the docker container from Wontfix to Resolved.
Dec 1 2021, 4:38 PM · Compressed graph service
zack moved T1887: publish swh-graph documentation at docs.s.o from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:38 PM · Documentation, Compressed graph service
zack moved T1851: Integrate graph-compression git repo in swh-environment from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:38 PM · Compressed graph service
zack moved T1877: Add contextual info to compression pipeline from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service
zack moved T1878: Write documentation on compression process from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service
zack moved T1879: Write documentation on compression Docker env from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service
zack moved T1889: graph API: add endpoint to return the leaves of a subgraph from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service
zack moved T1886: graph API: add endpoint to return the adjacency list of a node from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service
zack moved T1901: refactor graph API to use integer IDs in the kernel and translate to/from SWH PIDs on outer layers from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service
zack moved T1904: build developer documentation for swh-graph from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Documentation, Compressed graph service
zack moved T1920: graph service: add tests for the python client from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service
zack moved T1938: swh-graph: NullPointerException upon (wrong) /walk from cnt to snp from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service
zack moved T1945: Return timings instead of simply logging them from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service
zack moved T1952: Log raw datapoint in graph benchmarks from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service
zack moved T2053: support graph export for the cassandra backend from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service, Storage manager
zack moved T2072: common configuration file for swh graph rpc-serve, compress, … from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service
zack moved T2077: add random walk endpoint with limited retries from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service
zack moved T2084: swh-graph: add /last endpoint variants to the REST API from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Compressed graph service
zack moved T2114: swh-graph API: add ?limit=N method variants to return first N results from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:37 PM · Easy hack, Compressed graph service
zack moved T2589: expose swh-graph API at archive.s.o/api/1/graph/ from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:36 PM · System administration, Web app, Compressed graph service
zack moved T2900: Public graph/ API does not handle streaming results from endpoints from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:36 PM · System administration, Compressed graph service, Web app
zack moved T1862: Implement new graph API specifications from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:36 PM · Compressed graph service
zack moved T1902: Use in-memory bitmap to store node->types relations in graph API from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:36 PM · Compressed graph service
zack moved T1903: Add graph service README files from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:36 PM · Compressed graph service
zack moved T1915: Add support for origin nodes in graph service API from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:36 PM · Compressed graph service
zack moved T1867: compress Merkle DAG and origin nodes together from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:36 PM · Compressed graph service
zack moved T1921: swh-graph: add logging of endpoint timing from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:36 PM · Compressed graph service
zack moved T1922: swh-graph optimization: bypass edge restriction checks when edges=* from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:36 PM · Compressed graph service
zack moved T1939: Measure memory needs for a swh-graph Azure VM from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:36 PM · Compressed graph service
zack moved T1941: Automatically generate mapping files after compressing graph from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:36 PM · Compressed graph service
zack moved T1944: use a compact, binary format for node ids mapping files from Backlog to Deployed on the Compressed graph service board.
Dec 1 2021, 4:36 PM · Compressed graph service