Nov 25 2020

zack updated the summary of D4594: fuse: lookup: do not log ENOENT.
Nov 25 2020, 5:07 PM
zack accepted D4594: fuse: lookup: do not log ENOENT.
Nov 25 2020, 5:06 PM
zack committed rDTSCN65f0b8e4c6ea: honor HTTP(S)_PROXY environment variables, to support HTTP proxies (authored by zack).
Nov 25 2020, 4:42 PM
zack closed T2680: proxy support for swh scanner as Resolved by committing rDTSCN65f0b8e4c6ea: honor HTTP(S)_PROXY environment variables, to support HTTP proxies.
Nov 25 2020, 4:42 PM · Easy hack, Code scanner
zack committed rDTSCN3a7c40040879: CLI: add help message for "swh scanner db" command group (authored by zack).
Nov 25 2020, 4:24 PM
zack updated the task description for T2811: FUSE: fix various paper cuts (user testing 2020-11-24).
Nov 25 2020, 2:55 PM · Software Heritage filesystem
zack updated the task description for T2811: FUSE: fix various paper cuts (user testing 2020-11-24).
Nov 25 2020, 2:53 PM · Software Heritage filesystem
zack renamed T2811: FUSE: fix various paper cuts (user testing 2020-11-24) from FISE: fix various paper cuts (user testing 2020-11-24) to FUSE: fix various paper cuts (user testing 2020-11-24).
Nov 25 2020, 2:51 PM · Software Heritage filesystem
zack updated the task description for T2811: FUSE: fix various paper cuts (user testing 2020-11-24).
Nov 25 2020, 2:51 PM · Software Heritage filesystem
zack triaged T2811: FUSE: fix various paper cuts (user testing 2020-11-24) as Low priority.
Nov 25 2020, 2:50 PM · Software Heritage filesystem
zack updated the task description for T2793: add notable past events to the archive changelog.
Nov 25 2020, 2:00 PM · Archive coverage, Documentation
zack updated the task description for T2793: add notable past events to the archive changelog.
Nov 25 2020, 1:59 PM · Archive coverage, Documentation
zack updated the task description for T2793: add notable past events to the archive changelog.
Nov 25 2020, 1:58 PM · Archive coverage, Documentation
zack updated the task description for T2793: add notable past events to the archive changelog.
Nov 25 2020, 1:56 PM · Archive coverage, Documentation
zack changed the status of T2793: add notable past events to the archive changelog, a subtask of T2460: public journal of notable archiving policy changes, from Open to Work in Progress.
Nov 25 2020, 1:55 PM · General
zack changed the status of T2793: add notable past events to the archive changelog from Open to Work in Progress.
Nov 25 2020, 1:55 PM · Archive coverage, Documentation
zack committed rDDOC2af5c3ae9a30: archive journal: add first archival date, gitorious, google code (authored by zack).
Nov 25 2020, 1:52 PM
zack closed T617: ingest Google Code Subversion repositories as Resolved.
Nov 25 2020, 1:49 PM · Archive coverage, Origin-GoogleCode, SVN Loader
zack closed T617: ingest Google Code Subversion repositories, a subtask of T367: ingest Google Code repositories, as Resolved.
Nov 25 2020, 1:49 PM · Archive coverage, Restricted Project

Nov 24 2020

zack updated the title for P879 swh-fuse user testing session 2020-11-24 from Command-Line Input to swh-fuse user testing session 2020-11-24.
Nov 24 2020, 3:40 PM
zack created P879 swh-fuse user testing session 2020-11-24.
Nov 24 2020, 3:39 PM
zack accepted D4552: 'db serve' option to start the API service.

I'm accepting this, but I've also noted down a bunch of minor issues in the code. Please fix them before landing this.

Nov 24 2020, 1:38 PM

Nov 23 2020

zack committed rDDOC522a9f5cf6ed: journal: as of now, crawling as been restarted (authored by zack).
Nov 23 2020, 7:43 PM
zack triaged T2807: document swh.graph.graph module as Low priority.
Nov 23 2020, 1:23 PM · Documentation, Compressed graph service
zack committed rDTSCN502595f6f081: CLI: make "db import" CLI more strict and improve user messages (authored by zack).
Nov 23 2020, 10:47 AM
zack requested changes to D4552: 'db serve' option to start the API service.

Thanks, looks generally good to me!
I've pointed out only a few minor things here and there.

Nov 23 2020, 10:29 AM

Nov 21 2020

zack triaged T2803: FUSE history/by-{date,hash} views need an index to improve performances as High priority.
Nov 21 2020, 5:47 PM · Software Heritage filesystem
zack triaged T2802: FUSE: avoid logging normal conditions like ENOENT as Low priority.
Nov 21 2020, 1:13 PM · Software Heritage filesystem

Nov 20 2020

zack committed rDFUSE0fe1497da427: tutorial: improve type setting of shell logs (cosmetic) (authored by zack).
Nov 20 2020, 5:35 PM
zack committed rDFUSEf2fe7c57d55a: doc: add tutorial for end users (authored by zack).
Nov 20 2020, 5:35 PM
zack closed T2676: FUSE: write tutorial doc as Resolved by committing rDFUSEf2fe7c57d55a: doc: add tutorial for end users.
Nov 20 2020, 5:35 PM · Documentation, Software Heritage filesystem
zack closed D4296: doc: add tutorial for end users.
Nov 20 2020, 5:35 PM
zack updated the diff for D4296: doc: add tutorial for end users.
  • tutorial: improve type setting of shell logs (cosmetic)
Nov 20 2020, 5:29 PM
zack added a reviewer for D4296: doc: add tutorial for end users: seirl.
Nov 20 2020, 5:13 PM
zack retitled D4296: doc: add tutorial for end users from [WIP] doc: add tutorial for end users to doc: add tutorial for end users.
Nov 20 2020, 5:12 PM
zack updated the diff for D4296: doc: add tutorial for end users.

complete first full draft of the tutorial integrating content from the paper

Nov 20 2020, 5:11 PM
zack lowered the priority of T2710: swh-fuse: fails with "'TypeError: Cannot merge a <class 'dict'> with a <class 'NoneType'>" when conffile is empty or commented out from Normal to Low.
Nov 20 2020, 2:50 PM · Software Heritage filesystem
zack lowered the priority of T2775: Add top-level README to explain briefly archive/ and meta/ behavior from Normal to Low.
Nov 20 2020, 2:50 PM · Software Heritage filesystem
zack raised the priority of T2784: FUSE: add support for origin visits from Low to Normal.
Nov 20 2020, 2:49 PM · Software Heritage filesystem
zack moved T2785: FUSE design doc: dangling links to data model, SWHIDs, etc. from Backlog to Done on the Software Heritage filesystem board.
Nov 20 2020, 2:29 PM · Documentation, Software Heritage filesystem
zack committed rDFUSE07a41c5fad65: design doc: fix Sphinx/MyST markup to avoid broken cross-package links (authored by zack).
Nov 20 2020, 2:29 PM
zack closed T2785: FUSE design doc: dangling links to data model, SWHIDs, etc. as Resolved by committing rDFUSE07a41c5fad65: design doc: fix Sphinx/MyST markup to avoid broken cross-package links.
Nov 20 2020, 2:29 PM · Documentation, Software Heritage filesystem
zack committed rDFUSE10623b096b50: doc: uniform naming of SwhFS as "Software Heritage Filesystem" (authored by zack).
Nov 20 2020, 2:29 PM

Nov 19 2020

zack accepted D4508: scanner: 'db import' option to create local database with known swhids.

LGTM, I've only reported a minor typing issue.

Nov 19 2020, 9:12 PM
zack updated the summary of D4508: scanner: 'db import' option to create local database with known swhids.
Nov 19 2020, 9:08 PM
zack created P869 Command-Line Input.
Nov 19 2020, 11:30 AM

Nov 18 2020

zack requested changes to D4508: scanner: 'db import' option to create local database with known swhids.

Thanks, this looks great, and almost there.

Nov 18 2020, 8:26 PM
zack lowered the priority of T2795: FUSE: fix build failure when pytest try to run gen-api-data.py from Normal to Low.
Nov 18 2020, 6:12 PM · Software Heritage filesystem
zack triaged T2793: add notable past events to the archive changelog as Normal priority.
Nov 18 2020, 10:20 AM · Archive coverage, Documentation
zack committed rDDOC0eaddf704f7e: add journal of notable archival changes (authored by zack).
Nov 18 2020, 10:03 AM
zack closed T2460: public journal of notable archiving policy changes as Resolved by committing rDDOC0eaddf704f7e: add journal of notable archival changes.
Nov 18 2020, 10:03 AM · General
zack closed D4498: add journal of notable archival changes.
Nov 18 2020, 10:03 AM
zack added a revision to T2460: public journal of notable archiving policy changes: D4498: add journal of notable archival changes.
Nov 18 2020, 9:58 AM · General
zack created D4498: add journal of notable archival changes.
Nov 18 2020, 9:57 AM
zack changed the status of T2460: public journal of notable archiving policy changes from Open to Work in Progress.
Nov 18 2020, 9:46 AM · General

Nov 17 2020

zack updated subscribers of T2771: FUSE: rethink the visibility of files under archive/ and meta/, and possibly add a new cache/ entrypoint.
Nov 17 2020, 9:17 PM · Software Heritage filesystem
zack added a comment to T2771: FUSE: rethink the visibility of files under archive/ and meta/, and possibly add a new cache/ entrypoint.

It occurred to me that, if we accept that archive/ and meta/ will return nothing when ls'd, we're accepting a fundamental inconsistency for them: file entries in there exist but are not user-visible.
If we are OK with that, we can also go a bit further, and find what I think is a win-win middle ground between this task and T2694.

Nov 17 2020, 9:16 PM · Software Heritage filesystem
zack committed rDFUSE2b1c921089b8: arch diagram: show user control over user-space daemon (authored by zack).
Nov 17 2020, 3:40 PM
zack committed rDFUSE9be20693d083: arch diagram: separate user/tool (authored by zack).
Nov 17 2020, 3:29 PM
zack accepted D4492: logging: replace f-strings in logging calls.
Nov 17 2020, 2:46 PM
zack renamed T2788: deduplicate validation logic between parse_swhid() and SWHID class constructor from Improve swh.model.identifiers.parse_swhid and SWHID class to deduplicate validation logic between parse_swhid() and SWHID class constructor.
Nov 17 2020, 2:27 PM · Data Model
zack reopened T2269: cron spam: <root@*> find /var/log/kafka -type f -not -name *.gz -a -ctime +1 -exec gzip {} \+ as "Open".

I'm not 100% sure this is the same issue, but the symptoms are very similar. I'm still getting this logspam daily:

Nov 17 2020, 12:48 PM · System administration

Nov 16 2020

zack committed rDDOCc14674a148ce: conf.py: bump copyright year (authored by zack).
Nov 16 2020, 10:12 PM
zack updated the task description for T2784: FUSE: add support for origin visits.
Nov 16 2020, 9:56 PM · Software Heritage filesystem
zack triaged T2785: FUSE design doc: dangling links to data model, SWHIDs, etc. as Normal priority.
Nov 16 2020, 9:54 PM · Documentation, Software Heritage filesystem
zack lowered the priority of T2784: FUSE: add support for origin visits from Normal to Low.
Nov 16 2020, 9:50 PM · Software Heritage filesystem
zack triaged T2784: FUSE: add support for origin visits as Normal priority.
Nov 16 2020, 9:50 PM · Software Heritage filesystem
zack added a comment to T2782: add a "Filter Pull Requests" checkbox (or similar) in the Branches view of an origin in the web UI .

You're absolutely right that this is a different concern from T2459, because even once we fix that, we will not rewrite history.
On the other hand, the heuristic that the loader will use is probably related to what you want here: you'll want to filter out by default the same refs that the (new) loader will filter out, and have an option to unfilter them (mostly useful for old snapshots).
If we have that, will you still want the fine-grained filtering that you mentioned above? I'm inclined to think it would be overkill. YMMV.

Nov 16 2020, 7:27 PM · Web app
zack changed the status of T2694: FUSE: add sharding support for top-level dirs from Resolved to Wontfix.
Nov 16 2020, 3:15 PM · Software Heritage filesystem
zack added inline comments to D4476: fs: history: add by-page/ sharded directory.
Nov 16 2020, 11:39 AM

Nov 13 2020

zack created P864 Command-Line Input.
Nov 13 2020, 5:19 PM

Nov 12 2020

zack created P863 (An Untitled Masterwork).
Nov 12 2020, 1:16 PM
zack accepted D4458: model.identifiers: Improve error messages in case of invalid SWHIDs.

(only a couple of minor things noted above, but you can fix them before pushing)

Nov 12 2020, 11:09 AM
zack requested changes to D4458: model.identifiers: Improve error messages in case of invalid SWHIDs.
Nov 12 2020, 10:44 AM

Nov 11 2020

zack triaged T2773: FUSE: add history/by-date/ dir for revision objects as Normal priority.
Nov 11 2020, 3:23 PM · Software Heritage filesystem
zack triaged T2772: FUSE: add history/by-page/ dir for revision objects as Normal priority.
Nov 11 2020, 3:18 PM · Software Heritage filesystem
zack triaged T2771: FUSE: rethink the visibility of files under archive/ and meta/, and possibly add a new cache/ entrypoint as Low priority.
Nov 11 2020, 3:11 PM · Software Heritage filesystem

Nov 10 2020

zack triaged T2768: unbreak swh-graph CI as High priority.
Nov 10 2020, 2:23 PM · Continuous Integration, Compressed graph service
zack closed T2694: FUSE: add sharding support for top-level dirs as Wontfix.

We have decided to go for a more radical approach and make archive/ and meta/ not ls-able.
Separate task and update to the design document to follow.

Nov 10 2020, 12:48 PM · Software Heritage filesystem

Nov 9 2020

zack committed rDWCLIe117dc50e85f: client: bind /known API endpoint to verify for object presence (authored by zack).
Nov 9 2020, 1:16 PM
zack closed D4444: client: bind /known API endpoint to verify for object presence.
Nov 9 2020, 1:16 PM
zack updated the diff for D4444: client: bind /known API endpoint to verify for object presence.

rebase

Nov 9 2020, 1:15 PM

Nov 8 2020

zack created D4444: client: bind /known API endpoint to verify for object presence.
Nov 8 2020, 9:36 PM
zack triaged T2763: Web API: /known return false for existing release and snapshot SWHIDs as High priority.
Nov 8 2020, 8:16 PM · Web app
zack committed rDGRPH8b7c01d861c9: git2graph: fixo typo and wording of EXPRs in --help message (authored by zack).
Nov 8 2020, 6:10 PM
zack updated the task description for T735: SourceForge lister.
Nov 8 2020, 2:08 PM · Origin-SourceForge

Nov 7 2020

zack committed rDGRPH1d335f23e5bd: git2graph doc: document glib dependency (authored by zack).
Nov 7 2020, 4:19 PM

Nov 6 2020

zack resigned from D4416: fs: history: add by-hash/ sharded directory.
Nov 6 2020, 5:33 PM
zack updated the task description for T2760: swh-scanner: add support for local DB of known SWHIDs.
Nov 6 2020, 2:50 PM · Code scanner
zack triaged T2760: swh-scanner: add support for local DB of known SWHIDs as Normal priority.
Nov 6 2020, 2:32 PM · Code scanner

Nov 5 2020

zack requested changes to D4416: fs: history: add by-hash/ sharded directory.

LGTM, I've only noted down a couple of minor issues.

Nov 5 2020, 1:41 PM
zack created P854 Command-Line Input.
Nov 5 2020, 10:28 AM

Nov 4 2020

zack accepted D4412: templates/api: Add section about authentication in documentation.
Nov 4 2020, 7:31 PM
zack requested changes to D4412: templates/api: Add section about authentication in documentation.

Great stuff, thanks!

Nov 4 2020, 6:32 PM
zack accepted D4345: fuse: add cache on directories entries.
Nov 4 2020, 5:01 PM
zack updated the title for P850 rebuild a kernel Debian package with larger FUSE getdirplus() buffer from untitled to rebuild a kernel Debian package with larger FUSE getdirplus() buffer.
Nov 4 2020, 9:15 AM
zack added a comment to P850 rebuild a kernel Debian package with larger FUSE getdirplus() buffer.

patch fuse-increase-readdirplus-bufsiz.patch is in F4141491

Nov 4 2020, 9:15 AM
zack edited P850 rebuild a kernel Debian package with larger FUSE getdirplus() buffer.
Nov 4 2020, 9:15 AM
zack changed the visibility for F4141491: fuse-increase-readdirplus-bufsiz.patch.
Nov 4 2020, 9:09 AM

Nov 3 2020

zack created P850 rebuild a kernel Debian package with larger FUSE getdirplus() buffer.
Nov 3 2020, 8:08 PM