Page MenuHomeSoftware Heritage
Feed Advanced Search

Jul 1 2019

zack created T1870: revamp archive coverage page to list instances of mentioned listers.
Jul 1 2019, 6:11 PM · Web app
zack accepted D1614: Specification of extrinsic origin metadata and their storage..

LGTM. (Still waiting on @moranegg for the deposit URL example.)

Jul 1 2019, 6:03 PM
zack requested changes to D1614: Specification of extrinsic origin metadata and their storage..
Jul 1 2019, 5:46 PM
zack committed rMSLDfc23c55d3a16: last talk: fix date format (authored by zack).
last talk: fix date format
Jul 1 2019, 4:53 PM
zack committed rMSLDc7c93b6261ce: talk team digitale: gain some vspace on conclusion slide (authored by zack).
talk team digitale: gain some vspace on conclusion slide
Jul 1 2019, 10:12 AM

Jun 30 2019

zack committed rMSLD01521f7869f4: team digitale talk: add sample graph dataset queries to appendix (authored by zack).
team digitale talk: add sample graph dataset queries to appendix
Jun 30 2019, 9:12 PM
zack committed rMSLDfea8af7b6e20: move graphdataset inclusion out of webui module (authored by zack).
move graphdataset inclusion out of webui module
Jun 30 2019, 9:12 PM
zack committed rMSLDcebf3c1fa79c: graph dataset module: add sample queries (authored by zack).
graph dataset module: add sample queries
Jun 30 2019, 9:12 PM
zack committed rMSLDc04058f9f556: team digitale talk: add intro (authored by zack).
team digitale talk: add intro
Jun 30 2019, 3:02 PM
zack committed rMSLD5bd293798c21: archive coverage slide: add NPM logo (authored by zack).
archive coverage slide: add NPM logo
Jun 30 2019, 3:02 PM
zack committed rMSLD5735cbd26f84: check in slides for talk @ Team Digitale (authored by zack).
check in slides for talk @ Team Digitale
Jun 30 2019, 2:49 PM
zack committed rMSLD3dd62a1c08a4: web UI module: integrate sample slides (Apollo 11, Quake 3) (authored by zack).
web UI module: integrate sample slides (Apollo 11, Quake 3)
Jun 30 2019, 2:28 PM
zack added a subtask for T1868: refresh compressed representation of the archive: T1848: refresh graph dataset export.
Jun 30 2019, 1:58 PM · Compressed graph service
zack added a parent task for T1848: refresh graph dataset export: T1868: refresh compressed representation of the archive.
Jun 30 2019, 1:58 PM · Datasets
zack triaged T1868: refresh compressed representation of the archive as Low priority.
Jun 30 2019, 1:57 PM · Compressed graph service
zack added a subtask for T1867: compress Merkle DAG and origin nodes together: T1731: Intrinsic identifiers for origins.
Jun 30 2019, 1:56 PM · Compressed graph service
zack added a parent task for T1731: Intrinsic identifiers for origins: T1867: compress Merkle DAG and origin nodes together.
Jun 30 2019, 1:56 PM · Storage manager, Data Model
zack triaged T1867: compress Merkle DAG and origin nodes together as Normal priority.
Jun 30 2019, 1:56 PM · Compressed graph service
zack created T1867: compress Merkle DAG and origin nodes together.
Jun 30 2019, 1:56 PM · Compressed graph service
zack committed rMSLD627e4831a998: dataset: new module describing available datasets (authored by zack).
dataset: new module describing available datasets
Jun 30 2019, 11:34 AM

Jun 29 2019

zack committed rMSLD37194ad8648c: biblio module: update MSR 2019 entry (authored by zack).
biblio module: update MSR 2019 entry
Jun 29 2019, 7:11 PM
zack committed rMSLDd75bba7320bc: status extended module: make stats more round (authored by zack).
status extended module: make stats more round
Jun 29 2019, 6:21 PM
zack committed rMSLDbefd8ef80f5b: status extended: add anchors for software/hardware stack slides (authored by zack).
status extended: add anchors for software/hardware stack slides
Jun 29 2019, 6:21 PM

Jun 28 2019

zack added a comment to T1855: automatically check CONTRIBUTORS file for completeness.

in 288055f74683ae517770dc2f5a17a8b4bdaeff03 i've added a (Python!) script to check for CONTRIBUTORS completeness

Jun 28 2019, 10:16 AM · Continuous Integration
zack committed rDSNIP288055f74683: check-contributors: new script to check CONTRIBUTORS completeness (authored by zack).
check-contributors: new script to check CONTRIBUTORS completeness
Jun 28 2019, 10:16 AM
zack triaged T1855: automatically check CONTRIBUTORS file for completeness as Low priority.
Jun 28 2019, 9:54 AM · Continuous Integration
zack committed rDWAPPSe4488e76e7e8: CONTRIBUTORS: add Ishan Bhanuka (authored by zack).
CONTRIBUTORS: add Ishan Bhanuka
Jun 28 2019, 9:43 AM
zack committed rDSTOe73e88696d03: CONTRIBUTORS: add Ishan Bhanuka (authored by zack).
CONTRIBUTORS: add Ishan Bhanuka
Jun 28 2019, 9:43 AM
zack committed rDSCH09d9c84b3f91: CONTRIBUTORS: add Ishan Bhanuka (authored by zack).
CONTRIBUTORS: add Ishan Bhanuka
Jun 28 2019, 9:43 AM
zack committed rDMODd8f17f26fe83: CONTRIBUTORS: add Ishan Bhanuka (authored by zack).
CONTRIBUTORS: add Ishan Bhanuka
Jun 28 2019, 9:43 AM
zack committed rDLDSVN21165ac78d48: CONTRIBUTORS: add Ishan Bhanuka (authored by zack).
CONTRIBUTORS: add Ishan Bhanuka
Jun 28 2019, 9:43 AM
zack committed rDDEP48b3035003b0: CONTRIBUTORS: add Ishan Bhanuka (authored by zack).
CONTRIBUTORS: add Ishan Bhanuka
Jun 28 2019, 9:43 AM

Jun 27 2019

zack added a member for Speakers: ardumont.
Jun 27 2019, 11:09 AM
zack added a member for Speakers: douardda.
Jun 27 2019, 11:09 AM

Jun 26 2019

zack triaged T1852: URL glitch: double trailing slash needed to avoid 404 on deb package URLs as Low priority.
Jun 26 2019, 5:55 PM · Web app
zack committed rDGRPHa15c8f23e099: CONTRIBUTORS: initialize and add haltode (authored by zack).
CONTRIBUTORS: initialize and add haltode
Jun 26 2019, 2:28 PM
D1625: api: docs: new graph API (path/ and explore/ endpoints) is now accepted and ready to land.
Jun 26 2019, 1:12 PM
zack committed rMSLD7934bf6617f0: drop roberto email from common, it should be in slide decks (authored by zack).
drop roberto email from common, it should be in slide decks
Jun 26 2019, 12:33 PM
zack retitled D1534: Add blake2s256 hash to the output of directory_ls from Add blake2s256 hash in the output of directory_ls. to Add blake2s256 hash to the output of directory_ls.
Jun 26 2019, 12:28 PM
zack accepted D1625: api: docs: new graph API (path/ and explore/ endpoints).
Jun 26 2019, 12:03 PM

Jun 25 2019

zack committed rMSLD4dbfcff82871: check in R logo (authored by zack).
check in R logo
Jun 25 2019, 10:33 PM
zack requested changes to D1625: api: docs: new graph API (path/ and explore/ endpoints).

Looks great ! I've noted down a bunch of requested changes, but they really only about style. Nothing that will get in the way of actually implementing any of this.

Jun 25 2019, 7:53 PM

Jun 23 2019

zack added a subtask for T1848: refresh graph dataset export: T1741: graph dataset: update to use persistent identifiers everywhere.
Jun 23 2019, 10:23 PM · Datasets
zack added a parent task for T1741: graph dataset: update to use persistent identifiers everywhere: T1848: refresh graph dataset export.
Jun 23 2019, 10:23 PM · Datasets
zack triaged T1848: refresh graph dataset export as Low priority.
Jun 23 2019, 10:22 PM · Datasets
zack added a parent task for T1847: fully automate export of the graph dataset: T1848: refresh graph dataset export.
Jun 23 2019, 10:22 PM · Compressed graph service, Datasets
zack added a subtask for T1848: refresh graph dataset export: T1847: fully automate export of the graph dataset.
Jun 23 2019, 10:22 PM · Datasets
zack created T1848: refresh graph dataset export.
Jun 23 2019, 10:21 PM · Datasets
zack triaged T1847: fully automate export of the graph dataset as High priority.
Jun 23 2019, 10:20 PM · Compressed graph service, Datasets
zack created T1847: fully automate export of the graph dataset.
Jun 23 2019, 10:20 PM · Compressed graph service, Datasets
zack triaged T1846: exclude swh-py-template from CI build (or make it build) as Low priority.
Jun 23 2019, 6:33 PM · Continuous Integration
zack added inline comments to D1625: api: docs: new graph API (path/ and explore/ endpoints).
Jun 23 2019, 4:57 PM
zack requested changes to D1625: api: docs: new graph API (path/ and explore/ endpoints).
Jun 23 2019, 4:39 PM
zack added a reviewer for D1625: api: docs: new graph API (path/ and explore/ endpoints): seirl.

Thanks for this updated version, we're definitely refining various use cases here.

Jun 23 2019, 4:38 PM
zack resigned from D1623: Add origin_metadata_get API endpoint.
Jun 23 2019, 3:32 PM
zack requested changes to D1614: Specification of extrinsic origin metadata and their storage..

looks great !

Jun 23 2019, 3:31 PM

Jun 22 2019

zack updated the task description for T1805: Public API v2.
Jun 22 2019, 11:21 PM · meta-task, Web app
zack committed rDTPLa01aaf26742b: add (empty) CONTRIBUTORS file to note down (future) contributor names (authored by zack).
add (empty) CONTRIBUTORS file to note down (future) contributor names
Jun 22 2019, 11:20 PM

Jun 21 2019

zack updated the task description for T1805: Public API v2.
Jun 21 2019, 5:41 PM · meta-task, Web app
zack triaged T1844: make archive.s.o point to the Azure-hosted webapp as Low priority.
Jun 21 2019, 3:58 PM · System administration
zack renamed T1843: make sure front-end services work when the Inria infra is down from make sure front-end services work when the Inria inra is down to make sure front-end services work when the Inria infra is down.
Jun 21 2019, 2:25 PM · System administration
zack triaged T1843: make sure front-end services work when the Inria infra is down as Normal priority.
Jun 21 2019, 2:25 PM · System administration
zack added a reviewer for D1625: api: docs: new graph API (path/ and explore/ endpoints): zack.
Jun 21 2019, 11:58 AM

Jun 20 2019

zack added a comment to T1839: Write glossary/taxonomy for push archival process and mechanism.

I second the need of properly defining terms related to push archival.

Jun 20 2019, 5:27 PM · Community Building, Documentation

Jun 19 2019

zack resigned from D1610: swh.lister.cgit.
Jun 19 2019, 11:44 PM
zack accepted D1590: Add comments to few columns in dbversion, task and task_run.
Jun 19 2019, 5:56 PM
zack accepted D1582: Add comments to tables dbversion, content, skipped_content and fetch_history.
Jun 19 2019, 5:55 PM
zack requested changes to D1590: Add comments to few columns in dbversion, task and task_run.

minor caseness issue

Jun 19 2019, 3:53 PM
zack requested changes to D1582: Add comments to tables dbversion, content, skipped_content and fetch_history.

Almost there !
(and thanks a lot for your persistence on this one)

Jun 19 2019, 3:52 PM
zack accepted D1611: pathslicing: Make sure data is flushed to disk before renaming the tempfile.
Jun 19 2019, 3:44 PM
zack added a comment to D1611: pathslicing: Make sure data is flushed to disk before renaming the tempfile.

LGTM.

Jun 19 2019, 3:44 PM
zack requested changes to D1611: pathslicing: Make sure data is flushed to disk before renaming the tempfile.
Jun 19 2019, 2:44 PM
zack requested changes to D1610: swh.lister.cgit.
Jun 19 2019, 2:40 PM
zack added a comment to T1738: Define and specify extrinsic origin metadata.

Thanks a lot for this recap Morane !

Jun 19 2019, 2:33 PM · Metadata workflow
zack added a comment to T1832: create a mailing list swh-user-announce.

There are two separate use cases here, so I'll comment on them separately:

Jun 19 2019, 12:23 PM · SWORD deposit

Jun 18 2019

zack triaged T1823: make DB/FS transactions nest properly as High priority.
Jun 18 2019, 12:38 PM · Object storage, Storage manager

Jun 17 2019

zack added a comment to T1659: rewrite the CGit lister as a proper lister.

Thanks for your interest in working on this @nahimilega , it would be very useful to move forward on a bunch of pending ingestions, including Tor !

Jun 17 2019, 10:01 PM · CGit lister
zack closed T239: preserve at least 2 copies of each content object as Resolved.

resolved (by T691)

Jun 17 2019, 4:45 PM · General
zack added a comment to T691: complete object storage mirror on Azure (meta task).
In T691#33551, @olasd wrote:

After processing the logs of the backfilling process to make sure to redo all the ranges that were interrupted in various database migrations, I'm now confident that this task is complete: we have a full mirror of all contents on Azure, which is kept up to date by the main archive storage backend writing synchronously to it.

Jun 17 2019, 4:45 PM · General
zack added a comment to T1815: Use a FOSS alternative or drop Google ReCAPTCHA use.

Getting rid of ReCaptcha for save code now LGTM too.
I just wasn't sure that rate limit applies to Web UI submissions (e.g., will API requests come from our own IP? and if so, is that whitelisted?); I'm assuming that is what @anlambert plans to check.

Jun 17 2019, 4:34 PM · Web app

Jun 14 2019

zack requested changes to D1582: Add comments to tables dbversion, content, skipped_content and fetch_history.
Jun 14 2019, 3:28 PM
zack added a comment to T1789: batch API to check for the presence of content in the archive.

Can we have the feature which will return the content of File Type, Language Type, and License not its URL

Jun 14 2019, 1:13 PM · Web app
zack closed T1804: Software Heritage api to accept batch request from FOSSology as Invalid.

Hi @sandipbhuyan , I had in fact already created a task for this, it's: T1789

Jun 14 2019, 12:06 PM

Jun 13 2019

zack updated the task description for T1801: List all origins from major phabricator instances.
Jun 13 2019, 10:09 AM · Lister
zack renamed T1801: List all origins from major phabricator instances from List major phabricator instances to list all origins from major phabricator instances.
Jun 13 2019, 10:08 AM · Lister

Jun 12 2019

zack added a comment to T1799: ingest Tor git repositories.

btw, the list is ~400 repos for now

Jun 12 2019, 11:11 PM · Archive coverage
zack added a comment to T1799: ingest Tor git repositories.

@anarcat please hold off from using save code now for now. As we're planning to have a proper cgit lister, we can just add your instance to your rotation once that's done (unless this is super urgent, that is). That will have the additional advantage that we will automatically notice when new repos show up.

Jun 12 2019, 11:10 PM · Archive coverage
zack added a comment to T1800: gitweb lister.

It's not really related, because gitweb and cgit are two different things.

Jun 12 2019, 6:28 PM · Lister
zack triaged T1800: gitweb lister as Normal priority.
Jun 12 2019, 5:21 PM · Lister
zack triaged T1799: ingest Tor git repositories as Normal priority.
Jun 12 2019, 5:20 PM · Archive coverage
zack triaged T1798: ingest Tor project source code (meta task) as Normal priority.
Jun 12 2019, 5:19 PM · Archive coverage
zack created T1798: ingest Tor project source code (meta task).
Jun 12 2019, 5:19 PM · Archive coverage
zack added a comment to T1389: Implement a base "package" loader for package managers.

Thanks @olasd, @ardumont, and @anlambert for this, it's a great plan and I like it a lot !

Jun 12 2019, 1:58 PM · Origin-Debian, Origin-CRAN, Origin-GNU, Origin-npm, Origin-Pypi, Archive coverage
zack requested changes to D1509: Write a specification of extrinsic origin metadata storage..

Thanks @vlorentz for this first draft. In spite of all the comments above, I think it's a very good start.

Jun 12 2019, 1:31 PM
zack added a comment to T1411: reach a minimum of 80% SLOC coverage across all components.

The most recent update of the state of this task has shown a regression in the journal test coverage, which, per se, is not a big deal (just a few points). But it does raise the question of how, once we have attained whatever "minimum" coverage we are OK with, we monitor overtime that there is no regression. For instance, I think that code reviews should show to the reviewers how the submitted diff affects code coverage. Ideally, reviewers should be able to so if it has a net positive or negative effect on coverage, and take that into account in their review decisions. (Which is not to say we should never accept diffs that decrease code coverage—there might be reasons to do so. But it is a data point that would be useful for reviewers to see.)

Jun 12 2019, 12:25 PM · Development environment, Sprint 2018 12
zack updated the task description for T1411: reach a minimum of 80% SLOC coverage across all components.
Jun 12 2019, 12:23 PM · Development environment, Sprint 2018 12

Jun 7 2019

zack updated the task description for T735: SourceForge lister.
Jun 7 2019, 9:16 PM · Origin-SourceForge
zack triaged T1791: Web API: do not leak internal, non-intrinsic origin identifiers as Low priority.
Jun 7 2019, 3:38 PM · Web app
zack changed the visibility for F3533133: haltode.pub.
Jun 7 2019, 10:50 AM
zack triaged T1789: batch API to check for the presence of content in the archive as Normal priority.
Jun 7 2019, 10:44 AM · Web app