
May 25 2018

zack moved T1060: More graceful web app behavior when the indexer storage is unavailable from Restricted Project Column to Restricted Project Column on the Restricted Project board.
May 25 2018, 1:01 PM · Restricted Project, Web app

May 24 2018

zack moved T1057: website: remove language chooser from top navigation bar from Restricted Project Column to Restricted Project Column on the Restricted Project board.
May 24 2018, 2:47 PM · Restricted Project, Website
zack moved T1061: SWHRemoteAPI: Enable to set timeout on the sent requests from Restricted Project Column to Restricted Project Column on the Restricted Project board.
May 24 2018, 2:46 PM · Restricted Project, Core & foundations
zack triaged T1070: can no longer reply to bugs via email — Error Processing Mail (Empty Mail) as High priority.
May 24 2018, 2:13 PM · Phabricator, System administration
zack moved T1069: fully host the web UI on Azure from Restricted Project Column to Restricted Project Column on the Restricted Project board.
May 24 2018, 10:40 AM · Web app, System administration
zack triaged T1069: fully host the web UI on Azure as High priority.
May 24 2018, 10:40 AM · Web app, System administration
zack moved T1067: add donate button to archive.s.o from Restricted Project Column to Restricted Project Column on the Restricted Project board.
May 24 2018, 10:38 AM · Restricted Project, Web app
zack added a parent task for T1065: define a common top-level navigation across different websites: T1067: add donate button to archive.s.o.
May 24 2018, 10:35 AM · Web app, Restricted Project
zack added a subtask for T1067: add donate button to archive.s.o: T1065: define a common top-level navigation across different websites.
May 24 2018, 10:35 AM · Restricted Project, Web app
zack triaged T1067: add donate button to archive.s.o as Normal priority.
May 24 2018, 10:35 AM · Restricted Project, Web app
zack moved T1057: website: remove language chooser from top navigation bar from Restricted Project Column to Restricted Project Column on the Restricted Project board.
May 24 2018, 10:30 AM · Restricted Project, Website
zack moved T1060: More graceful web app behavior when the indexer storage is unavailable from Restricted Project Column to Restricted Project Column on the Restricted Project board.
May 24 2018, 10:30 AM · Restricted Project, Web app
zack moved T1064: link from archive.s.o back to www.s.o more prominently from Restricted Project Column to Restricted Project Column on the Restricted Project board.
May 24 2018, 10:30 AM · Web app, Restricted Project
zack moved T1065: define a common top-level navigation across different websites from Restricted Project Column to Restricted Project Column on the Restricted Project board.
May 24 2018, 10:30 AM · Web app, Restricted Project
zack moved T1066: wither help page of archive.s.o from Restricted Project Column to Restricted Project Column on the Restricted Project board.
May 24 2018, 10:30 AM · Documentation, Web app, Restricted Project
zack triaged T1066: wither help page of archive.s.o as Low priority.
May 24 2018, 8:59 AM · Documentation, Web app, Restricted Project
zack added a comment to T1065: define a common top-level navigation across different websites.

an example of this which I quite like is Ubuntu's, see, e.g.: https://www.ubuntu.com/ , https://developer.ubuntu.com/ , https://blog.ubuntu.com/ , etc.

May 24 2018, 8:56 AM · Web app, Restricted Project
zack triaged T1065: define a common top-level navigation across different websites as High priority.
May 24 2018, 8:55 AM · Web app, Restricted Project
zack triaged T1064: link from archive.s.o back to www.s.o more prominently as Normal priority.
May 24 2018, 8:53 AM · Web app, Restricted Project

May 22 2018

zack triaged T1057: website: remove language chooser from top navigation bar as Low priority.
May 22 2018, 1:43 PM · Restricted Project, Website
zack closed T973: Origin visits API: should sort by visit id as Invalid.

@olasd was right with the duplicate mention wrt T1050. I'm keeping that one instead of this one, as the problem statement is clearer there.
I've no specific idea/preference of where this should be implemented (storage v. webui)

May 22 2018, 1:41 PM · Web app
zack updated the task description for T1050: Web API: return visits most-recent-first and document the order.
May 22 2018, 1:40 PM · Easy hack, Web app
zack triaged T1007: Monitor nfs mount points on orangerie.internal.softwareheritage.org as Normal priority.
May 22 2018, 1:09 PM · System administration
zack added a project to T1007: Monitor nfs mount points on orangerie.internal.softwareheritage.org: System administration.
May 22 2018, 1:09 PM · System administration

May 18 2018

zack added a comment to T1040: identifiers: support optional contextual parts for line numbers and origin.

So I think the best option here is to use named parameters as optional parts in the identifiers. This will give us some flexibility for adding new ones in the future. Regarding the separator, we could use either \ or |, as they should not interfere with the origin URLs to extract.
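A minimal sketch of what parsing such an identifier could look like, assuming | as the separator and name=value pairs for the optional parts. The parameter names (lines, origin) and the function name are hypothetical, chosen only for illustration; the comment above does not fix any of them.

```python
def parse_identifier(text, sep="|"):
    """Split an identifier into its core part and optional named parameters.

    Hypothetical syntax: <core><sep>name=value<sep>name=value...
    """
    core, *extras = text.split(sep)
    params = {}
    for extra in extras:
        # Split on the first "=" only, so values (e.g. URLs with a query
        # string) may themselves contain "=".
        name, _, value = extra.partition("=")
        params[name] = value
    return core, params


core, params = parse_identifier(
    "swh:1:cnt:94a9ed024d3859793618152ea559a168bbcbb5e2"
    "|lines=9-15|origin=https://example.org/repo.git"
)
print(core)
print(params)
```

Because the parameters are named rather than positional, an identifier without any optional part parses to the same core with an empty dictionary, and new parameter names can be added later without changing the parser.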

May 18 2018, 2:32 PM · Restricted Project, General

May 17 2018

zack added a comment to T1040: identifiers: support optional contextual parts for line numbers and origin.

the problems I see with optional URL parameters instead of modifying the identifiers themselves are the following:

May 17 2018, 9:06 PM · Restricted Project, General

May 16 2018

zack committed rMSLD159557f06b13: check-in slides for HACKarnaval! 2018 (authored by zack).
check-in slides for HACKarnaval! 2018
May 16 2018, 12:56 PM

May 4 2018

zack triaged T1050: Web API: return visits most-recent-first and document the order as Low priority.
May 4 2018, 9:39 PM · Easy hack, Web app
zack added a comment to T1049: calendar view usability: the link to the actual snapshot is too hard to find.

So, how about just redirecting systematically to the browsing of the most recent snapshot available, and have from there a link (under actions?) that points to the visit calendar? Isn't that what most users would want anyway? I.e., aren't we optimizing for the wrong use case currently?

May 4 2018, 9:48 AM · Web app
zack triaged T1049: calendar view usability: the link to the actual snapshot is too hard to find as High priority.
May 4 2018, 9:33 AM · Web app

May 3 2018

zack added a comment to T1048: Clean striped object storages from objects they should not be containing.

FWIW, the proposed policy looks good to me. Green light!

May 3 2018, 4:57 PM · Object storage

Apr 28 2018

zack updated the task description for T1042: support optional/contextual parts of persistent identifiers in the web app resolver.
Apr 28 2018, 3:31 PM · Web app
zack triaged T1042: support optional/contextual parts of persistent identifiers in the web app resolver as Normal priority.
Apr 28 2018, 3:30 PM · Web app
zack triaged T1041: document contextual parts of persistent identifiers as High priority.
Apr 28 2018, 3:29 PM · Restricted Project, Documentation
zack triaged T1040: identifiers: support optional contextual parts for line numbers and origin as Normal priority.
Apr 28 2018, 3:29 PM · Restricted Project, General
zack raised the priority of T1039: add swh-model CLI front-end to compute persistent identifiers from Normal to High.
Apr 28 2018, 3:24 PM · Data Model

Apr 27 2018

zack closed T509: Generate and publish Software Heritage Development Documentation as Resolved.

This is now done as a generic goal. Bits and pieces of the doc are still in progress, but the generic doc infrastructure is now in place.

Apr 27 2018, 8:09 PM · Documentation
zack triaged T1039: add swh-model CLI front-end to compute persistent identifiers as Normal priority.
Apr 27 2018, 8:08 PM · Data Model

Apr 20 2018

zack added a comment to T883: set up a replica of the main DB on azure.

can you consolidate the bits of docs somewhere on the intranet? they'll be easier to find than on a task in the future

Apr 20 2018, 11:44 AM · Unknown Object (Project), System administration

Apr 19 2018

zack committed rMSLDb76a2aa8fede: LLW 2018 lightning talk: update slides (authored by zack).
LLW 2018 lightning talk: update slides
Apr 19 2018, 5:45 PM

Apr 18 2018

zack added a comment to T782: Web API: make endpoints that expose extracted metadata return *lists* of factual information.
In T782#18978, @s wrote:

Can I:

  • Merge and push the changes in D301 to swh-indexer master branch?
  • Merge and push the changes in D302 to swh-web master branch?
Apr 18 2018, 2:31 PM · Indexer, Easy hack, Web app

Apr 6 2018

zack added a comment to T1003: Register a SWH uri scheme with IANA.

the procedure for the registration is here: https://tools.ietf.org/html/rfc7595

Apr 6 2018, 11:22 AM · Data Model
zack triaged T1015: document registration of swh: identifiers on identifiers.org as Normal priority.
Apr 6 2018, 11:17 AM · Documentation

Mar 27 2018

zack added a comment to T1002: ingest Hackage, the Haskell package repository (meta task).

relevant highlights:

Mar 27 2018, 6:02 PM · Hackage loader, Hackage lister, Archive coverage
zack renamed T1002: ingest Hackage, the Haskell package repository (meta task) from ingest Hackage into the Software Heritage archive (meta task) to ingest Hackage (Haskell package repository) into the Software Heritage archive (meta task).
Mar 27 2018, 6:01 PM · Hackage loader, Hackage lister, Archive coverage
zack added a project to T1004: Web-app: Provide a reference box with the stable SWH-ID URL : Web app.
Mar 27 2018, 5:59 PM · Restricted Project, Web app
zack added a project to T1005: webapp: Push logs to elasticsearch cluster: System administration.
Mar 27 2018, 2:17 PM · System administration, Web app
zack added a comment to T1005: webapp: Push logs to elasticsearch cluster.

It should be the other way around: setting up the elastic cluster does not need the web app to push its logs to the cluster in order to be completed

Mar 27 2018, 12:04 PM · System administration, Web app

Mar 25 2018

zack added a comment to T1002: ingest Hackage, the Haskell package repository (meta task).

update from joeyh, there is no need for any specific hack to maintain a local mirror, it is just an undocumented feature:

Mar 25 2018, 3:47 PM · Hackage loader, Hackage lister, Archive coverage

Mar 24 2018

zack triaged T1002: ingest Hackage, the Haskell package repository (meta task) as Normal priority.
Mar 24 2018, 10:26 PM · Hackage loader, Hackage lister, Archive coverage
zack committed rMSLDb5c75ba1d0de: librepanet 2018: add DAG slide (authored by zack).
librepanet 2018: add DAG slide
Mar 24 2018, 12:02 PM

Mar 23 2018

zack committed rMSLD9e1e11b13f2b: libreplanet 2018 talk: last touches (authored by zack).
libreplanet 2018 talk: last touches
Mar 23 2018, 9:07 PM
zack committed rMSLD121480336558: check-in slides for LibrePlanet 2018 talk (authored by zack).
check-in slides for LibrePlanet 2018 talk
Mar 23 2018, 8:54 PM
zack committed rMSLD611eb8fd5bb6: webui: new module with slides about the Web UI preview (authored by zack).
webui: new module with slides about the Web UI preview
Mar 23 2018, 8:54 PM
zack committed rMSLDbebd380e370f: Merge branch 'master' of ssh://forge.softwareheritage.org/diffusion/64/slides (authored by zack).
Merge branch 'master' of ssh://forge.softwareheritage.org/diffusion/64/slides
Mar 23 2018, 8:03 PM
zack committed rMSLDf80fca7480d9: makefile: make sure *.vrb files are cleaned up (authored by zack).
makefile: make sure *.vrb files are cleaned up
Mar 23 2018, 8:03 PM
zack committed rMSLDd242c1f1d485: status module: minor updates (authored by zack).
status module: minor updates
Mar 23 2018, 8:03 PM

Mar 12 2018

zack committed rDENV14ed60b98de8: Makefile.python: also check code for typos with codespell (authored by zack).
Makefile.python: also check code for typos with codespell
Mar 12 2018, 12:14 PM
zack committed rDLS4f2361ab0e02: fix typo in docstrings/comments (tnx codespell) (authored by zack).
fix typo in docstrings/comments (tnx codespell)
Mar 12 2018, 12:02 PM
zack committed rDWAPPS2ed370ca0e35: fix typo in docstrings/comments (tnx codespell) (authored by zack).
fix typo in docstrings/comments (tnx codespell)
Mar 12 2018, 12:02 PM
zack committed rDSTOb4147ec75396: fix typo in docstrings/comments (tnx codespell) (authored by zack).
fix typo in docstrings/comments (tnx codespell)
Mar 12 2018, 12:01 PM
zack committed rDOBJS6a917b1b5956: fix typo in docstrings/comments (tnx codespell) (authored by zack).
fix typo in docstrings/comments (tnx codespell)
Mar 12 2018, 12:01 PM
zack committed rDLDSVN75f445e877cc: fix typo in docstrings/comments (tnx codespell) (authored by zack).
fix typo in docstrings/comments (tnx codespell)
Mar 12 2018, 12:01 PM
zack committed rDLDHG5e386c6d6e1b: fix typo in docstrings/comments (tnx codespell) (authored by zack).
fix typo in docstrings/comments (tnx codespell)
Mar 12 2018, 12:01 PM
zack committed rDDOC9d3c7e097a8e: docs/.gitgnore: do not ignore swh-* in subdir (authored by zack).
docs/.gitgnore: do not ignore swh-* in subdir
Mar 12 2018, 12:00 PM

Mar 9 2018

zack updated subscribers of T782: Web API: make endpoints that expose extracted metadata return *lists* of factual information.

Hi s!, and thanks for your interest in helping us out!

Mar 9 2018, 5:22 PM · Indexer, Easy hack, Web app
zack renamed T989: Implement GitLab lister from GitLab.com lister to GitLab lister.
Mar 9 2018, 4:49 PM · Origin-GitLab
zack added a project to T989: Implement GitLab lister: Origin-GitLab.
Mar 9 2018, 4:49 PM · Origin-GitLab
zack created Origin-GitLab.
Mar 9 2018, 4:49 PM
zack raised the priority of T989: Implement GitLab lister from Wishlist to Normal.

Heya, thanks for the bug report!

Mar 9 2018, 4:48 PM · Origin-GitLab

Mar 3 2018

zack assigned T981: Replace origin_id with origin_url in URI documentation to anlambert.
Mar 3 2018, 3:35 PM · Web app, Documentation

Mar 2 2018

zack added a comment to T980: Identify and fix releases that are stored as revisions.

Thanks for the report. I haven't looked into this specific case, so it's indeed possible it's a bug, but in the general case this is potentially normal behavior.
Branches can point to either releases or revisions (or, in fact, anything at all).
In the Git case, which is where your case seems to come from, if one simply does a "git tag", that would create a ref pointing to a revision; whereas if one does "git tag -a" (annotated tag), that would create a release object (pointing to a revision) and a ref pointing to the release object. So an author who switched from using "git tag" to using "git tag -a" would explain what you have seen.
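The two cases can be illustrated with a toy model (a sketch only: the class and function names below are made up for this example and are not the swh.model API). A lightweight tag maps the ref directly to a revision, while an annotated tag maps it to a release object that in turn points to the revision.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Revision:
    id: str


@dataclass(frozen=True)
class Release:
    name: str
    target: Revision


def lightweight_tag(name, revision):
    # Like "git tag <name>": the ref points straight at the revision.
    return {f"refs/tags/{name}": revision}


def annotated_tag(name, revision):
    # Like "git tag -a <name>": the ref points at a release object,
    # which points at the revision.
    return {f"refs/tags/{name}": Release(name, revision)}


def resolve(target):
    """Return the revision a branch target ultimately designates."""
    return target.target if isinstance(target, Release) else target


branches = {}
branches.update(lightweight_tag("v1.0", Revision("aaa111")))
branches.update(annotated_tag("v2.0", Revision("bbb222")))
for ref, target in sorted(branches.items()):
    print(ref, type(target).__name__, resolve(target).id)
```

In this model the same repository ends up with one tag stored as a revision and another stored as a release, which is the mixed situation described above.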

Mar 2 2018, 5:53 PM · Archive content
zack triaged T980: Identify and fix releases that are stored as revisions as Normal priority.
Mar 2 2018, 5:49 PM · Archive content

Feb 23 2018

zack changed the status of T787: abstract data model (Merkle DAG) documentation from Open to Work in Progress.

We now have a preliminary version of this at https://docs.softwareheritage.org/devel/swh-model/data-model.html#data-model .
We still lack prose description of the diagram though.

Feb 23 2018, 1:35 PM · Documentation
zack changed the status of T787: abstract data model (Merkle DAG) documentation, a subtask of T509: Generate and publish Software Heritage Development Documentation, from Open to Work in Progress.
Feb 23 2018, 1:35 PM · Documentation
zack edited Description on Web app.
Feb 23 2018, 12:09 PM
zack edited Description on Web app.
Feb 23 2018, 12:07 PM
zack edited Description on Web app.
Feb 23 2018, 12:04 PM
zack edited Description on Web app.
Feb 23 2018, 12:03 PM
zack edited Description on Web app.
Feb 23 2018, 12:03 PM
zack removed hashtags from Web app: #web_user_interface, #swh-web-ui, #web-ui, #webapp.
Feb 23 2018, 12:02 PM
zack edited Description on Web app.
Feb 23 2018, 12:02 PM

Feb 22 2018

zack committed rDSNIP3d51335f7686: swh-dedup: fix db name in driver shell script (authored by zack).
swh-dedup: fix db name in driver shell script
Feb 22 2018, 12:21 PM
zack committed rDSNIPd43f2eafcff5: add SQL snippets to dump (meaningful parts of) the origin table (authored by zack).
add SQL snippets to dump (meaningful parts of) the origin table
Feb 22 2018, 12:21 PM

Feb 20 2018

zack added a comment to T975: Write queries for the crossminer dataset and estimate cost.
In T975#18129, @olasd wrote:

I'm still not entirely convinced that this work should be done through raw SQL queries, but rather using the higher level Python APIs. This depends on the following considerations:

  • how often do you need to update the data, and for how long?
  • how many different parameters do you need to have? (pom.xml/build.gradle/...)
  • how often will the parameters change?
Feb 20 2018, 7:48 PM · Metadata workflow

Feb 14 2018

zack triaged T945: Separate system logs from application logs as Normal priority.
Feb 14 2018, 11:43 AM · System administration

Feb 9 2018

zack committed rDMODd1b975ae1c69: docs: add snapshot, branches, and visits to Merkle DAG diagram (authored by zack).
docs: add snapshot, branches, and visits to Merkle DAG diagram
Feb 9 2018, 4:50 PM
zack committed rDMOD6fe1595337b3: docs: generate PDF Merkle DAG diagram by default (authored by zack).
docs: generate PDF Merkle DAG diagram by default
Feb 9 2018, 4:50 PM
zack committed rXHLWRLD1a99a56b41e5: Merge branch 'doc' (authored by zack).
Merge branch 'doc'
Feb 9 2018, 3:52 PM
zack committed rXHLWRLD3d515253caea: README: add homepage link (authored by zack).
README: add homepage link
Feb 9 2018, 3:52 PM
zack committed rDMOD96114f66fbdb: docs: add preliminary version of the Merkle DAG diagram (authored by zack).
docs: add preliminary version of the Merkle DAG diagram
Feb 9 2018, 2:32 PM
zack updated subscribers of D295: Implement efficient comparison of revision trees.

Regarding the placement of the code, the (ideal) data structure you need for this is a lazy version of the DAG, because you need to only fetch the nodes that are different, and the only module that can do that for you is indeed swh.storage.
So I'm fine having this in swh.storage. I'm not sure I like "utils" (which sounds low-level), maybe "operations" or "algo" (which sound higher-level, like this functionality actually is) would be more appropriate here? I've no strong opinion either...
Either way, I'd like to be consistent with what @seirl has done for toposort in swh.model. So whatever name we pick here, toposort in swh.model should go under swh.model.THATNAME.toposort .
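To make the "lazy" point concrete, here is a rough sketch of comparing two directory trees in a Merkle DAG while fetching only the nodes that differ. It is not the code from D295: the fetch callback, the in-memory store, and the node ids are all assumptions made for this example. Subtrees with equal ids are skipped without being fetched.

```python
def changed_paths(fetch, id_a, id_b, prefix=""):
    """List paths that differ between two trees, fetching nodes lazily.

    fetch(node_id) is assumed to return a {name: child_id} dict for a
    directory and None for anything else (e.g. a file).
    """
    if id_a == id_b:
        return []  # same id means same subtree: nothing to fetch
    dir_a = fetch(id_a) if id_a is not None else None
    dir_b = fetch(id_b) if id_b is not None else None
    if dir_a is None or dir_b is None:
        # At least one side is a file or is missing: report the path
        # itself without descending further.
        return [prefix]
    changed = []
    for name in sorted(set(dir_a) | set(dir_b)):
        changed += changed_paths(
            fetch, dir_a.get(name), dir_b.get(name), f"{prefix}/{name}"
        )
    return changed


# Toy in-memory store standing in for the storage backend.
store = {
    "root1": {"src": "src1", "docs": "docs1"},
    "root2": {"src": "src2", "docs": "docs1"},
    "src1": {"a.py": "blob-a1", "b.py": "blob-b"},
    "src2": {"a.py": "blob-a2", "b.py": "blob-b"},
    "docs1": {"index.rst": "blob-i"},
}
fetched = []


def fetch(node_id):
    fetched.append(node_id)
    return store.get(node_id)


print(changed_paths(fetch, "root1", "root2"))
print("docs1" in fetched)
```

The unchanged docs directory is never fetched here, which is why the node lookups need to go through whichever module can retrieve nodes on demand.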

Feb 9 2018, 2:14 PM
zack committed rDENVe232f9b85ee6: move services.txt file to swh-docs (authored by zack).
move services.txt file to swh-docs
Feb 9 2018, 12:50 PM
zack committed rDDOC745f980465de: import services file from swh-environment (authored by zack).
import services file from swh-environment
Feb 9 2018, 12:50 PM
zack committed rDSTO86d68a672bc3: doc: update table clusters in SQL diagram (authored by zack).
doc: update table clusters in SQL diagram
Feb 9 2018, 12:48 PM
zack added a comment to T923: Mount the asf svn repository mirror.

I don't see anything new here. Subversion offers no integrity guarantees; this applies to the ASF repos as it applies to any other SVN repo out there. We need to decide on a policy about when (if at all) to redo full ingestions of Subversion repos (which will allow re-injecting modified objects at the cost of forking the resulting history on Software Heritage), or just say *shrug* and never re-ingest in a non-incremental way any Subversion repo we have previously ingested.

Feb 9 2018, 11:59 AM · SVN Loader

Feb 8 2018

zack committed rXHLWRLDc7640e8d6990: move source code to src/ (authored by zack).
move source code to src/
Feb 8 2018, 3:48 PM
zack committed rXHLWRLD43ef7dcd2220: add licensing information and README (authored by zack).
add licensing information and README
Feb 8 2018, 3:47 PM
zack committed rXHLWRLD3dbc8a2210ec: add licensing information and README (authored by zack).
add licensing information and README
Feb 8 2018, 10:57 AM