Page MenuHomeSoftware Heritage

rdicosmo (Roberto Di Cosmo)
UserAdministrator

User Details

User Since
Sep 9 2015, 9:17 PM (251 w, 3 d)
Roles
Administrator

Recent Activity

Yesterday

rdicosmo committed rMSLD3e6003581a83: Update SWHID animation (authored by rdicosmo).
Update SWHID animation
Sat, Jul 4, 3:31 PM
rdicosmo triaged T2475: Save again functionality: VCS type not pre-selected as Normal priority.
Sat, Jul 4, 2:31 PM · Web app

Thu, Jul 2

rdicosmo raised the priority of T1099: support origin blacklist for archive search and browse from Low to High.

This is an important feature: it has been dormant for a while, but we need to actually start implementing it.

Thu, Jul 2, 8:21 PM · General, Web app

Wed, Jul 1

rdicosmo added a comment to T2344: Build a connector for software deposit via Zenodo/InvenioRDM.

Great news !!

Does this mean we need to be SWORD 3 compatible?

Wed, Jul 1, 4:03 PM · Restricted Project, SWORD deposit, Scientific Community Building

Fri, Jun 26

rdicosmo closed T1733: Propose a @software BibTeX template as Resolved.

This is now published in CTAN (biblatex-software)

Fri, Jun 26, 9:57 PM · Scientific Community Building

Thu, Jun 25

rdicosmo closed T2465: Fix display of entries in administration interface for SaveCodeNow as Resolved.

I cannot reproduce this anymore either!
Closing this for now.

Thu, Jun 25, 1:08 PM · Web app
rdicosmo added a comment to T2465: Fix display of entries in administration interface for SaveCodeNow .

May be related to T2406

Thu, Jun 25, 11:18 AM · Web app
rdicosmo reassigned T2406: Allow filtering out test deposits on the deposit admin page from ardumont to anlambert.

This seems to be related to T2465 too

Thu, Jun 25, 11:17 AM · Web app
rdicosmo reopened T2406: Allow filtering out test deposits on the deposit admin page as "Open".

Filtering is broken in the admin interface for deposits.

Thu, Jun 25, 11:15 AM · Web app
rdicosmo triaged T2465: Fix display of entries in administration interface for SaveCodeNow as Normal priority.
Thu, Jun 25, 11:06 AM · Web app
rdicosmo renamed T2464: Ensure code fragment highlighting works for all variants of qualifiers in a SWHID from Ensure code fragment highlighting works even when visit is missing in SWHID to Ensure code fragment highlighting works for all variants of qualifiers in a SWHID.
Thu, Jun 25, 8:00 AM · Web app
rdicosmo triaged T2464: Ensure code fragment highlighting works for all variants of qualifiers in a SWHID as High priority.
Thu, Jun 25, 7:59 AM · Web app

Wed, Jun 24

rdicosmo committed rMSLD0002af3f60cc: Add SWHID schema conformant to v1.4 of SWHID (authored by moranegg).
Add SWHID schema conformant to v1.4 of SWHID
Wed, Jun 24, 3:19 PM

Tue, Jun 23

rdicosmo added a comment to T2430: lookup ingested tarballs (or similar source code containers) by container checksum.
In T2430#45767, @zimoun wrote:
  • if you still have that tarball at hand, then it can be ingested in SWH, and we keep the correspondence between SWHID and SHA256; in principle, you need to trust us, but one can foresee having external parties checking that the correspondence is real while the tarball is still there, and adding their observation to the chain of trust means you need to trust us less and less

By we keep the correspondence between SWHID and SHA256 you mean you on the SWH side?

Tue, Jun 23, 6:49 PM · Data Model
rdicosmo added a comment to T2430: lookup ingested tarballs (or similar source code containers) by container checksum.

@rdicosmo The discussion of the "source of trust" is an important one, and it's interesting to see how we can address it going forward.

The proposal of a correspondence table, as I wrote on swh-devel, leaves open the question of today's and yesterday's software, assuming SWHIDs become the de facto standard tomorrow. How can I check the integrity of code fetched from SWH if all I have is its tarball's SHA256 from its release announcement? How can I check its authenticity if all I have is an OpenPGP signature computed over a tarball?

Tue, Jun 23, 6:00 PM · Data Model

Mon, Jun 22

rdicosmo renamed T2451: Archive Newsletter on the Software Heritage website from How to archive Newsletter on the Software Heritage website to Archive Newsletter on the Software Heritage website.
Mon, Jun 22, 5:59 PM · Unknown Object (Project)
rdicosmo moved T2451: Archive Newsletter on the Software Heritage website from Restricted Project Column to Restricted Project Column on the Unknown Object (Project) board.
Mon, Jun 22, 5:52 PM · Unknown Object (Project)
rdicosmo committed R183:50cbfeb0a4ae: Add ESEM 2020 paper (authored by rdicosmo).
Add ESEM 2020 paper
Mon, Jun 22, 1:44 PM

Thu, Jun 18

rdicosmo added a comment to D3317: templates/browse: Improve navigation for origin/snapshot related views.

Clean and elegant, LGTM

Thu, Jun 18, 7:21 PM

Wed, Jun 17

rdicosmo added a comment to T2456: Source code view should span full browser width.

It works!

Wed, Jun 17, 4:27 PM · UX, Web app
rdicosmo added a comment to T2457: Properly handle font size modification in browser preferences.

The impression I get from playing with the interface is that at some moment the browser decides to "wrap" some of the components of the page in the source code view, that leads to the unwanted behaviour.
When we'll come to this, it might be interesting to compare with the behaviour of similar web front ends (GitHub, BitBucket, GitLab) that seem to handle this corner case seamlessly.

Wed, Jun 17, 1:58 PM · UX, Web app
rdicosmo added a comment to T2456: Source code view should span full browser width.

Better. but not yet there... if you play with Ctrl+ Ctrl- a bit you'll see that the source code pane still ends up having a size controlled by the central part of the top bar that show the path

Wed, Jun 17, 12:12 PM · UX, Web app

Tue, Jun 16

rdicosmo added a comment to T2451: Archive Newsletter on the Software Heritage website.

Mailchimp is closed source (*sigh*) but is well accepted and has interesting features, so we'll not move away unless there are strong reasons.
Ack for the PDF way, as it seems the HTML one is too fragile/cumbersome.

Tue, Jun 16, 7:56 PM · Unknown Object (Project)
rdicosmo added a comment to T2456: Source code view should span full browser width.

For the record, the same kind of rendering is obtained when browsing Github and playing with the browser zoom.

Tue, Jun 16, 5:07 PM · UX, Web app
rdicosmo added a comment to T2456: Source code view should span full browser width.

After some other tests, it seems this mostly happens with Chrome: you can trigger the behaviour by playing with Ctrl + and Ctrl -
Firefox seems much more resilient.

Tue, Jun 16, 3:27 PM · UX, Web app
rdicosmo added a comment to T2456: Source code view should span full browser width.

To reproduce, enlarge the browser window: if the browser width is small enough, the source code pane wraps below and uses all the available space.
In my case, I'm using a full screen on a 2560 pixel display

Tue, Jun 16, 3:23 PM · UX, Web app
rdicosmo added a comment to T2456: Source code view should span full browser width.

Fortunately, this is already the case.

I can not reproduce locally. I think this is due to your browser cache.

Is refreshing the page with Ctrl+F5 fixes the display ?

Tue, Jun 16, 3:11 PM · UX, Web app
rdicosmo triaged T2456: Source code view should span full browser width as High priority.
Tue, Jun 16, 2:07 PM · UX, Web app

Mon, Jun 15

rdicosmo added a comment to T2451: Archive Newsletter on the Software Heritage website.

A bare bone process may be something like this:

  • every time a newsletter is sent Marla does the following
    • get the HTML of the newsletter from Mailchimp
    • use it to create a new page on the SWH website
    • add a link to this new page at the bottom of the newsletter page on SWH (in all three languages)

This way the subscription page is also the archive page and may attract more traffic.

Any comments/improvements on this?

This sounds quite complicated and cumbersome to do ... We need a process that requires less copy / paste operations.

Possible solutions are:

  • We create the newsletter for each supported languages and send it directly from WordPress through the Newsletter plugin.
  • We keep using mailchimp to create the newsletter and print each mail to a PDF file. This works really well when using Chromium browser, see as an example. Apart the top mailchimp bar to remove, the PDF rendering looks great. We can then upload the PDF to our main site and offers a link to it in the Newsletter page. This way the only thing to maintain will be an archive table in the Newsletter page containing link to pdf files.
Mon, Jun 15, 1:56 PM · Unknown Object (Project)
rdicosmo added a comment to T2451: Archive Newsletter on the Software Heritage website.

Just follow these instructions and copy / paste the HTML code at the bottom of the Newsletter page.

Mon, Jun 15, 12:01 PM · Unknown Object (Project)
rdicosmo renamed T2451: Archive Newsletter on the Software Heritage website from How to archive Newsletter at Software Heritage archive to How to archive Newsletter on the Software Heritage website.
Mon, Jun 15, 11:12 AM · Unknown Object (Project)

Tue, Jun 9

rdicosmo added a comment to T2441: Update SWHID regexp used by Zenodo.

Le mar. 9 juin 2020 à 16:18, anlambert (Antoine Lambert) <
forge@softwareheritage.org> a écrit :

Tue, Jun 9, 5:23 PM · Archive content
rdicosmo added a comment to T2441: Update SWHID regexp used by Zenodo.

Is there a way to improve the regex in https://github.com/inveniosoftware/idutils/pull/60 to allow qualifiers to come in any order instead of the canonical one?

Tue, Jun 9, 3:59 PM · Archive content

Sat, Jun 6

rdicosmo added a comment to T2430: lookup ingested tarballs (or similar source code containers) by container checksum.

An important issue indeed :-)

Sat, Jun 6, 6:08 PM · Data Model

Fri, Jun 5

rdicosmo added a comment to D3223: browse/revision: Fix revision id in snapshot context.

A quick comment on the code above: it seems to depend on the use of sha1_git parameters passed in the urls used for browsing.
While this is perfectly ok for today, we need a way to be future proof, when different versions of SWHIDs will come in, using different hashing algorithms.
Since we commit to maintain forever the resolution of previous versions of identifiers, the navigation in the webapp will need to be able to accomodate multiple hashing algorithm at the same time, and we need to plan to structure the code accordingly.

Fri, Jun 5, 2:14 PM

May 29 2020

rdicosmo committed rMSLD2b4f1c0dab7b: Added team photo (authored by rdicosmo).
Added team photo
May 29 2020, 10:58 AM

May 27 2020

rdicosmo assigned T2420: website: Twitter icon missing in footer (missing font?) to anlambert.
May 27 2020, 9:59 AM · Website
rdicosmo shifted T2425: Missing twitter icon in footer of main website from the Restricted Space space to the S1 Public space.
May 27 2020, 9:46 AM · Website, Unknown Object (Project)
rdicosmo created T2425: Missing twitter icon in footer of main website.
May 27 2020, 9:46 AM · Website, Unknown Object (Project)

May 26 2020

rdicosmo committed rMSLDe549ba7022dc: Update Force11 presentation (authored by rdicosmo).
Update Force11 presentation
May 26 2020, 9:25 PM

May 25 2020

rdicosmo added a comment to D3154: Add artifact metadata to the extrinsic metadata storage specification..

Adding a last comment after a last reread

May 25 2020, 4:51 PM

May 21 2020

rdicosmo added a comment to T2418: Answer IPOL's interrogations on deposit metadata.

We decided to answer IPOL's question in a conservative way, by using the current metadata.xml approach, without introducing roles for authors, nor the JSON-LD format.

May 21 2020, 5:46 PM · Metadata workflow, SWORD deposit

May 20 2020

rdicosmo added a comment to T2418: Answer IPOL's interrogations on deposit metadata.

I removed the following sentence, that will be confusing for Jose Luis (and btw, here the external library is included so it is technically not a sparse deposit either :-))

This might be a good example for the sparse deposit use case (even if in this specific case, the deposit will include the libraries).

May 20 2020, 8:48 PM · Metadata workflow, SWORD deposit
rdicosmo updated the task description for T2418: Answer IPOL's interrogations on deposit metadata.
May 20 2020, 8:46 PM · Metadata workflow, SWORD deposit
rdicosmo triaged T2417: Wrong swh:rev anchor in SWHID computed for contents/directories in Permalink box as High priority.
May 20 2020, 2:26 PM · Web app
rdicosmo added a subtask for T2342: Add resolving of new SWHIDs contextual information: T2408: Repeated filename in path qualifier for SWHIDs.
May 20 2020, 2:15 PM · Web app
rdicosmo added a parent task for T2408: Repeated filename in path qualifier for SWHIDs: T2342: Add resolving of new SWHIDs contextual information.
May 20 2020, 2:15 PM · Web app

May 19 2020

rdicosmo triaged T2410: Check and complete the gitorious.org import as High priority.
May 19 2020, 9:49 AM · Git loader, Origin-Gitorious

May 18 2020

rdicosmo added a comment to D3162: admin/deposit: Extract origin from swh_anchor_id according to latest change.

Adapt implementation according to review

May 18 2020, 4:46 PM · SWORD deposit, Web app
rdicosmo added a comment to D3154: Add artifact metadata to the extrinsic metadata storage specification..

I applied most of your comments,

Great!

except:

For the snapshot key in the dictionary, use the snapshot core SWHID, not the visit integer (that is an internal implementation detail)

because I don't understand it.

May 18 2020, 4:45 PM
rdicosmo added a comment to D3154: Add artifact metadata to the extrinsic metadata storage specification..

Notice also that if/when we introduce new versions of SWHIDs with other
hashing algorithms, we need to maintain backward compatibility.
Metadata introduced with a given context must not be "migrated" to the new
SWHIDs, but "duplicated" for the new SWHIDs.
Better think right now of a schema that allows to "share" metadata payloads
among multiple versions of SWHIDs without actually copying them over.

May 18 2020, 3:29 PM
rdicosmo added a comment to D3162: admin/deposit: Extract origin from swh_anchor_id according to latest change.

Great, thanks. Dont think you need a test on a function as basic as this :-)

May 18 2020, 2:35 PM · SWORD deposit, Web app
rdicosmo added a comment to D3162: admin/deposit: Extract origin from swh_anchor_id according to latest change.

It would be better to make the code robust w.r.t. future potential changes in qualifier order.
Suggested pseudo-code attached (to be checked :-))

May 18 2020, 2:18 PM · SWORD deposit, Web app
rdicosmo triaged T2409: Remove trailing SWHID qualifiers from origin URL column in deposit admin page as Normal priority.
May 18 2020, 10:47 AM · Web app

May 16 2020

rdicosmo added a comment to D3154: Add artifact metadata to the extrinsic metadata storage specification..

Thanks for sharing this: we're definitely going forward!
Comments are in the text, but overall:

  • agree that we need a way to store and retrieve metadata that is contextless (or... valid in all contexts)
  • for the context, we need to use the SHWIDs themselves, not the sha1_git that is bound to version 1 of SWHIDS
May 16 2020, 9:04 AM

May 14 2020

rdicosmo triaged T2408: Repeated filename in path qualifier for SWHIDs as High priority.
May 14 2020, 3:38 PM · Web app
rdicosmo added a comment to T2398: Simplify/update the SWHIDs returned from a SWORD deposit.

I would say go... HAL identifiers may be updated (not a big deal for them to update them) or may not be updated, but as you say they will be resolvable, so better have a uniform status of all deposits.

May 14 2020, 12:11 PM · SWORD deposit
rdicosmo added a comment to T2405: Make the main links on the "save code now" request list a link to the archived copy.

I agree: the links in the current UI presentation are sometimes confusing.
We need to think this a bit over to get all the functionalities we want into a clean UI:

May 14 2020, 11:27 AM · UX, Web app
rdicosmo added a comment to T2406: Allow filtering out test deposits on the deposit admin page.

A generic way to address this feature is by adding "exclusion" criteria to the existing "inclusion" criteria.

May 14 2020, 11:19 AM · Web app

May 13 2020

rdicosmo added a comment to T2398: Simplify/update the SWHIDs returned from a SWORD deposit.

Indeed! Getting the swh:rev and swh:snp for the swh:dir for the deposit
should not be that complicated navigating the Merkle tree upwards, though,
as we expect little deduplication there, but we'll need to see..

May 13 2020, 10:54 AM · SWORD deposit

May 12 2020

rdicosmo added a comment to T2398: Simplify/update the SWHIDs returned from a SWORD deposit.
  • Do we migrate the old deposit values to the new ones? (sounds reasonable to do so)
May 12 2020, 10:38 AM · SWORD deposit

May 9 2020

rdicosmo moved T1766: Write blog post about the npm ingestion from Restricted Project Column to Restricted Project Column on the Unknown Object (Project) board.
May 9 2020, 5:33 PM · Origin-npm, Blog, Website, Unknown Object (Project)
rdicosmo moved T2305: CCSD, Inria, Software Heritage sprint from Restricted Project Column to Restricted Project Column on the Unknown Object (Project) board.
May 9 2020, 5:32 PM · Unknown Object (Project)
rdicosmo added a comment to T2398: Simplify/update the SWHIDs returned from a SWORD deposit.

Once this is ready, check with HAL that everything works in software deposits (see https://gitlab.ccsd.cnrs.fr/ccsd/hal/-/issues/264)

May 9 2020, 12:39 PM · SWORD deposit
rdicosmo added a parent task for T2398: Simplify/update the SWHIDs returned from a SWORD deposit: T2399: Update SWORD deposit documentation.
May 9 2020, 12:30 PM · SWORD deposit
rdicosmo added a subtask for T2399: Update SWORD deposit documentation: T2398: Simplify/update the SWHIDs returned from a SWORD deposit.
May 9 2020, 12:30 PM · SWORD deposit
rdicosmo triaged T2399: Update SWORD deposit documentation as Normal priority.
May 9 2020, 12:30 PM · SWORD deposit
rdicosmo triaged T2398: Simplify/update the SWHIDs returned from a SWORD deposit as High priority.
May 9 2020, 12:28 PM · SWORD deposit
rdicosmo added a comment to T1785: Setup staging infrastructure.

Is this task still open? Likewise for T1711 T1712 and T1872

May 9 2020, 12:11 PM · Staging environment, System administration
rdicosmo added a comment to T1872: staging infra: New vlan.

Is this now done? If that's the case this ticked should be closed.

May 9 2020, 12:09 PM · Staging environment, Staff, System administration
rdicosmo added a comment to T1694: Ask for a more human friendly message on HAL moderation platform when deposit has an error.

Now that we have access to CCSD's gitlab, this issue must move there.

May 9 2020, 12:07 PM · SWORD deposit
rdicosmo added a comment to T2369: Review IPOL deposit metadata.

@ardumont , @moranegg : may you check that everything is now ok? We should close this task it that's the case :-)

May 9 2020, 12:06 PM · Metadata workflow, SWORD deposit

May 8 2020

rdicosmo committed rMSLDa228b3746765: Added image of updated permalink box (authored by rdicosmo).
Added image of updated permalink box
May 8 2020, 3:05 PM

May 7 2020

rdicosmo added a comment to T2395: Save code now fails on a svn project.

Thanks for diving into this.

May 7 2020, 10:03 PM · SVN Loader
rdicosmo triaged T2395: Save code now fails on a svn project as Normal priority.
May 7 2020, 6:11 PM · SVN Loader

May 6 2020

rdicosmo committed R183:ea7f0fe2f903: Add SWHID and ICMS (authored by rdicosmo).
Add SWHID and ICMS
May 6 2020, 4:14 PM

May 5 2020

rdicosmo committed rMSLDa7db79361005: Final set of slides (authored by rdicosmo).
Final set of slides
May 5 2020, 7:01 PM
rdicosmo committed rMSLDa1cdbc14f08e: Added references to articles using biblatex-software (authored by rdicosmo).
Added references to articles using biblatex-software
May 5 2020, 7:01 PM
rdicosmo committed rMSLDd94554e5a5b1: Stable version with examples for biblatex-software (authored by rdicosmo).
Stable version with examples for biblatex-software
May 5 2020, 7:01 PM
rdicosmo committed rMSLDb8e73e4c7818: Slides for SCIWG presentation (authored by rdicosmo).
Slides for SCIWG presentation
May 5 2020, 7:01 PM
rdicosmo committed rMSLD86f2be9e75fc: Resized bibliography output Add simple example of loading the style Add… (authored by rdicosmo).
Resized bibliography output Add simple example of loading the style Add…
May 5 2020, 7:01 PM
rdicosmo committed rMSLDc1f698ebddf1: Add slides before changing bibtex typesetting with listings (authored by rdicosmo).
Add slides before changing bibtex typesetting with listings
May 5 2020, 2:11 PM
rdicosmo committed rMSLDfe0cfa6e6759: Updated biblio module (authored by rdicosmo).
Updated biblio module
May 5 2020, 2:11 PM

May 4 2020

rdicosmo closed T2377: Check why origin is created for IPOL with uuid as Resolved.

The issue has been clarified, no need to change anything in the code, only clarifications in the documentation with examples.

May 4 2020, 6:03 PM · Metadata workflow, SWORD deposit
rdicosmo closed T2377: Check why origin is created for IPOL with uuid, a subtask of T2369: Review IPOL deposit metadata, as Resolved.
May 4 2020, 6:03 PM · Metadata workflow, SWORD deposit
rdicosmo added a comment to T2377: Check why origin is created for IPOL with uuid.

Users of the deposit API command line client must add a --slug command line option to avoid the creation of a uuid.
For IPOL, we agreed on the following:
SWH configuration side: provider_url = https://doi.org/10.5201/
slug on the command line = rest of the url, that is, for example, ipol.2018.236

May 4 2020, 5:14 PM · Metadata workflow, SWORD deposit

May 1 2020

rdicosmo added a reverting change for R183:a31ca248f7e7: Fix author order: R183:ca49ec778181: Revert "Fix author order": non alpha was intended.
May 1 2020, 6:51 PM
rdicosmo committed R183:ca49ec778181: Revert "Fix author order": non alpha was intended (authored by rdicosmo).
Revert "Fix author order": non alpha was intended
May 1 2020, 6:51 PM
rdicosmo committed R183:a31ca248f7e7: Fix author order (authored by rdicosmo).
Fix author order
May 1 2020, 6:43 PM
rdicosmo committed R183:7988657747f4: Update reference to ReScience article (authored by rdicosmo).
Update reference to ReScience article
May 1 2020, 6:34 PM

Apr 30 2020

rdicosmo closed D3108: SWHID spec: full reread.
Apr 30 2020, 7:06 PM
rdicosmo committed rDMOD9f5d266b9ca0: SWHID spec: full reread (authored by zack).
SWHID spec: full reread
Apr 30 2020, 7:06 PM
rdicosmo accepted D3108: SWHID spec: full reread.

LGTM, let's land this!

Apr 30 2020, 7:05 PM
rdicosmo added a comment to T2387: Clarify visits in SWHIDs.

I hear your concerns, but the discussion already took place and we have now already 2 published articles out there using visit, plus documentation shared that uses this new terminology.
So, no, we cannot change this now.

Apr 30 2020, 12:30 PM · Data Model
rdicosmo added a comment to T2387: Clarify visits in SWHIDs.

We considered snapshot vs visit when choosing the qualifier name, and we settled with visit, as it conveys the idea that a repository snapshot is taken when a visit is performed (not necessarily through the same origin).

Apr 30 2020, 12:14 PM · Data Model

Apr 28 2020

rdicosmo added a comment to D3077: Restructure SWHID documentation.

Thanks @vlorentz !

Apr 28 2020, 9:17 PM
rdicosmo closed D3077: Restructure SWHID documentation.
Apr 28 2020, 9:17 PM
rdicosmo committed rDMOD8367eec593cf: Restructure SWHID documentation in preparation for T2385 - merge grammars into… (authored by rdicosmo).
Restructure SWHID documentation in preparation for T2385 - merge grammars into…
Apr 28 2020, 9:17 PM
rdicosmo created D3077: Restructure SWHID documentation.
Apr 28 2020, 8:52 PM

Apr 26 2020

rdicosmo committed R183:cb4bd76e1a9d: Add rescience paper (authored by rdicosmo).
Add rescience paper
Apr 26 2020, 11:47 AM