Page MenuHomeSoftware Heritage

rdicosmo (Roberto Di Cosmo)
UserAdministrator

User Details

User Since
Sep 9 2015, 9:17 PM (263 w, 4 d)
Roles
Administrator

Recent Activity

Thu, Sep 24

rdicosmo added a comment to T2636: Modify generation of full-context SWHID for root artifacts by omitting path classifier .

Yes please :-)

Thu, Sep 24, 1:13 PM · Web app, SWORD deposit, Data Model

Wed, Sep 23

rdicosmo added a comment to T2619: Make the front page "archive size" graphs consistent with one another.
In T2619#49514, @olasd wrote:
Wed, Sep 23, 1:56 PM · Web app

Tue, Sep 22

rdicosmo added a comment to T2173: Create guide to how to cite software using SWH.

There is a full section on citation in the ICMS 2020 paper, see https://dx.doi.org/10.1007/978-3-030-52200-1_36
The conceptual framework (data model, representation etc.) discussed in https://gitlab.inria.fr/gt-sw-citation/bibtex-sw-entry/-/blob/master/README.md and the biblatex-software documentation in https://gitlab.inria.fr/gt-sw-citation/bibtex-sw-entry/-/blob/master/biblatex/software-biblatex.pdf are of general interest.

Tue, Sep 22, 2:56 PM · Documentation

Mon, Sep 21

rdicosmo committed rMSLDb5e4923a7a6b: Update HAL workflow (authored by rdicosmo).
Update HAL workflow
Mon, Sep 21, 6:06 PM
rdicosmo committed rMSLD831e4bfb4ae3: Add context to ARDC (authored by rdicosmo).
Add context to ARDC
Mon, Sep 21, 5:10 PM

Sun, Sep 20

rdicosmo committed rMSLDe63061c85412: Add module for ARDC (authored by rdicosmo).
Add module for ARDC
Sun, Sep 20, 7:59 PM
rdicosmo committed rMSLD9a97fae3ded1: Add images (authored by rdicosmo).
Add images
Sun, Sep 20, 7:59 PM

Fri, Sep 18

rdicosmo committed rMSLD93be6ec241dd: Typos (authored by rdicosmo).
Typos
Fri, Sep 18, 11:23 AM

Thu, Sep 17

rdicosmo committed rMSLDe26d6351fd3f: EsLibre 2020 (authored by rdicosmo).
EsLibre 2020
Thu, Sep 17, 8:21 PM

Wed, Sep 16

rdicosmo committed rMSLD91c548e113e9: Update growth picture, add BB praise (authored by rdicosmo).
Update growth picture, add BB praise
Wed, Sep 16, 5:07 PM
rdicosmo committed rMSLD471ed4ea8beb: Add swmath.org logo (authored by rdicosmo).
Add swmath.org logo
Wed, Sep 16, 4:22 PM
rdicosmo committed rMSLDf5a686485bd3: Add IPOL image (authored by rdicosmo).
Add IPOL image
Wed, Sep 16, 4:13 PM

Thu, Sep 3

rdicosmo committed R183:30f9411296ae: Added CoSO GPLO note (authored by rdicosmo).
Added CoSO GPLO note
Thu, Sep 3, 8:18 PM
rdicosmo added a comment to T2559: Modify redirection on https://softwareheritage.org/swhid.

No objections on my side.

Thu, Sep 3, 1:35 PM · Website, SWORD deposit, Metadata workflow

Wed, Sep 2

rdicosmo committed rMSLD1093f8a3ef66: Add French National Plan for Open Science image (authored by rdicosmo).
Add French National Plan for Open Science image
Wed, Sep 2, 11:15 AM

Aug 26 2020

rdicosmo added a comment to T2430: lookup ingested tarballs (or similar source code containers) by container checksum.

Thank you @samplet for sharing this great work: I am really looking forward to see you and @vlorentz compare notes and see whether we can archive the three extra files you produce as extrinsic metadata in the SWH archive!

Aug 26 2020, 9:54 AM · Data Model

Aug 24 2020

rdicosmo added a comment to T1226: Save code now email notification for submitter.

Nice to see that this is becoming doable. I'm adding T2219 as a prerequisite to this task.

Aug 24 2020, 3:34 PM · Web app
rdicosmo added a parent task for T2219: Authentication / authorization: T1226: Save code now email notification for submitter.
Aug 24 2020, 3:33 PM · Restricted Project
rdicosmo added a subtask for T1226: Save code now email notification for submitter: T2219: Authentication / authorization.
Aug 24 2020, 3:33 PM · Web app
rdicosmo triaged T2532: Broken links in Save Code Now browse request pages as High priority.
Aug 24 2020, 1:13 PM · Web app

Aug 17 2020

rdicosmo committed R183:185c8f22ac89: Added IJDC 2020 (authored by rdicosmo).
Added IJDC 2020
Aug 17 2020, 3:40 PM

Jul 14 2020

rdicosmo updated the task description for T2491: Support identification of code fragments in diffs.
Jul 14 2020, 10:22 AM · Web app
rdicosmo triaged T2491: Support identification of code fragments in diffs as Low priority.
Jul 14 2020, 10:11 AM · Web app

Jul 13 2020

rdicosmo moved T2479: Blog post SWH YouTube channel from Restricted Project Column to Restricted Project Column on the Unknown Object (Project) board.
Jul 13 2020, 12:03 PM · Unknown Object (Project)

Jul 4 2020

rdicosmo committed rMSLD3e6003581a83: Update SWHID animation (authored by rdicosmo).
Update SWHID animation
Jul 4 2020, 3:31 PM
rdicosmo triaged T2475: Save again functionality: VCS type not pre-selected as Normal priority.
Jul 4 2020, 2:31 PM · Web app

Jul 2 2020

rdicosmo raised the priority of T1099: support origin blacklist for archive search and browse from Low to High.

This is an important feature: it has been dormant for a while, but we need to actually start implementing it.

Jul 2 2020, 8:21 PM · General, Web app

Jul 1 2020

rdicosmo added a comment to T2344: Build a connector for software deposit via Zenodo/InvenioRDM.

Great news !!

Does this mean we need to be SWORD 3 compatible?

Jul 1 2020, 4:03 PM · Restricted Project, SWORD deposit, Scientific Community Building

Jun 26 2020

rdicosmo closed T1733: Propose a @software BibTeX template as Resolved.

This is now published in CTAN (biblatex-software)

Jun 26 2020, 9:57 PM · Scientific Community Building

Jun 25 2020

rdicosmo closed T2465: Fix display of entries in administration interface for SaveCodeNow as Resolved.

I cannot reproduce this anymore either!
Closing this for now.

Jun 25 2020, 1:08 PM · Web app
rdicosmo added a comment to T2465: Fix display of entries in administration interface for SaveCodeNow .

May be related to T2406

Jun 25 2020, 11:18 AM · Web app
rdicosmo reassigned T2406: Allow filtering out test deposits on the deposit admin page from ardumont to anlambert.

This seems to be related to T2465 too

Jun 25 2020, 11:17 AM · Web app
rdicosmo reopened T2406: Allow filtering out test deposits on the deposit admin page as "Open".

Filtering is broken in the admin interface for deposits.

Jun 25 2020, 11:15 AM · Web app
rdicosmo triaged T2465: Fix display of entries in administration interface for SaveCodeNow as Normal priority.
Jun 25 2020, 11:06 AM · Web app
rdicosmo renamed T2464: Ensure code fragment highlighting works for all variants of qualifiers in a SWHID from Ensure code fragment highlighting works even when visit is missing in SWHID to Ensure code fragment highlighting works for all variants of qualifiers in a SWHID.
Jun 25 2020, 8:00 AM · Web app
rdicosmo triaged T2464: Ensure code fragment highlighting works for all variants of qualifiers in a SWHID as High priority.
Jun 25 2020, 7:59 AM · Web app

Jun 24 2020

rdicosmo committed rMSLD0002af3f60cc: Add SWHID schema conformant to v1.4 of SWHID (authored by moranegg).
Add SWHID schema conformant to v1.4 of SWHID
Jun 24 2020, 3:19 PM

Jun 23 2020

rdicosmo added a comment to T2430: lookup ingested tarballs (or similar source code containers) by container checksum.
In T2430#45767, @zimoun wrote:
  • if you still have that tarball at hand, then it can be ingested in SWH, and we keep the correspondence between SWHID and SHA256; in principle, you need to trust us, but one can foresee having external parties checking that the correspondence is real while the tarball is still there, and adding their observation to the chain of trust means you need to trust us less and less

By we keep the correspondence between SWHID and SHA256 you mean you on the SWH side?

Jun 23 2020, 6:49 PM · Data Model
rdicosmo added a comment to T2430: lookup ingested tarballs (or similar source code containers) by container checksum.

@rdicosmo The discussion of the "source of trust" is an important one, and it's interesting to see how we can address it going forward.

The proposal of a correspondence table, as I wrote on swh-devel, leaves open the question of today's and yesterday's software, assuming SWHIDs become the de facto standard tomorrow. How can I check the integrity of code fetched from SWH if all I have is its tarball's SHA256 from its release announcement? How can I check its authenticity if all I have is an OpenPGP signature computed over a tarball?

Jun 23 2020, 6:00 PM · Data Model

Jun 22 2020

rdicosmo renamed T2451: Archive Newsletter on the Software Heritage website from How to archive Newsletter on the Software Heritage website to Archive Newsletter on the Software Heritage website.
Jun 22 2020, 5:59 PM · Unknown Object (Project)
rdicosmo moved T2451: Archive Newsletter on the Software Heritage website from Restricted Project Column to Restricted Project Column on the Unknown Object (Project) board.
Jun 22 2020, 5:52 PM · Unknown Object (Project)
rdicosmo committed R183:50cbfeb0a4ae: Add ESEM 2020 paper (authored by rdicosmo).
Add ESEM 2020 paper
Jun 22 2020, 1:44 PM

Jun 18 2020

rdicosmo added a comment to D3317: templates/browse: Improve navigation for origin/snapshot related views.

Clean and elegant, LGTM

Jun 18 2020, 7:21 PM

Jun 17 2020

rdicosmo added a comment to T2456: Source code view should span full browser width.

It works!

Jun 17 2020, 4:27 PM · UX, Web app
rdicosmo added a comment to T2457: Properly handle font size modification in browser preferences.

The impression I get from playing with the interface is that at some moment the browser decides to "wrap" some of the components of the page in the source code view, that leads to the unwanted behaviour.
When we'll come to this, it might be interesting to compare with the behaviour of similar web front ends (GitHub, BitBucket, GitLab) that seem to handle this corner case seamlessly.

Jun 17 2020, 1:58 PM · UX, Web app
rdicosmo added a comment to T2456: Source code view should span full browser width.

Better. but not yet there... if you play with Ctrl+ Ctrl- a bit you'll see that the source code pane still ends up having a size controlled by the central part of the top bar that show the path

Jun 17 2020, 12:12 PM · UX, Web app

Jun 16 2020

rdicosmo added a comment to T2451: Archive Newsletter on the Software Heritage website.

Mailchimp is closed source (*sigh*) but is well accepted and has interesting features, so we'll not move away unless there are strong reasons.
Ack for the PDF way, as it seems the HTML one is too fragile/cumbersome.

Jun 16 2020, 7:56 PM · Unknown Object (Project)
rdicosmo added a comment to T2456: Source code view should span full browser width.

For the record, the same kind of rendering is obtained when browsing Github and playing with the browser zoom.

Jun 16 2020, 5:07 PM · UX, Web app
rdicosmo added a comment to T2456: Source code view should span full browser width.

After some other tests, it seems this mostly happens with Chrome: you can trigger the behaviour by playing with Ctrl + and Ctrl -
Firefox seems much more resilient.

Jun 16 2020, 3:27 PM · UX, Web app
rdicosmo added a comment to T2456: Source code view should span full browser width.

To reproduce, enlarge the browser window: if the browser width is small enough, the source code pane wraps below and uses all the available space.
In my case, I'm using a full screen on a 2560 pixel display

Jun 16 2020, 3:23 PM · UX, Web app
rdicosmo added a comment to T2456: Source code view should span full browser width.

Fortunately, this is already the case.

I can not reproduce locally. I think this is due to your browser cache.

Is refreshing the page with Ctrl+F5 fixes the display ?

Jun 16 2020, 3:11 PM · UX, Web app
rdicosmo triaged T2456: Source code view should span full browser width as High priority.
Jun 16 2020, 2:07 PM · UX, Web app

Jun 15 2020

rdicosmo added a comment to T2451: Archive Newsletter on the Software Heritage website.

A bare bone process may be something like this:

  • every time a newsletter is sent Marla does the following
    • get the HTML of the newsletter from Mailchimp
    • use it to create a new page on the SWH website
    • add a link to this new page at the bottom of the newsletter page on SWH (in all three languages)

This way the subscription page is also the archive page and may attract more traffic.

Any comments/improvements on this?

This sounds quite complicated and cumbersome to do ... We need a process that requires less copy / paste operations.

Possible solutions are:

  • We create the newsletter for each supported languages and send it directly from WordPress through the Newsletter plugin.
  • We keep using mailchimp to create the newsletter and print each mail to a PDF file. This works really well when using Chromium browser, see as an example. Apart the top mailchimp bar to remove, the PDF rendering looks great. We can then upload the PDF to our main site and offers a link to it in the Newsletter page. This way the only thing to maintain will be an archive table in the Newsletter page containing link to pdf files.
Jun 15 2020, 1:56 PM · Unknown Object (Project)
rdicosmo added a comment to T2451: Archive Newsletter on the Software Heritage website.

Just follow these instructions and copy / paste the HTML code at the bottom of the Newsletter page.

Jun 15 2020, 12:01 PM · Unknown Object (Project)
rdicosmo renamed T2451: Archive Newsletter on the Software Heritage website from How to archive Newsletter at Software Heritage archive to How to archive Newsletter on the Software Heritage website.
Jun 15 2020, 11:12 AM · Unknown Object (Project)

Jun 9 2020

rdicosmo added a comment to T2441: Update SWHID regexp used by Zenodo.

Le mar. 9 juin 2020 à 16:18, anlambert (Antoine Lambert) <
forge@softwareheritage.org> a écrit :

Jun 9 2020, 5:23 PM · Archive content
rdicosmo added a comment to T2441: Update SWHID regexp used by Zenodo.

Is there a way to improve the regex in https://github.com/inveniosoftware/idutils/pull/60 to allow qualifiers to come in any order instead of the canonical one?

Jun 9 2020, 3:59 PM · Archive content

Jun 6 2020

rdicosmo added a comment to T2430: lookup ingested tarballs (or similar source code containers) by container checksum.

An important issue indeed :-)

Jun 6 2020, 6:08 PM · Data Model

Jun 5 2020

rdicosmo added a comment to D3223: browse/revision: Fix revision id in snapshot context.

A quick comment on the code above: it seems to depend on the use of sha1_git parameters passed in the urls used for browsing.
While this is perfectly ok for today, we need a way to be future proof, when different versions of SWHIDs will come in, using different hashing algorithms.
Since we commit to maintain forever the resolution of previous versions of identifiers, the navigation in the webapp will need to be able to accomodate multiple hashing algorithm at the same time, and we need to plan to structure the code accordingly.

Jun 5 2020, 2:14 PM

May 29 2020

rdicosmo committed rMSLD2b4f1c0dab7b: Added team photo (authored by rdicosmo).
Added team photo
May 29 2020, 10:58 AM

May 27 2020

rdicosmo assigned T2420: website: Twitter icon missing in footer (missing font?) to anlambert.
May 27 2020, 9:59 AM · Website
rdicosmo shifted T2425: Missing twitter icon in footer of main website from the Restricted Space space to the S1 Public space.
May 27 2020, 9:46 AM · Website, Unknown Object (Project)
rdicosmo created T2425: Missing twitter icon in footer of main website.
May 27 2020, 9:46 AM · Website, Unknown Object (Project)

May 26 2020

rdicosmo committed rMSLDe549ba7022dc: Update Force11 presentation (authored by rdicosmo).
Update Force11 presentation
May 26 2020, 9:25 PM

May 25 2020

rdicosmo added a comment to D3154: Add artifact metadata to the extrinsic metadata storage specification..

Adding a last comment after a last reread

May 25 2020, 4:51 PM

May 21 2020

rdicosmo added a comment to T2418: Answer IPOL's interrogations on deposit metadata.

We decided to answer IPOL's question in a conservative way, by using the current metadata.xml approach, without introducing roles for authors, nor the JSON-LD format.

May 21 2020, 5:46 PM · Metadata workflow, SWORD deposit

May 20 2020

rdicosmo added a comment to T2418: Answer IPOL's interrogations on deposit metadata.

I removed the following sentence, that will be confusing for Jose Luis (and btw, here the external library is included so it is technically not a sparse deposit either :-))

This might be a good example for the sparse deposit use case (even if in this specific case, the deposit will include the libraries).

May 20 2020, 8:48 PM · Metadata workflow, SWORD deposit
rdicosmo updated the task description for T2418: Answer IPOL's interrogations on deposit metadata.
May 20 2020, 8:46 PM · Metadata workflow, SWORD deposit
rdicosmo triaged T2417: Wrong swh:rev anchor in SWHID computed for contents/directories in Permalink box as High priority.
May 20 2020, 2:26 PM · Web app
rdicosmo added a subtask for T2342: Add resolving of new SWHIDs contextual information: T2408: Repeated filename in path qualifier for SWHIDs.
May 20 2020, 2:15 PM · Web app
rdicosmo added a parent task for T2408: Repeated filename in path qualifier for SWHIDs: T2342: Add resolving of new SWHIDs contextual information.
May 20 2020, 2:15 PM · Web app

May 19 2020

rdicosmo triaged T2410: Check and complete the gitorious.org import as High priority.
May 19 2020, 9:49 AM · Git loader, Origin-Gitorious

May 18 2020

rdicosmo added a comment to D3162: admin/deposit: Extract origin from swh_anchor_id according to latest change.

Adapt implementation according to review

May 18 2020, 4:46 PM · SWORD deposit, Web app
rdicosmo added a comment to D3154: Add artifact metadata to the extrinsic metadata storage specification..

I applied most of your comments,

Great!

except:

For the snapshot key in the dictionary, use the snapshot core SWHID, not the visit integer (that is an internal implementation detail)

because I don't understand it.

May 18 2020, 4:45 PM
rdicosmo added a comment to D3154: Add artifact metadata to the extrinsic metadata storage specification..

Notice also that if/when we introduce new versions of SWHIDs with other
hashing algorithms, we need to maintain backward compatibility.
Metadata introduced with a given context must not be "migrated" to the new
SWHIDs, but "duplicated" for the new SWHIDs.
Better think right now of a schema that allows to "share" metadata payloads
among multiple versions of SWHIDs without actually copying them over.

May 18 2020, 3:29 PM
rdicosmo added a comment to D3162: admin/deposit: Extract origin from swh_anchor_id according to latest change.

Great, thanks. Dont think you need a test on a function as basic as this :-)

May 18 2020, 2:35 PM · SWORD deposit, Web app
rdicosmo added a comment to D3162: admin/deposit: Extract origin from swh_anchor_id according to latest change.

It would be better to make the code robust w.r.t. future potential changes in qualifier order.
Suggested pseudo-code attached (to be checked :-))

May 18 2020, 2:18 PM · SWORD deposit, Web app
rdicosmo triaged T2409: Remove trailing SWHID qualifiers from origin URL column in deposit admin page as Normal priority.
May 18 2020, 10:47 AM · Web app

May 16 2020

rdicosmo added a comment to D3154: Add artifact metadata to the extrinsic metadata storage specification..

Thanks for sharing this: we're definitely going forward!
Comments are in the text, but overall:

  • agree that we need a way to store and retrieve metadata that is contextless (or... valid in all contexts)
  • for the context, we need to use the SHWIDs themselves, not the sha1_git that is bound to version 1 of SWHIDS
May 16 2020, 9:04 AM

May 14 2020

rdicosmo triaged T2408: Repeated filename in path qualifier for SWHIDs as High priority.
May 14 2020, 3:38 PM · Web app
rdicosmo added a comment to T2398: Simplify/update the SWHIDs returned from a SWORD deposit.

I would say go... HAL identifiers may be updated (not a big deal for them to update them) or may not be updated, but as you say they will be resolvable, so better have a uniform status of all deposits.

May 14 2020, 12:11 PM · SWORD deposit
rdicosmo added a comment to T2405: Make the main links on the "save code now" request list a link to the archived copy.

I agree: the links in the current UI presentation are sometimes confusing.
We need to think this a bit over to get all the functionalities we want into a clean UI:

May 14 2020, 11:27 AM · UX, Web app
rdicosmo added a comment to T2406: Allow filtering out test deposits on the deposit admin page.

A generic way to address this feature is by adding "exclusion" criteria to the existing "inclusion" criteria.

May 14 2020, 11:19 AM · Web app

May 13 2020

rdicosmo added a comment to T2398: Simplify/update the SWHIDs returned from a SWORD deposit.

Indeed! Getting the swh:rev and swh:snp for the swh:dir for the deposit
should not be that complicated navigating the Merkle tree upwards, though,
as we expect little deduplication there, but we'll need to see..

May 13 2020, 10:54 AM · SWORD deposit

May 12 2020

rdicosmo added a comment to T2398: Simplify/update the SWHIDs returned from a SWORD deposit.
  • Do we migrate the old deposit values to the new ones? (sounds reasonable to do so)
May 12 2020, 10:38 AM · SWORD deposit

May 9 2020

rdicosmo moved T1766: Write blog post about the npm ingestion from Restricted Project Column to Restricted Project Column on the Unknown Object (Project) board.
May 9 2020, 5:33 PM · Origin-npm, Blog, Website, Unknown Object (Project)
rdicosmo moved T2305: CCSD, Inria, Software Heritage sprint from Restricted Project Column to Restricted Project Column on the Unknown Object (Project) board.
May 9 2020, 5:32 PM · Unknown Object (Project)
rdicosmo added a comment to T2398: Simplify/update the SWHIDs returned from a SWORD deposit.

Once this is ready, check with HAL that everything works in software deposits (see https://gitlab.ccsd.cnrs.fr/ccsd/hal/-/issues/264)

May 9 2020, 12:39 PM · SWORD deposit
rdicosmo added a parent task for T2398: Simplify/update the SWHIDs returned from a SWORD deposit: T2399: Update SWORD deposit documentation about the returned SWHID.
May 9 2020, 12:30 PM · SWORD deposit
rdicosmo added a subtask for T2399: Update SWORD deposit documentation about the returned SWHID: T2398: Simplify/update the SWHIDs returned from a SWORD deposit.
May 9 2020, 12:30 PM · SWORD deposit
rdicosmo triaged T2399: Update SWORD deposit documentation about the returned SWHID as Normal priority.
May 9 2020, 12:30 PM · SWORD deposit
rdicosmo triaged T2398: Simplify/update the SWHIDs returned from a SWORD deposit as High priority.
May 9 2020, 12:28 PM · SWORD deposit
rdicosmo added a comment to T1785: Setup staging infrastructure.

Is this task still open? Likewise for T1711 T1712 and T1872

May 9 2020, 12:11 PM · Staging environment, System administration
rdicosmo added a comment to T1872: staging infra: New vlan.

Is this now done? If that's the case this ticked should be closed.

May 9 2020, 12:09 PM · Staging environment, Staff, System administration
rdicosmo added a comment to T1694: Ask for a more human friendly message on HAL moderation platform when deposit has an error.

Now that we have access to CCSD's gitlab, this issue must move there.

May 9 2020, 12:07 PM · SWORD deposit
rdicosmo added a comment to T2369: Review IPOL deposit metadata.

@ardumont , @moranegg : may you check that everything is now ok? We should close this task it that's the case :-)

May 9 2020, 12:06 PM · Metadata workflow, SWORD deposit

May 8 2020

rdicosmo committed rMSLDa228b3746765: Added image of updated permalink box (authored by rdicosmo).
Added image of updated permalink box
May 8 2020, 3:05 PM

May 7 2020

rdicosmo added a comment to T2395: Save code now fails on a svn project.

Thanks for diving into this.

May 7 2020, 10:03 PM · SVN Loader
rdicosmo triaged T2395: Save code now fails on a svn project as Normal priority.
May 7 2020, 6:11 PM · SVN Loader

May 6 2020

rdicosmo committed R183:ea7f0fe2f903: Add SWHID and ICMS (authored by rdicosmo).
Add SWHID and ICMS
May 6 2020, 4:14 PM