Page MenuHomeSoftware Heritage
Paste P169

One year plan
ActivePublic

Authored by moranegg on Jul 18 2017, 6:10 PM.
Goal: Build semantic web of FOSS and promote SWH in citation and metadata workflows
1. implementation metadata infrastructure/workflow (all tasks are under #Metadata workflow)
- [x] strategy and design of metadata component [#T715]
- storage layer (tables and api entry points)
- [x] content level [#T733]
- [x] revision level [#T741]
- [x] origin level [#T737]
- indexer layer:
- [x] content indexer [#T715]
- [x] revision indexer [#T738]
- [x] origin indexer [#T1231]
- fetch extrinsic metadata
- [x] with deposit [#T832] + loader-core [#T852]
- [ ] when listing [#T833]
- [ ] when loading [TBD]
- [ ] from external catalogs (e.g libraries.io) [TBD]
- [ ] deploy
- [ ] document metadata workflow for coders and 'normal' users [TBD]
2. implementation of metadata tools for detection/extraction/translation
- [x] swh-metadata-traslator to CodeMeta schema
- [ ] coverage CodeMeta crosswalk table
- [x] swh-metadata-detector (heuristic for filenames containing metadata)
- [ ] CodeMeta generator
3. contacts and communication aka metadating
- [x] RDA- metadata presentation at 10th RDA plenary
- [x] Force11 SCIWG- hackathon participation
- [x] Wikidata for preservation- Pathways for Discovery of Free Software: Libre Planet 2018 submission
- HAL-CCSD- deposit
- [x] submit french poster for JSO2018
- [x] submit english poster for RDA & Liber 2018
- [x] write blog post [#T769]
- [x] prepare demo script for FOSDEM- GNU Hello [#T882]
- [x] Crossminer- define collaboration
- [ ] CodeMeta- contribute to crosswalk table and discussions
- [ ] create and keep metadata network ( SPN, TOTEM, Ontosoft, CHM, etc..)
- [ ] call for contribution to swh-metadata-tools
4. user features:
- [] view indexed metadata for artifact on web UI (content, revision, origin)
- [] search over metadata
- [] Basic search
- [] Advanced search (on specific terms)
5. Research and specify metadata use cases (not necessarily SWH):
• scholarly ecosystem and open science
- [x] Preserve research software (https://www.softwareheritage.org/save-and-reference-research-software/)
- [x] Credit and attribution (activity report)
- [x] Reference and identify software artifacts (activity report)
- [x] Moderate research software (activity report)
- [] Evaluate research software
• culture and digital preservation
- [x] Acquire and Archive legacy software source code (https://www.softwareheritage.org/swhap/)
- [x] collect
- [x] curate
- [x] archive
- [] present
- [] Emulate legacy software [Acquire -> Archive (in SWH) -> Retrieve (on EaaSi) -> Use]
- [] Archive cultural ephemera and context
• web semantics and linked data
- [] connect/ link software source code over the WWW
- [] create the FOSS graph
- [] use search engines to find software source code
• free and open source software
- [] recommendation systems: asses quality of software components and similarity between components
- [] provenance of software artifacts
- [] determine legality of usage, modification and redistribution
6. Document SWH user stories and provide detailed guide(s):
Existing user stories:
- [] deposit software with HAL
- [x] text (on HAL)
- [] video
- [] save code now (research software)
- [x] text
- [] video
- [] swhap
- [x] text
- [] video
In roadmap user stories:
- [] retrieve metadata for citation
- [] text
- [] video
- [] search SWH archive (url and metadata)
- [] text
- [] video
- [] View software artifact and metadata
- [] text
- [] video
- [] Fetch software artifact and metadata (API)
- [] text
- [] video
- [] Create codemeta.json with CodeMeta generator (after publishing tool on GitHub)
- [] text
- [] video
Wish-list user stories:
- [] retrieve BibTex for an artifact (WebApp and API) [is it really possible?]
- [] retrieve codemeta.json export for a revision or origin [is it really possible?]
- [] fetch software from SWH archive into EaaSi emulator - compile - execute - use
- [] FAIR for software with SWH archive