Goal: Build a semantic web of FOSS and promote SWH in citation and metadata workflows

1. implementation of metadata infrastructure/workflow (all tasks are under #Metadata workflow)
   - [x] strategy and design of metadata component [#T715]
   - storage layer (tables and API entry points)
     - [x] content level [#T733]
     - [x] revision level [#T741]
     - [x] origin level [#T737]
   - indexer layer:
     - [x] content indexer [#T715]
     - [x] revision indexer [#T738]
     - [x] origin indexer [#T1231]
   - fetch extrinsic metadata
     - [x] with deposit [#T832] + loader-core [#T852]
     - [ ] when listing [#T833]
     - [ ] when loading [TBD]
     - [ ] from external catalogs (e.g. libraries.io) [TBD]
   - [ ] deploy
   - [ ] document metadata workflow for coders and 'normal' users [TBD]

2. implementation of metadata tools for detection/extraction/translation
   - [x] swh-metadata-translator to CodeMeta schema
     - [ ] coverage of CodeMeta crosswalk table
   - [x] swh-metadata-detector (heuristic for filenames containing metadata)
   - [ ] CodeMeta generator

3. contacts and communication aka metadating
   - [x] RDA: metadata presentation at 10th RDA plenary
   - [x] Force11 SCIWG: hackathon participation
   - [x] Wikidata for preservation: "Pathways for Discovery of Free Software", Libre Planet 2018 submission
   - HAL-CCSD: deposit
     - [x] submit French poster for JSO2018
     - [x] submit English poster for RDA & Liber 2018
     - [x] write blog post [#T769]
     - [x] prepare demo script for FOSDEM: GNU Hello [#T882]
   - [x] Crossminer: define collaboration
   - [ ] CodeMeta: contribute to crosswalk table and discussions
   - [ ] create and maintain metadata network (SPN, TOTEM, Ontosoft, CHM, etc.)
   - [ ] call for contributions to swh-metadata-tools

4. user features:
   - [ ] view indexed metadata for artifact on web UI (content, revision, origin)
   - [ ] search over metadata
     - [ ] Basic search
     - [ ] Advanced search (on specific terms)

5. Research and specify metadata use cases (not necessarily SWH):
   • scholarly ecosystem and open science
     - [x] Preserve research software (https://www.softwareheritage.org/save-and-reference-research-software/)
     - [x] Credit and attribution (activity report)
     - [x] Reference and identify software artifacts (activity report)
     - [x] Moderate research software (activity report)
     - [ ] Evaluate research software
   • culture and digital preservation
     - [x] Acquire and Archive legacy software source code (https://www.softwareheritage.org/swhap/)
       - [x] collect
       - [x] curate
       - [x] archive
       - [ ] present
     - [ ] Emulate legacy software [Acquire -> Archive (in SWH) -> Retrieve (on EaaSi) -> Use]
     - [ ] Archive cultural ephemera and context
   • web semantics and linked data
     - [ ] connect/link software source code over the WWW
     - [ ] create the FOSS graph
     - [ ] use search engines to find software source code
   • free and open source software
     - [ ] recommendation systems: assess quality of software components and similarity between components
     - [ ] provenance of software artifacts
     - [ ] determine legality of usage, modification and redistribution

6. Document SWH user stories and provide detailed guide(s):
   Existing user stories:
   - [ ] deposit software with HAL
     - [x] text (on HAL)
     - [ ] video
   - [ ] save code now (research software)
     - [x] text
     - [ ] video
   - [ ] swhap
     - [x] text
     - [ ] video
   In roadmap user stories:
   - [ ] retrieve metadata for citation
     - [ ] text
     - [ ] video
   - [ ] search SWH archive (URL and metadata)
     - [ ] text
     - [ ] video
   - [ ] View software artifact and metadata
     - [ ] text
     - [ ] video
   - [ ] Fetch software artifact and metadata (API)
     - [ ] text
     - [ ] video
   - [ ] Create codemeta.json with CodeMeta generator (after publishing tool on GitHub)
     - [ ] text
     - [ ] video
   Wish-list user stories:
   - [ ] retrieve BibTeX for an artifact (WebApp and API) [is it really possible?]
   - [ ] retrieve codemeta.json export for a revision or origin [is it really possible?]
   - [ ] fetch software from SWH archive into EaaSi emulator
     - compile
     - execute
     - use
   - [ ] FAIR for software with SWH archive
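
Illustration for the detection/translation tools in section 2: the sketch below shows one way a filename heuristic and a translation to CodeMeta terms could fit together. It is not the actual swh-metadata-detector or translator code. The filename table, the function names `detect_metadata_files` and `translate_npm`, and the small npm-to-CodeMeta mapping are all assumptions made for the example; the real crosswalk table covers many more terms and formats.

```python
import json

# Hypothetical filename-to-format table; the real detector's list may differ.
METADATA_FILENAMES = {
    "package.json": "npm",
    "codemeta.json": "codemeta",
    "pom.xml": "maven",
}

# Illustrative subset of an npm -> CodeMeta mapping (assumed, not exhaustive).
NPM_TO_CODEMETA = {
    "name": "name",
    "version": "version",
    "description": "description",
    "license": "license",
    "keywords": "keywords",
    "homepage": "url",
}


def detect_metadata_files(paths):
    """Return {path: format} for paths whose base name is a known metadata file."""
    found = {}
    for path in paths:
        basename = path.rsplit("/", 1)[-1].lower()
        if basename in METADATA_FILENAMES:
            found[path] = METADATA_FILENAMES[basename]
    return found


def translate_npm(raw_json):
    """Map a package.json document to a minimal CodeMeta-style dict."""
    package = json.loads(raw_json)
    doc = {
        "@context": "https://doi.org/10.5063/schema/codemeta-2.0",
        "@type": "SoftwareSourceCode",
    }
    # Copy only the keys covered by the mapping; unmapped keys are dropped.
    for npm_key, codemeta_key in NPM_TO_CODEMETA.items():
        if npm_key in package:
            doc[codemeta_key] = package[npm_key]
    return doc
```

In this sketch, detection runs over a directory listing first and translation is applied only to the files it flags, so the "coverage of CodeMeta crosswalk table" task would correspond to how many source keys the mapping handles.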