diff --git a/README.md b/README.md
--- a/README.md
+++ b/README.md
@@ -4,12 +4,10 @@
 Tools to compute multiple indexes on SWH's raw contents:
 - content:
   - mimetype
-  - ctags
-  - language
   - fossology-license
   - metadata
-- revision:
-  - metadata
+- origin:
+  - metadata (intrinsic, using the content indexer; and extrinsic)
 
 An indexer is in charge of:
 - looking up objects
@@ -32,18 +30,13 @@
 - mimetype (queue swh_indexer_content_mimetype): detect the encoding
   and mimetype
 
-- language (queue swh_indexer_content_language): detect the
-  programming language
-
-- ctags (queue swh_indexer_content_ctags): compute tags information
-
 - fossology-license (queue swh_indexer_fossology_license): compute the
   license
 
-- metadata: translate file into translated_metadata dict
+- metadata: translate file from an ecosystem-specific formats to JSON-LD
+  (using schema.org/CodeMeta vocabulary)
 
-Current revision indexers:
+Current origin indexers:
 
-- metadata: detects files containing metadata and retrieves translated_metadata
-  in content_metadata table in storage or run content indexer to translate
-  files.
+- metadata: translate file from an ecosystem-specific formats to JSON-LD
+  (using schema.org/CodeMeta and ForgeFed vocabularies)