Paths

Table of Contentst

Diffusion Metadata indexer f72d095f4252

Refactor metadata mappings using rdflib.Graph instead of JSON-LD internally
f72d095f4252
Actions

Tags

None

Subscribers

None

Description

Refactor metadata mappings using rdflib.Graph instead of JSON-LD internally

Motivation:

It makes it easier to visualize what is actually happening when modifying the graph, by working explicitly on triples instead of a JSON-LD (a tree serialization of the graph).

Remove the need for the hacky merge_values() function (and possibly merge_documents() in a future commit)

It also catches malformed data exactly where it is added in the document (the call to rdflib.Graph.add()) instead of at the end of the mapping when running compaction/expansion.

Downsides:

Tests are clunkier, because they relied on deterministic order of unordered lists; but rdflib does not guarantee it

Code is longer

Extra dependency (which we will need at some point if we want to import from RDF datasets, anyway)

Details

Provenance

vlorentz	Authored on Aug 22 2022, 2:20 PM
vlorentz	Pushed on Aug 23 2022, 11:28 AM

Differential Revision

D8279: Refactor metadata mappings using rdflib.Graph instead of JSON-LD internally

Parents

rDCIDX97f5fdcdcc3a: Remove 'keywords' from test files

Branches

Unknown

Tags

Unknown

Build Status

Buildable 30983
Build 48461: test-and-build	Jenkins console · Jenkins

Event Timeline

vlorentz committed rDCIDXf72d095f4252: Refactor metadata mappings using rdflib.Graph instead of JSON-LD internally (authored by vlorentz).Aug 23 2022, 10:50 AM

vlorentz added an edge: D8279: Refactor metadata mappings using rdflib.Graph instead of JSON-LD internally.Aug 23 2022, 11:28 AM

Harbormaster completed building B30983: rDCIDXf72d095f4252: Refactor metadata mappings using rdflib.Graph instead of JSON-LD internally.Aug 23 2022, 11:36 AM

Changes (26)

Path

Size

docs/

metadata-workflow.rst

requirements.txt

swh/

indexer/

metadata_dictionary/

tests/

metadata_dictionary/

test_composer.py

test_codemeta.py

rDCIDXf72d095f4252

docs/metadata-workflow.rst

Loading...

mypy.ini

Loading...

requirements.txt

Loading...

swh/indexer/codemeta.py

Loading...

swh/indexer/metadata_dictionary/base.py

Loading...

swh/indexer/metadata_dictionary/cff.py

Loading...

swh/indexer/metadata_dictionary/composer.py

Loading...

swh/indexer/metadata_dictionary/dart.py

Loading...

swh/indexer/metadata_dictionary/github.py

Loading...

swh/indexer/metadata_dictionary/maven.py

Loading...

swh/indexer/metadata_dictionary/npm.py

Loading...

swh/indexer/metadata_dictionary/nuget.py

Loading...

swh/indexer/metadata_dictionary/python.py

Loading...

swh/indexer/metadata_dictionary/ruby.py

Loading...

swh/indexer/metadata_dictionary/utils.py

Loading...

swh/indexer/namespaces.py

Loading...

swh/indexer/tests/metadata_dictionary/test_cff.py

Loading...

swh/indexer/tests/metadata_dictionary/test_composer.py

Loading...

swh/indexer/tests/metadata_dictionary/test_dart.py

Loading...

swh/indexer/tests/metadata_dictionary/test_github.py

Loading...

swh/indexer/tests/metadata_dictionary/test_maven.py

Loading...

swh/indexer/tests/metadata_dictionary/test_npm.py

Loading...

swh/indexer/tests/metadata_dictionary/test_nuget.py

Loading...

swh/indexer/tests/metadata_dictionary/test_python.py

Loading...

swh/indexer/tests/metadata_dictionary/test_ruby.py

Loading...

swh/indexer/tests/test_codemeta.py

Loading...