Page MenuHomeSoftware Heritage

Add Gitea metadata mapping
ClosedPublic

Authored by vlorentz on Sep 13 2022, 1:31 PM.

Details

Summary

Resolves T4457.

Depends on D8459

Diff Detail

Repository
rDCIDX Metadata indexer
Branch
master
Lint
No Linters Available
Unit
No Unit Test Coverage
Build Status
Buildable 31495
Build 49268: Phabricator diff pipeline on jenkinsJenkins console · Jenkins
Build 49267: arc lint + arc unit

Event Timeline

Build is green

Patch application report for D8460 (id=30488)

Could not rebase; Attempt merge onto e25a2f4e4a...

Merge made by the 'recursive' strategy.
 swh/indexer/data/Gitea.csv                         |  76 +++++++++++
 swh/indexer/metadata_dictionary/__init__.py        |  15 ++-
 swh/indexer/metadata_dictionary/base.py            |  14 +-
 swh/indexer/metadata_dictionary/cff.py             |   5 +-
 swh/indexer/metadata_dictionary/gitea.py           | 116 +++++++++++++++++
 swh/indexer/metadata_dictionary/github.py          |  10 +-
 .../tests/metadata_dictionary/test_gitea.py        | 142 +++++++++++++++++++++
 .../tests/metadata_dictionary/test_github.py       |   7 +-
 swh/indexer/tests/test_cli.py                      |   1 +
 9 files changed, 374 insertions(+), 12 deletions(-)
 create mode 100644 swh/indexer/data/Gitea.csv
 create mode 100644 swh/indexer/metadata_dictionary/gitea.py
 create mode 100644 swh/indexer/tests/metadata_dictionary/test_gitea.py
Changes applied before test
commit b23f52b6ebdbc7d88c02acfa4230b5c24ef4de5c
Merge: e25a2f4 9f6b75c
Author: Jenkins user <jenkins@localhost>
Date:   Tue Sep 13 11:31:51 2022 +0000

    Merge branch 'diff-target' into HEAD

commit 9f6b75cad02745311f3d29a564b3db2d5b756af7
Author: Valentin Lorentz <vlorentz@softwareheritage.org>
Date:   Tue Sep 13 13:30:54 2022 +0200

    Add Gitea metadata mapping

commit 3a3a348bd86e714ab016a93617bc197010ee145d
Author: Valentin Lorentz <vlorentz@softwareheritage.org>
Date:   Tue Sep 13 12:34:22 2022 +0200

    GitHub: use correct JSON-LD types for URLs and dates

See https://jenkins.softwareheritage.org/job/DCIDX/job/tests-on-diff/492/ for more details.

olasd added a subscriber: olasd.

At a glance, this seems okay.

Out of curiosity, do we have a plan to pull information from other API endpoints, e.g. the repository's topics (aka keywords): https://try.gitea.io/api/swagger#/repository/repoListTopics ?

This revision is now accepted and ready to land.Sep 27 2022, 4:55 PM

Build is green

Patch application report for D8460 (id=30857)

Could not rebase; Attempt merge onto e25a2f4e4a...

Updating e25a2f4..cb435e5
Fast-forward
 swh/indexer/data/Gitea.csv                         |  76 +++++++++++
 swh/indexer/metadata_dictionary/__init__.py        |  15 ++-
 swh/indexer/metadata_dictionary/base.py            |  14 +-
 swh/indexer/metadata_dictionary/cff.py             |   5 +-
 swh/indexer/metadata_dictionary/gitea.py           | 116 +++++++++++++++++
 swh/indexer/metadata_dictionary/github.py          |  10 +-
 .../tests/metadata_dictionary/test_gitea.py        | 142 +++++++++++++++++++++
 .../tests/metadata_dictionary/test_github.py       |   7 +-
 swh/indexer/tests/test_cli.py                      |   1 +
 9 files changed, 374 insertions(+), 12 deletions(-)
 create mode 100644 swh/indexer/data/Gitea.csv
 create mode 100644 swh/indexer/metadata_dictionary/gitea.py
 create mode 100644 swh/indexer/tests/metadata_dictionary/test_gitea.py
Changes applied before test
commit cb435e59ca91ac7b71cff18e5e6b3885e5be9ac1
Author: Valentin Lorentz <vlorentz@softwareheritage.org>
Date:   Tue Sep 13 13:30:54 2022 +0200

    Add Gitea metadata mapping

commit 20becf4a90fa6b626e972bba3d57db46604cb7b2
Author: Valentin Lorentz <vlorentz@softwareheritage.org>
Date:   Tue Sep 13 12:34:22 2022 +0200

    GitHub: use correct JSON-LD types for URLs and dates

See https://jenkins.softwareheritage.org/job/DCIDX/job/tests-on-diff/506/ for more details.

This revision was automatically updated to reflect the committed changes.