Page MenuHomeSoftware Heritage

Intrinsic metadataFolder
ActivePublic

Watchers

  • This project does not have any watchers.
  • View All

Recent Activity

Oct 19 2021

vsellier renamed T3671: staging - swh-search (metadata indexer) is unable to update a document due to an unparseable date from staging - swh-search unable to update a document due to an unparseable date to staging - swh-search (metadata indexer) is unable to update a document due to an unparseable date.
Oct 19 2021, 11:03 AM · Intrinsic metadata, Archive search
vsellier updated the task description for T3671: staging - swh-search (metadata indexer) is unable to update a document due to an unparseable date.
Oct 19 2021, 10:54 AM · Intrinsic metadata, Archive search
vsellier triaged T3671: staging - swh-search (metadata indexer) is unable to update a document due to an unparseable date as Normal priority.
Oct 19 2021, 10:48 AM · Intrinsic metadata, Archive search

Sep 6 2021

vlorentz removed a project from T3559: Enable the swh-search QL in staging: meta-task.
Sep 6 2021, 10:37 AM · Archive search, System administration, Intrinsic metadata, Extrinsic metadata
vlorentz removed a project from T3558: Enable the swh-search QL in production: meta-task.
Sep 6 2021, 10:37 AM · Archive search, System administration, Intrinsic metadata, Extrinsic metadata
vlorentz added a project to T3558: Enable the swh-search QL in production: Archive search.
Sep 6 2021, 10:36 AM · Archive search, System administration, Intrinsic metadata, Extrinsic metadata
vlorentz triaged T3559: Enable the swh-search QL in staging as Normal priority.
Sep 6 2021, 10:36 AM · Archive search, System administration, Intrinsic metadata, Extrinsic metadata
vlorentz added a project to T3558: Enable the swh-search QL in production: System administration.
Sep 6 2021, 10:36 AM · Archive search, System administration, Intrinsic metadata, Extrinsic metadata
vlorentz triaged T3558: Enable the swh-search QL in production as Normal priority.
Sep 6 2021, 10:36 AM · Archive search, System administration, Intrinsic metadata, Extrinsic metadata

Aug 27 2021

moranegg added a parent task for T3516: Implement API entry point for (intrinsic) citation metadata: T3494: Implement citation button for directories with codemeta or CFF.
Aug 27 2021, 4:41 PM · Intrinsic metadata
moranegg added a subtask for T3494: Implement citation button for directories with codemeta or CFF: T3516: Implement API entry point for (intrinsic) citation metadata.
Aug 27 2021, 4:41 PM · Intrinsic metadata, Web app
moranegg triaged T3516: Implement API entry point for (intrinsic) citation metadata as Normal priority.
Aug 27 2021, 4:41 PM · Intrinsic metadata

Aug 20 2021

vlorentz added projects to T3494: Implement citation button for directories with codemeta or CFF: Web app, Intrinsic metadata.
Aug 20 2021, 4:39 PM · Intrinsic metadata, Web app

Aug 18 2021

moranegg added a comment to T3078: Index CITATION.cff files.

@sdruskat Good news about the GitHub support.

Aug 18 2021, 5:08 PM · Intrinsic metadata, Easy hack

Jul 29 2021

sdruskat added a comment to T3078: Index CITATION.cff files.

Just to say I'm sorry I haven't made any progress here lately. Will get to it once CFF 1.2.0 is out the door. Has become a priority since GitHub and Zenodo now support it :).

Jul 29 2021, 10:53 AM · Intrinsic metadata, Easy hack

Jul 15 2021

vlorentz closed T2938: Create API endpoint to access raw_extrinsic_metadata, a subtask of T3097: Expose metadata in the WebApp and make it searchable, as Resolved.
Jul 15 2021, 12:18 PM · Intrinsic metadata, Extrinsic metadata, Roadmap 2021, meta-task

Jun 29 2021

vlorentz changed the status of T3097: Expose metadata in the WebApp and make it searchable from Open to Work in Progress.
Jun 29 2021, 10:53 AM · Intrinsic metadata, Extrinsic metadata, Roadmap 2021, meta-task

Apr 29 2021

vlorentz added a parent task for T2270: Add to intrinsic metadata files to be indexed: AUTHORS and CONTRIBUTORS: T3230: Add various markdown variants to list of intrinsic metadata files to be indexed .
Apr 29 2021, 12:26 PM · Intrinsic metadata, Easy hack, Indexer
vlorentz added a subtask for T3230: Add various markdown variants to list of intrinsic metadata files to be indexed : T2270: Add to intrinsic metadata files to be indexed: AUTHORS and CONTRIBUTORS.
Apr 29 2021, 12:26 PM · Intrinsic metadata, Easy hack, Indexer

Apr 9 2021

rdicosmo raised the priority of T3230: Add various markdown variants to list of intrinsic metadata files to be indexed from Low to Normal.
Apr 9 2021, 4:45 PM · Intrinsic metadata, Easy hack, Indexer
vlorentz triaged T3230: Add various markdown variants to list of intrinsic metadata files to be indexed as Low priority.
Apr 9 2021, 4:13 PM · Intrinsic metadata, Easy hack, Indexer
rdicosmo updated the task description for T3230: Add various markdown variants to list of intrinsic metadata files to be indexed .
Apr 9 2021, 1:33 PM · Intrinsic metadata, Easy hack, Indexer
rdicosmo created T3230: Add various markdown variants to list of intrinsic metadata files to be indexed .
Apr 9 2021, 1:32 PM · Intrinsic metadata, Easy hack, Indexer

Apr 8 2021

sdruskat added a comment to T3078: Index CITATION.cff files.

Hey @sdruskat, I have a few questions : (1) Any idea of when will be the new version available and (2) when will be the crosswalk file updated to at least 1.1.0? (3) The newer version will be backwards compatible, right?

Thanks

Apr 8 2021, 4:18 PM · Intrinsic metadata, Easy hack

Apr 7 2021

moranegg added a comment to T3078: Index CITATION.cff files.

Hey @KShivendu,
These are very good observation.

Apr 7 2021, 11:41 AM · Intrinsic metadata, Easy hack

Apr 4 2021

KShivendu added a comment to T3078: Index CITATION.cff files.

Hey @moranegg
I suggest the following modifications in adding-support-for-additional-metadata page:

  • python3 -m swh.indexer.metadata_dictionary MyMapping path/to/input/file doesn't work. Replace with swh indexer mapping translate cff path/to/input/file
  • Whenever adding new mappings, it has to be mentioned in the MAPPING_NAME variable of the swh/indexer/storage/__init__.py file which isn't mentioned in the documentation. (though it didn't throw error while testing or parsing) and in expected_output of test_cli_mapping_list
  • Mentioning a few examples about fields that are not string_fields
  • Elaborating how _traslate_dict function works. For example : it executes functions starting with 'normalize_'
  • Mention command : swh indexer mapping list-terms to display Supported CodeMeta terms
  • Add Youtube videos about JSONLD : JSON-LD Basics, JSON-LD: Core markup, Compaction and Expansion
Apr 4 2021, 11:16 AM · Intrinsic metadata, Easy hack

Apr 2 2021

vlorentz claimed T3097: Expose metadata in the WebApp and make it searchable.
Apr 2 2021, 10:11 AM · Intrinsic metadata, Extrinsic metadata, Roadmap 2021, meta-task

Mar 30 2021

KShivendu added a comment to T3078: Index CITATION.cff files.

Hey @sdruskat, I have a few questions : (1) Any idea of when will be the new version available and (2) when will be the crosswalk file updated to at least 1.1.0? (3) The newer version will be backwards compatible, right?

Mar 30 2021, 4:31 PM · Intrinsic metadata, Easy hack

Mar 27 2021

sdruskat added a comment to T3078: Index CITATION.cff files.

Hey @KShivendu, thanks, sounds good. Let me know if you have any questions.

Mar 27 2021, 5:05 PM · Intrinsic metadata, Easy hack
KShivendu added a comment to T3078: Index CITATION.cff files.

Hey @sdruskat, I noticed that the current crosswalk.csv has CFF version 1.0.2. And I agree on keeping the crosswalk.csv file updated as much as possible. But to the best of my knowledge, updating the crosswalk.csv file later (when you are done with creating the new version) won't break anything here. Plus, I am new here and I am learning about metadata which is helping me write a better GSoC proposal.
Also, I can assure you that I will update the crosswalk.csv file myself if you want :)

Mar 27 2021, 2:55 PM · Intrinsic metadata, Easy hack

Mar 26 2021

sdruskat added a comment to T3078: Index CITATION.cff files.

Hello again, seeing that you had asked about the crosswalk @KShivendu, and that we're currently in the process of creating a new CFF version, may I suggest that we put this on hold until we have at least updated the crosswalk to the currently latest version of CFF, 1.1.0?

Mar 26 2021, 12:25 PM · Intrinsic metadata, Easy hack
KShivendu added a comment to T3078: Index CITATION.cff files.
Mar 26 2021, 11:59 AM · Intrinsic metadata, Easy hack

Mar 23 2021

moranegg edited projects for T3078: Index CITATION.cff files, added: Intrinsic metadata; removed Metadata workflow.
Mar 23 2021, 5:33 PM · Intrinsic metadata, Easy hack
moranegg edited projects for T2203: Intrinsic metadata, added: Intrinsic metadata; removed Metadata workflow.
Mar 23 2021, 5:32 PM · Intrinsic metadata, Roadmap 2020
moranegg edited projects for T2472: Indexing intrinsic metadata in a deposit using a sub-folder for the content, added: Intrinsic metadata; removed Metadata workflow.
Mar 23 2021, 5:32 PM · Intrinsic metadata, Indexer, SWORD deposit

Mar 8 2021

vlorentz triaged T3097: Expose metadata in the WebApp and make it searchable as Normal priority.
Mar 8 2021, 11:41 AM · Intrinsic metadata, Extrinsic metadata, Roadmap 2021, meta-task
rdicosmo updated the task description for T3097: Expose metadata in the WebApp and make it searchable.
Mar 8 2021, 10:44 AM · Intrinsic metadata, Extrinsic metadata, Roadmap 2021, meta-task
rdicosmo added a parent task for T2270: Add to intrinsic metadata files to be indexed: AUTHORS and CONTRIBUTORS: T2064: Add metadata from deposits to metadata search.
Mar 8 2021, 10:34 AM · Intrinsic metadata, Easy hack, Indexer
rdicosmo added subtasks for T3097: Expose metadata in the WebApp and make it searchable: T2073: Index extrinsic metadata from the journal in swh-search/Elasticsearch, T2938: Create API endpoint to access raw_extrinsic_metadata, T2088: Specify and draw metadata view on web-app, T2191: Metadata Views.
Mar 8 2021, 10:33 AM · Intrinsic metadata, Extrinsic metadata, Roadmap 2021, meta-task
rdicosmo created T3097: Expose metadata in the WebApp and make it searchable.
Mar 8 2021, 10:31 AM · Intrinsic metadata, Extrinsic metadata, Roadmap 2021, meta-task

Feb 12 2021

moranegg edited projects for T2270: Add to intrinsic metadata files to be indexed: AUTHORS and CONTRIBUTORS, added: Intrinsic metadata; removed Metadata workflow.
Feb 12 2021, 3:44 PM · Intrinsic metadata, Easy hack, Indexer

Dec 22 2020

vlorentz closed T2876: metadata indexation : ES' dynamic mapping creation fails for field values that are of varying types as Resolved.
Dec 22 2020, 10:52 AM · Intrinsic metadata, Indexer, Archive search

Dec 21 2020

vlorentz added a parent task for T2876: metadata indexation : ES' dynamic mapping creation fails for field values that are of varying types: T2073: Index extrinsic metadata from the journal in swh-search/Elasticsearch.
Dec 21 2020, 12:23 PM · Intrinsic metadata, Indexer, Archive search

Dec 14 2020

vlorentz reopened T2876: metadata indexation : ES' dynamic mapping creation fails for field values that are of varying types as "Open".
Dec 14 2020, 10:56 AM · Intrinsic metadata, Indexer, Archive search
vlorentz closed T2876: metadata indexation : ES' dynamic mapping creation fails for field values that are of varying types as Resolved.
Dec 14 2020, 10:54 AM · Intrinsic metadata, Indexer, Archive search

Dec 11 2020

vlorentz added a comment to T2876: metadata indexation : ES' dynamic mapping creation fails for field values that are of varying types.

I suspect P905 might be the same issue. pyld tends to check if it's a list/array/string, and if it's not it assumes it's a number

Dec 11 2020, 7:00 PM · Intrinsic metadata, Indexer, Archive search
ardumont added a comment to T2876: metadata indexation : ES' dynamic mapping creation fails for field values that are of varying types.

no idea, it's coming straight from the journal.
Note that it is not the only errors (P905 demonstrates another error).

Dec 11 2020, 6:09 PM · Intrinsic metadata, Indexer, Archive search
vlorentz added a comment to T2876: metadata indexation : ES' dynamic mapping creation fails for field values that are of varying types.

why is there a tuple in the input? it should be a list

Dec 11 2020, 6:06 PM · Intrinsic metadata, Indexer, Archive search
ardumont added a comment to T2876: metadata indexation : ES' dynamic mapping creation fails for field values that are of varying types.

We tested with the latest search 0.3.3 but that failed [1]

Dec 11 2020, 5:46 PM · Intrinsic metadata, Indexer, Archive search
vlorentz added a revision to T2876: metadata indexation : ES' dynamic mapping creation fails for field values that are of varying types: D4722: Normalize Codemeta documents by expanding them..
Dec 11 2020, 1:44 PM · Intrinsic metadata, Indexer, Archive search