Page MenuHomeSoftware Heritage

Search tools on metadata (meta task)
Open, NormalPublic

Description

For now, metadata are only used for fulltext/keyword search. There is a lot more we can do. This meta-task will track ideas and progress of their design.

Ideas:

  • Better full-text search (for now, it's keyword-based)
  • Search per field
    • Single field
    • Combining fields
  • Filter results to a subset of metadata file types ("mappings")
  • Stats on frequency of values for each field (e.g., license, ...)
    • on all metadata
    • on a subset of metadata (filtered by a search query)
  • Metadata-only search (right now it's always in OR with URL-based search)

Event Timeline

vlorentz created this task.Feb 8 2019, 1:09 PM
vlorentz triaged this task as Normal priority.
vlorentz added a project: Restricted Project.Feb 11 2019, 11:40 AM
vlorentz claimed this task.
zack updated the task description. (Show Details)Feb 23 2019, 2:13 PM

I've added an item to the above list (metadata-only search); I think the ideal UI would be a single form with two checkboxes under it, one enabling URL-based search (enabled by default), one enabling metadata-based search (disabled by default).

Relatedly, I don't understand what the "stats" items do in this list, they don't seem to be related to metadata-based *search*.

right now it's always in OR with URL-based search

Are you sure? Looking at the code, it seems to me it searches either in URL or in metadata, but not both at the same time.

zack added a comment.Feb 25 2019, 9:57 AM

That's the impression i got from testing. Either way, the current UI & semantics are bad, the proposed ones would be much better.

That's the impression i got from testing.

The URL is often part of the metadata, in the codeRepository key.

Relatedly, I don't understand what the "stats" items do in this list, they don't seem to be related to metadata-based *search*.

The second item ("on a subset of metadata") involves search