Jul 15 2021

KShivendu updated the diff for D5990: query_language: Setup tree-sitter and grammar.js.
  • Polish the code
Jul 15 2021, 2:59 PM
KShivendu added inline comments to D5990: query_language: Setup tree-sitter and grammar.js.
Jul 15 2021, 2:56 PM
KShivendu updated the diff for D5990: query_language: Setup tree-sitter and grammar.js.
  • Install tree-sitter-cli (NodeJS) during builds
Jul 15 2021, 12:38 PM
KShivendu updated the diff for D5990: query_language: Setup tree-sitter and grammar.js.
  • Generate parser before building swh_ql.so
Jul 15 2021, 12:34 PM
KShivendu updated the diff for D5990: query_language: Setup tree-sitter and grammar.js.
  • Fix installation/build errors
Jul 15 2021, 12:24 PM
KShivendu updated the diff for D5990: query_language: Setup tree-sitter and grammar.js.
  • Generate swh_ql.so at builds
Jul 15 2021, 9:47 AM
KShivendu updated the diff for D5990: query_language: Setup tree-sitter and grammar.js.
  • Fix failing build (because of data_files)
Jul 15 2021, 9:16 AM

Jul 14 2021

KShivendu updated the diff for D5990: query_language: Setup tree-sitter and grammar.js.
  • Move parser to search_language dir
  • Introduce Makefile.local and add TreeSitter related commands
  • Set data_files of setup.py to 'generated/search_ql.so'
Jul 14 2021, 9:57 AM

Jul 13 2021

KShivendu updated the diff for D5990: query_language: Setup tree-sitter and grammar.js.
  • Add newline at the end of files
Jul 13 2021, 2:46 PM
KShivendu updated the summary of D5990: query_language: Setup tree-sitter and grammar.js.
Jul 13 2021, 1:11 PM
KShivendu requested review of D5990: query_language: Setup tree-sitter and grammar.js.
Jul 13 2021, 1:04 PM
KShivendu closed D5964: origin_search: Filters and sorting for date_{created,modified,published}.
Jul 13 2021, 12:08 PM
KShivendu committed rDSEAfe7640f71024: origin_search: Filters and sorting for date_{created,modified,published} (authored by KShivendu).
origin_search: Filters and sorting for date_{created,modified,published}
Jul 13 2021, 12:08 PM
KShivendu updated the diff for D5964: origin_search: Filters and sorting for date_{created,modified,published}.
  • Add test for sort_by : ["date_created"]
  • Deduplicate calculation of some variables in _get_sorting_key
  • Use iso8601 library to validate date format in intrinsic_metadata fields
Jul 13 2021, 11:51 AM
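
The date-format validation mentioned in the D5964 update above can be illustrated with a minimal sketch. This is not the code from the diff: it swaps in the standard library's `datetime.fromisoformat` for the iso8601 library (which accepts a wider range of formats), and the field names in `DATE_FIELDS` are hypothetical, modelled on the diff title.

```python
from datetime import datetime
from typing import Any, Dict

# Hypothetical field names, modelled on date_{created,modified,published}.
DATE_FIELDS = ("date_created", "date_modified", "date_published")


def is_valid_iso_date(value: Any) -> bool:
    """Return True if value is a string that datetime.fromisoformat can parse."""
    if not isinstance(value, str):
        return False
    try:
        datetime.fromisoformat(value)
    except ValueError:
        return False
    return True


def drop_unparsable_dates(metadata: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of metadata without date fields that fail validation."""
    return {
        key: value
        for key, value in metadata.items()
        if key not in DATE_FIELDS or is_valid_iso_date(value)
    }
```

Rejecting unparsable values before storing them keeps a single malformed date from failing the whole document at indexing time; non-date fields pass through untouched.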

Jul 12 2021

KShivendu added inline comments to D5964: origin_search: Filters and sorting for date_{created,modified,published}.
Jul 12 2021, 2:04 PM
KShivendu added a comment to P1090 swl_ql.

string_content and escape_sequence have been adapted from the JSON tree-sitter grammar.

Jul 12 2021, 8:13 AM
KShivendu edited P1090 swl_ql.
Jul 12 2021, 8:08 AM
KShivendu edited P1090 swl_ql.
Jul 12 2021, 8:06 AM

Jul 9 2021

KShivendu created P1092 py-tree-sitter/swh_ql.
Jul 9 2021, 11:19 AM
KShivendu added a comment to P1091 swh_ql.grammar.js.

For input

url : "github.com/django/Django" metadata : something qewq
with_visit : true with_visit : false
nb_visits >= 0  nb_visits = 10 nb_visits != 256 nb_visits < 1000 
sort_by : ["nb_visits", "last_revision_date", last_release_date]
last_release_date < 2001-02-13 15:54:21
licenses in ["MIT","BSD X","Apache"]
Jul 9 2021, 9:33 AM
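
To make the shape of the numeric filters in the sample input above concrete, here is a rough sketch that extracts `field operator integer` triples with a regular expression. It is only a stand-in for illustration: the grammar in P1091 is written for tree-sitter, and this regex covers just the four operators that appear in the sample (`>=`, `=`, `!=`, `<`). The function name `parse_numeric_filters` is hypothetical.

```python
import re
from typing import List, Tuple

# Matches e.g. "nb_visits >= 0"; two-character operators come first in the
# alternation so that ">=" is not read as a different, shorter operator.
NUMERIC_FILTER_RE = re.compile(
    r"(?P<field>[a-z_]+)\s*(?P<op>>=|!=|=|<)\s*(?P<value>\d+)"
)


def parse_numeric_filters(query: str) -> List[Tuple[str, str, int]]:
    """Return (field, operator, value) triples found in the query string."""
    return [
        (match["field"], match["op"], int(match["value"]))
        for match in NUMERIC_FILTER_RE.finditer(query)
    ]


filters = parse_numeric_filters(
    "nb_visits >= 0  nb_visits = 10 nb_visits != 256 nb_visits < 1000 "
)
```

A regex like this cannot handle quoted strings, lists, or dates, which is one reason a real grammar is the better fit for the full query language.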
KShivendu created P1091 swh_ql.grammar.js.
Jul 9 2021, 9:31 AM
KShivendu added a comment to P1090 swl_ql.
  • I highly recommend generating and visualising the corresponding railroad diagram with https://www.bottlecaps.de/rr/ui
  • Check out P1091 for the tree-sitter implementation and some example queries.
Jul 9 2021, 8:35 AM
KShivendu created P1090 swl_ql.
Jul 9 2021, 8:33 AM

Jul 7 2021

KShivendu updated the diff for D5964: origin_search: Filters and sorting for date_{created,modified,published}.
  • origin_update: Document rejection of metadata date fields if not parsable
Jul 7 2021, 1:28 PM

Jul 6 2021

KShivendu updated the diff for D5964: origin_search: Filters and sorting for date_{created,modified,published}.
  • elasticsearch.py: Use "lenient: true"
  • origin_search: Validate intrinsic_metadata date field format before storing
  • test_search: Fix failing tests
Jul 6 2021, 11:46 AM

Jul 5 2021

KShivendu added a comment to D5964: origin_search: Filters and sorting for date_{created,modified,published}.

Can you either add tests, or deduplicate this code so we don't need to test every field?

Jul 5 2021, 6:38 PM
KShivendu created P1089 Elasticsearch date errro.
Jul 5 2021, 6:26 PM
KShivendu updated the diff for D5964: origin_search: Filters and sorting for date_{created,modified,published}.
  • Move get_expansion to utils.py
  • Add tests for filters as well as sorting options
  • Polish existing code
Jul 5 2021, 6:22 PM
KShivendu closed D5963: origin_search: Add keyword search for instrinsic_metadata keywords/description.
Jul 5 2021, 12:41 PM
KShivendu committed rDSEAf378a989e972: origin_search: Add keyword search for instrinsic_metadata keywords/description (authored by KShivendu).
origin_search: Add keyword search for instrinsic_metadata keywords/description
Jul 5 2021, 12:41 PM
KShivendu updated the summary of D5963: origin_search: Add keyword search for instrinsic_metadata keywords/description.
Jul 5 2021, 12:32 PM
KShivendu updated the diff for D5963: origin_search: Add keyword search for instrinsic_metadata keywords/description.

Add commit body

Jul 5 2021, 12:32 PM
KShivendu updated the diff for D5963: origin_search: Add keyword search for instrinsic_metadata keywords/description.
  • Squash
  • Minor polishes
Jul 5 2021, 12:25 PM

Jul 2 2021

KShivendu added inline comments to D5963: origin_search: Add keyword search for instrinsic_metadata keywords/description.
Jul 2 2021, 6:21 PM
KShivendu updated the diff for D5963: origin_search: Add keyword search for instrinsic_metadata keywords/description.
  • origin_search: Polish the code with get_expansion and other methods
Jul 2 2021, 6:17 PM
KShivendu created P1085 Find origins with description in instrinsic_metadata.
Jul 2 2021, 5:29 PM
KShivendu requested review of D5964: origin_search: Filters and sorting for date_{created,modified,published}.
Jul 2 2021, 5:21 PM
KShivendu requested review of D5963: origin_search: Add keyword search for instrinsic_metadata keywords/description.
Jul 2 2021, 4:06 PM
KShivendu retitled D5949: origin_search: Filter for instrinsic_metadata language and license from journal_client: Store language and license from instrinsic_metadata to origin_search: Filter for instrinsic_metadata language and license.
Jul 2 2021, 10:31 AM
KShivendu closed D5949: origin_search: Filter for instrinsic_metadata language and license.
Jul 2 2021, 10:31 AM
KShivendu committed rDSEA2e1fb863871c: origin_search: Filter for instrinsic_metadata language and license (authored by KShivendu).
origin_search: Filter for instrinsic_metadata language and license
Jul 2 2021, 10:31 AM
KShivendu updated the diff for D5949: origin_search: Filter for instrinsic_metadata language and license.
  • Fix failing doctest
  • Add commit description
Jul 2 2021, 9:58 AM
KShivendu updated the diff for D5949: origin_search: Filter for instrinsic_metadata language and license.
  • Improve doctest for _nested_get
  • Squash
Jul 2 2021, 9:24 AM

Jul 1 2021

KShivendu added inline comments to D5949: origin_search: Filter for instrinsic_metadata language and license.
Jul 1 2021, 8:18 PM
KShivendu updated the diff for D5949: origin_search: Filter for instrinsic_metadata language and license.
  • in_memory: Allow list of licenses and programmingLanguages
Jul 1 2021, 8:13 PM
KShivendu updated the diff for D5949: origin_search: Filter for instrinsic_metadata language and license.
  • test_in_memory: Add test for _nested_get
Jul 1 2021, 1:47 PM
KShivendu added inline comments to D5949: origin_search: Filter for instrinsic_metadata language and license.
Jul 1 2021, 12:53 PM

Jun 30 2021

KShivendu updated the diff for D5949: origin_search: Filter for instrinsic_metadata language and license.
  • in_memory: Use expanded intrinsic_metadata
Jun 30 2021, 11:09 PM
KShivendu updated the diff for D5949: origin_search: Filter for instrinsic_metadata language and license.
  • test_search: Test for search on multiple intrinsic_metadata fields
Jun 30 2021, 9:02 PM
KShivendu added inline comments to D5949: origin_search: Filter for instrinsic_metadata language and license.
Jun 30 2021, 4:28 PM
KShivendu updated the diff for D5949: origin_search: Filter for instrinsic_metadata language and license.
  • Use analyzer on the list of licenses and languages
Jun 30 2021, 4:20 PM
KShivendu added inline comments to D5949: origin_search: Filter for instrinsic_metadata language and license.
Jun 30 2021, 3:17 PM
KShivendu added inline comments to D5949: origin_search: Filter for instrinsic_metadata language and license.
Jun 30 2021, 3:16 PM
KShivendu updated the diff for D5949: origin_search: Filter for instrinsic_metadata language and license.
  • origin_search: Allow search for multiple licenses and languages at once
  • test_search: Separate tests for programming_language and license
Jun 30 2021, 2:58 PM
KShivendu updated the diff for D5949: origin_search: Filter for instrinsic_metadata language and license.
  • origin_search: Expose language and license from intrinsic_metadata
  • test_search: Add test for language and license
Jun 30 2021, 12:35 PM
KShivendu added a comment to D5949: origin_search: Filter for instrinsic_metadata language and license.

If the only issue is the slowdown, can we keep them nested for now, and benchmark later to see if un-nesting is worth it?

Jun 30 2021, 10:35 AM
KShivendu added a comment to D5949: origin_search: Filter for instrinsic_metadata language and license.

@vlorentz, just in case you missed it, the values don't get duplicated. I'm popping them out of intrinsic_metadata.
I was trying to avoid nested documents, as I've read in many places that they slow down searches (not sure by how much).

Jun 30 2021, 7:13 AM
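
The "popping out" approach described in the comment above can be sketched as follows. This is an illustration under stated assumptions, not the code from D5949: the key names in `POPPED_KEYS` and the function name `split_metadata` are hypothetical.

```python
from typing import Any, Dict, Tuple

# Hypothetical key names, chosen for illustration only.
POPPED_KEYS = ("license", "programmingLanguage")


def split_metadata(
    intrinsic_metadata: Dict[str, Any],
) -> Tuple[Dict[str, Any], Dict[str, Any]]:
    """Move selected keys out of the metadata so no value is stored twice.

    Returns (popped, remaining); the input dict is left unmodified.
    """
    remaining = dict(intrinsic_metadata)  # shallow copy
    popped = {key: remaining.pop(key) for key in POPPED_KEYS if key in remaining}
    return popped, remaining


popped, remaining = split_metadata(
    {"license": "MIT", "programmingLanguage": "Python", "name": "swh"}
)
```

Because each key ends up in exactly one of the two dictionaries, the popped values can be indexed as top-level fields without duplicating them inside the remaining metadata.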

Jun 29 2021

KShivendu added inline comments to D5949: origin_search: Filter for instrinsic_metadata language and license.
Jun 29 2021, 8:07 PM
KShivendu requested review of D5949: origin_search: Filter for instrinsic_metadata language and license.
Jun 29 2021, 8:03 PM

Jun 28 2021

KShivendu closed D5918: origin_search: Add sort_by feature.
Jun 28 2021, 1:16 PM
KShivendu committed rDSEA6b1d563d42e9: origin_search: Add sort_by feature (authored by KShivendu).
origin_search: Add sort_by feature
Jun 28 2021, 1:16 PM
KShivendu updated the diff for D5918: origin_search: Add sort_by feature.

Rebase

Jun 28 2021, 12:56 PM

Jun 26 2021

KShivendu added inline comments to D5918: origin_search: Add sort_by feature.
Jun 26 2021, 8:17 AM
KShivendu updated the diff for D5918: origin_search: Add sort_by feature.
  • origin_search: Polish code related to sort_by
Jun 26 2021, 8:11 AM

Jun 25 2021

KShivendu added inline comments to D5918: origin_search: Add sort_by feature.
Jun 25 2021, 6:27 PM
KShivendu updated the diff for D5918: origin_search: Add sort_by feature.
  • origin_search: Allow sorting with multiple fields
Jun 25 2021, 6:18 PM
KShivendu updated the diff for D5918: origin_search: Add sort_by feature.
  • interface: Maintain SORT_BY_OPTIONS list
Jun 25 2021, 12:02 PM
KShivendu added inline comments to D5918: origin_search: Add sort_by feature.
Jun 25 2021, 11:39 AM
KShivendu updated the diff for D5918: origin_search: Add sort_by feature.
  • test_search: Improve tests for sort_by
Jun 25 2021, 8:10 AM
KShivendu added inline comments to D5918: origin_search: Add sort_by feature.
Jun 25 2021, 8:00 AM
KShivendu requested review of D5918: origin_search: Add sort_by feature.
Jun 25 2021, 7:48 AM

Jun 23 2021

KShivendu closed D5915: elasticsearch.py: Improve readability of painless script.
Jun 23 2021, 6:01 PM
KShivendu committed rDSEA3e153035287b: elasticsearch.py: Improve readability of painless script (authored by KShivendu).
elasticsearch.py: Improve readability of painless script
Jun 23 2021, 6:01 PM
KShivendu updated the summary of D5915: elasticsearch.py: Improve readability of painless script.
Jun 23 2021, 5:15 PM
KShivendu updated the diff for D5915: elasticsearch.py: Improve readability of painless script.
  • Improve commit body
Jun 23 2021, 5:12 PM
KShivendu updated the summary of D5915: elasticsearch.py: Improve readability of painless script.
Jun 23 2021, 5:02 PM
KShivendu requested review of D5915: elasticsearch.py: Improve readability of painless script.
Jun 23 2021, 5:02 PM
KShivendu closed D5883: Setup storage and store last revision/release date.
Jun 23 2021, 4:40 PM
KShivendu committed rDSEA9bedaa95a39e: journal_client: Store last revision/release date (authored by KShivendu).
journal_client: Store last revision/release date
Jun 23 2021, 4:40 PM
KShivendu updated the diff for D5883: Setup storage and store last revision/release date.

Limit each line in commit message to 80 chars

Jun 23 2021, 4:31 PM
KShivendu closed D5912: .gitignore: Add .vscode/.
Jun 23 2021, 4:18 PM
KShivendu committed rDTPL4cf219b2d014: .gitignore: Add .vscode/ (authored by KShivendu).
.gitignore: Add .vscode/
Jun 23 2021, 4:18 PM
KShivendu requested review of D5912: .gitignore: Add .vscode/.
Jun 23 2021, 2:10 PM
KShivendu updated the diff for D5883: Setup storage and store last revision/release date.

Update commit message body

Jun 23 2021, 1:53 PM
KShivendu added a comment to D5883: Setup storage and store last revision/release date.

Can you please also take care of updating it at the same time?

Jun 23 2021, 1:37 PM
KShivendu updated the diff for D5883: Setup storage and store last revision/release date.
  • rename the type parameter to date_type
  • add .vscode to .gitignore
Jun 23 2021, 1:24 PM
KShivendu added a comment to D5883: Setup storage and store last revision/release date.

Is it okay if I add .vscode to .gitignore? It often gets included by mistake. swh-indexer, swh-storage and swh-web already have it in their .gitignore files.

Jun 23 2021, 6:36 AM
KShivendu updated the diff for D5883: Setup storage and store last revision/release date.
  • Improve code quality
Jun 23 2021, 6:30 AM
KShivendu updated the diff for D5883: Setup storage and store last revision/release date.
  • Improve code quality (as suggested by @anlambert)
Jun 23 2021, 6:27 AM

Jun 21 2021

KShivendu added inline comments to D5883: Setup storage and store last revision/release date.
Jun 21 2021, 8:18 AM
KShivendu updated the diff for D5883: Setup storage and store last revision/release date.
  • Changes suggested by vlorentz
Jun 21 2021, 8:11 AM
KShivendu added inline comments to D5883: Setup storage and store last revision/release date.
Jun 21 2021, 7:58 AM

Jun 18 2021

KShivendu updated the diff for D5883: Setup storage and store last revision/release date.
  • Throw error in absence of storage config
Jun 18 2021, 5:22 PM
KShivendu updated the diff for D5883: Setup storage and store last revision/release date.
  • Add test for fetch_last_revision_release_date in journal_client
  • Add missing arguments and tests related to last_release_date
Jun 18 2021, 5:09 PM
KShivendu created P1073 Cannot add revisions in swh.storage (atleast when using in_memory implementation).
Jun 18 2021, 4:20 PM
KShivendu added inline comments to D5883: Setup storage and store last revision/release date.
Jun 18 2021, 1:29 PM
KShivendu updated the diff for D5883: Setup storage and store last revision/release date.
  • Include last revision/release date only when available
Jun 18 2021, 1:23 PM
KShivendu updated the diff for D5883: Setup storage and store last revision/release date.
  • merge the fetch_last_*_date methods
  • Use snapshot_get_all_branches instead of snapshot_get_branches
  • Fix diff/commit description/message
  • Use in-memory backend in test_cli.py
Jun 18 2021, 1:06 PM
KShivendu updated the summary of D5883: Setup storage and store last revision/release date.
Jun 18 2021, 1:03 PM