diff --git a/docs/query-language.rst b/docs/query-language.rst
index ed6623a..11dba04 100644
--- a/docs/query-language.rst
+++ b/docs/query-language.rst
@@ -1,190 +1,190 @@
 Search Query Language
 =====================
 
 
 Every query is composed of filters separated by ``and`` or ``or``.
 These filters have 3 components in the order : ``Name Operator Value``
 
 Some of the examples are :
-    * ``origin = django and language in [python] and visits >= 5``
+    * ``origin : plasma and language in [python] and visits >= 5``
     * ``last_revision > 2020-01-01 and limit = 10``
     * ``last_visit > 2021-01-01 or last_visit < 2020-01-01``
-    * ``visited = false and metadata = "kubernetes" or origin = "minikube"``
+    * ``visited = false and metadata = "kubernetes" or origin : "minikube"``
     * ``keyword in ["orchestration", "kubectl"] and language in ["go", "rust"]``
-    * ``(origin = debian or visit_type = ["deb"]) and license in ["GPL-3"]``
+    * ``(origin : debian or visit_type = ["deb"]) and license in ["GPL-3"]``
 
 **Note**:
     * Whitespaces are optional between the three components of a filter.
     * The conjunction operators have left precedence. Therefore ``foo and bar and baz`` means ``(foo and bar) and baz``
     * ``and`` has higher precedence than ``or``. Therefore ``foo or bar and baz`` means ``foo or (bar and baz)``
     * Precedence can be overridden using parentheses: ``(`` and ``)``. For example, you can override the default precedence in the previous query as: ``(foo or bar) and baz``
     * To actually search for ``and`` or ``or`` as strings, just put them within quotes. Example : ``metadata : "vcs history and metadata"``, or even just ``metadata : "and"`` to search for the string ``and`` in the metadata
 
 The filters have been classified based on the type of value that they expects.
 
 
 Pattern filters
 ---------------
 Returns origins having the given keywords in their url or intrinsic metadata
 
     * Name:
         * ``origin``: Keywords from the origin url
         * ``metadata``: Keywords from all the intrinsic metadata fields
-    * Operator: ``=``
+    * Operator: ``:``
     * Value: String wrapped in quotation marks(``"`` or ``'``)
 
 **Note:** If a string has no whitespace then the quotation marks become optional.
 
 **Examples:**
 
-    * ``origin = https://github.com/Django/django``
-    * ``origin = kubernetes``
-    * ``origin = "github python"``
-    * ``metadata = orchestration``
-    * ``metadata = "javascript language"``
+    * ``origin : https://github.com/Django/django``
+    * ``origin : kubernetes``
+    * ``origin : "github python"``
+    * ``metadata : orchestration``
+    * ``metadata : "javascript language"``
 
 Boolean filters
 ---------------
 Returns origins having their boolean type values equal to given values
 
     * Name: ``visited`` : Whether the origin has been visited
     * Operator: ``=``
     * Value: ``true`` or ``false``
 
 **Examples:**
 
     * ``visited = true``
     * ``visited = false``
 
 
 Numeric filters
 ---------------
 Returns origins having their numeric type values in the given range
 
     * Name: ``visits`` : Number of visits of an origin
     * Operator: ``<`` ``<=`` ``=`` ``!=`` ``>`` ``>=``
     * Value: Positive integer
 
 **Examples:**
 
 
     * ``visits > 2``
     * ``visits = 5``
     * ``visits <= 10``
 
 
 Un-bounded List filters
 -----------------------
 
 Returns origins that satisfy the criteria based on a given list
 
     * Name:
         * ``language`` : Programming languages used
         * ``license`` : License used
         * ``keyword`` : keywords (often same as tags) or description (includes README) from the metadata
     * Operator: ``in`` ``not in``
     * Value: Array of strings
 
 **Note:**
     * If a string has no whitespace then the quotation marks become optional.
 
     * The ``keyword`` filter gives more priority to the keywords field of intrinsic metadata than the description field. So origins having the queried term in their intrinsic metadata keyword will appear first.
 
 
 **Examples:**
 
     * ``language in [python, js]``
     * ``license in ["GPL 3.0 or later", MIT]``
     * ``keyword in ["Software Heritage", swh]``
 
 
 Bounded List filters
 --------------------
 
 Returns origins that satisfy the criteria based on a list of fixed options
 
     **visit_type**
 
     * Name: ``visit_type`` : Returns only origins with at least one of the specified visit types
     * Operator: ``=``
     * Value: Array of the following values
 
         ``any``
         ``cran``
         ``deb``
         ``deposit``
         ``ftp``
         ``hg``
         ``git``
         ``nixguix``
         ``npm``
         ``pypi``
         ``svn``
         ``tar``
 
     **sort_by**
 
     * Name: ``sort_by`` : Sorts origins based on the given list of origin attributes
     * Operator: ``=``
     * Value: Array of the following values
 
         ``visits``
         ``last_visit``
         ``last_eventful_visit``
         ``last_revision``
         ``last_release``
         ``created``
         ``modified``
         ``published``
 
 **Examples:**
 
 
     * ``visit_type = [svn, npm]``
     * ``visit_type = [nixguix, "ftp"]``
     * ``sort_by = ["last_visit", created]``
     * ``sort_by = [visits, modified]``
 
 Date filters
 ------------
 
 Returns origins having their date type values in the given range
 
     * Name:
 
             * ``last_visit`` : Latest visit date
             * ``last_eventful_visit`` : Latest visit date where a new snapshot was detected
             * ``last_revision`` : Latest commit date
             * ``last_release`` : Latest release date
             * ``created`` Creation date
             * ``modified`` Modification date
             * ``published`` Published date
 
     * Operator: ``<`` ``<=`` ``=`` ``!=`` ``>`` ``>=``
     * Value: Date in ``Standard ISO`` format
 
     **Note:** The last three date filters are based on metadata that has to be manually entered
     by the repository authors. So they might not be correct or up-to-date.
 
 **Examples:**
 
     * ``last_visit > 2001-01-01 and last_visit < 2101-01-01``
     * ``last_revision = "2000-01-01 18:35Z"``
     * ``last_release != "2021-07-17T18:35:00Z"``
     * ``created <= "2021-07-17 18:35"``
 
 Limit filter
 ------------
 
 Limits the number of results to at most N
 
     * Name: ``limit``
     * Operator: ``=``
     * Value: Positive Integer
 
 **Note:** The default value of the limit is 50
 
 **Examples:**
 
     * ``limit = 1``
     * ``limit = 15``
diff --git a/swh/search/elasticsearch.py b/swh/search/elasticsearch.py
index a0ca953..5cc0451 100644
--- a/swh/search/elasticsearch.py
+++ b/swh/search/elasticsearch.py
@@ -1,555 +1,555 @@
 # Copyright (C) 2019-2021  The Software Heritage developers
 # See the AUTHORS file at the top-level directory of this distribution
 # License: GNU General Public License version 3, or any later version
 # See top-level LICENSE file for more information
 
 import base64
 from collections import Counter
 import logging
 import pprint
 from textwrap import dedent
 from typing import Any, Dict, Iterable, List, Optional
 
 from elasticsearch import Elasticsearch, helpers
 import msgpack
 
 from swh.indexer import codemeta
 from swh.model import model
 from swh.model.hashutil import hash_to_hex
 from swh.search.interface import (
     SORT_BY_OPTIONS,
     MinimalOriginDict,
     OriginDict,
     PagedResult,
 )
 from swh.search.metrics import send_metric, timed
 from swh.search.translator import Translator
 from swh.search.utils import escape, get_expansion, parse_and_format_date
 
 logger = logging.getLogger(__name__)
 
 INDEX_NAME_PARAM = "index"
 READ_ALIAS_PARAM = "read_alias"
 WRITE_ALIAS_PARAM = "write_alias"
 
 ORIGIN_DEFAULT_CONFIG = {
     INDEX_NAME_PARAM: "origin",
     READ_ALIAS_PARAM: "origin-read",
     WRITE_ALIAS_PARAM: "origin-write",
 }
 
 
 def _sanitize_origin(origin):
     origin = origin.copy()
 
     # Whitelist fields to be saved in Elasticsearch
     res = {"url": origin.pop("url")}
     for field_name in (
         "blocklisted",
         "has_visits",
         "intrinsic_metadata",
         "visit_types",
         "nb_visits",
         "snapshot_id",
         "last_visit_date",
         "last_eventful_visit_date",
         "last_revision_date",
         "last_release_date",
     ):
         if field_name in origin:
             res[field_name] = origin.pop(field_name)
 
     # Run the JSON-LD expansion algorithm
     # <https://www.w3.org/TR/json-ld-api/#expansion>
     # to normalize the Codemeta metadata.
     # This is required as Elasticsearch will needs each field to have a consistent
     # type across documents to be searchable; and non-expanded JSON-LD documents
     # can have various types in the same field. For example, all these are
     # equivalent in JSON-LD:
     # * {"author": "Jane Doe"}
     # * {"author": ["Jane Doe"]}
     # * {"author": {"@value": "Jane Doe"}}
     # * {"author": [{"@value": "Jane Doe"}]}
     # and JSON-LD expansion will convert them all to the last one.
     if "intrinsic_metadata" in res:
         intrinsic_metadata = res["intrinsic_metadata"]
         for date_field in ["dateCreated", "dateModified", "datePublished"]:
             if date_field in intrinsic_metadata:
                 date = intrinsic_metadata[date_field]
 
                 # If date{Created,Modified,Published} value isn't parsable
                 # It gets rejected and isn't stored (unlike other fields)
                 formatted_date = parse_and_format_date(date)
                 if formatted_date is None:
                     intrinsic_metadata.pop(date_field)
                 else:
                     intrinsic_metadata[date_field] = formatted_date
 
         res["intrinsic_metadata"] = codemeta.expand(intrinsic_metadata)
 
     return res
 
 
 def token_encode(index_to_tokenize: Dict[bytes, Any]) -> str:
     """Tokenize as string an index page result from a search"""
     page_token = base64.b64encode(msgpack.dumps(index_to_tokenize))
     return page_token.decode()
 
 
 def token_decode(page_token: str) -> Dict[bytes, Any]:
     """Read the page_token"""
     return msgpack.loads(base64.b64decode(page_token.encode()), raw=True)
 
 
 class ElasticSearch:
     def __init__(self, hosts: List[str], indexes: Dict[str, Dict[str, str]] = {}):
         self._backend = Elasticsearch(hosts=hosts)
         self._translator = Translator()
 
         # Merge current configuration with default values
         origin_config = indexes.get("origin", {})
         self.origin_config = {**ORIGIN_DEFAULT_CONFIG, **origin_config}
 
     def _get_origin_index(self) -> str:
         return self.origin_config[INDEX_NAME_PARAM]
 
     def _get_origin_read_alias(self) -> str:
         return self.origin_config[READ_ALIAS_PARAM]
 
     def _get_origin_write_alias(self) -> str:
         return self.origin_config[WRITE_ALIAS_PARAM]
 
     @timed
     def check(self):
         return self._backend.ping()
 
     def deinitialize(self) -> None:
         """Removes all indices from the Elasticsearch backend"""
         self._backend.indices.delete(index="*")
 
     def initialize(self) -> None:
         """Declare Elasticsearch indices, aliases and mappings"""
 
         if not self._backend.indices.exists(index=self._get_origin_index()):
             self._backend.indices.create(index=self._get_origin_index())
 
         if not self._backend.indices.exists_alias(name=self._get_origin_read_alias()):
             self._backend.indices.put_alias(
                 index=self._get_origin_index(), name=self._get_origin_read_alias()
             )
 
         if not self._backend.indices.exists_alias(name=self._get_origin_write_alias()):
             self._backend.indices.put_alias(
                 index=self._get_origin_index(), name=self._get_origin_write_alias()
             )
 
         self._backend.indices.put_mapping(
             index=self._get_origin_index(),
             body={
                 "dynamic_templates": [
                     {
                         "booleans_as_string": {
                             # All fields stored as string in the metadata
                             # even the booleans
                             "match_mapping_type": "boolean",
                             "path_match": "intrinsic_metadata.*",
                             "mapping": {"type": "keyword"},
                         }
                     }
                 ],
                 "date_detection": False,
                 "properties": {
                     # sha1 of the URL; used as the document id
                     "sha1": {"type": "keyword", "doc_values": True,},
                     # Used both to search URLs, and as the result to return
                     # as a response to queries
                     "url": {
                         "type": "text",
                         # To split URLs into token on any character
                         # that is not alphanumerical
                         "analyzer": "simple",
                         # 2-gram and partial-3-gram search (ie. with the end of the
                         # third word potentially missing)
                         "fields": {
                             "as_you_type": {
                                 "type": "search_as_you_type",
                                 "analyzer": "simple",
                             }
                         },
                     },
                     "visit_types": {"type": "keyword"},
                     # used to filter out origins that were never visited
                     "has_visits": {"type": "boolean",},
                     "nb_visits": {"type": "integer"},
                     "snapshot_id": {"type": "keyword"},
                     "last_visit_date": {"type": "date"},
                     "last_eventful_visit_date": {"type": "date"},
                     "last_release_date": {"type": "date"},
                     "last_revision_date": {"type": "date"},
                     "intrinsic_metadata": {
                         "type": "nested",
                         "properties": {
                             "@context": {
                                 # don't bother indexing tokens in these URIs, as the
                                 # are used as namespaces
                                 "type": "keyword",
                             },
                             "http://schema": {
                                 "properties": {
                                     "org/dateCreated": {
                                         "properties": {"@value": {"type": "date",}}
                                     },
                                     "org/dateModified": {
                                         "properties": {"@value": {"type": "date",}}
                                     },
                                     "org/datePublished": {
                                         "properties": {"@value": {"type": "date",}}
                                     },
                                 }
                             },
                         },
                     },
                     # Has this origin been taken down?
                     "blocklisted": {"type": "boolean",},
                 },
             },
         )
 
     @timed
     def flush(self) -> None:
         self._backend.indices.refresh(index=self._get_origin_write_alias())
 
     @timed
     def origin_update(self, documents: Iterable[OriginDict]) -> None:
         write_index = self._get_origin_write_alias()
         documents = map(_sanitize_origin, documents)
         documents_with_sha1 = (
             (hash_to_hex(model.Origin(url=document["url"]).id), document)
             for document in documents
         )
         # painless script that will be executed when updating an origin document
         update_script = dedent(
             """
             // utility function to get and parse date
             ZonedDateTime getDate(def ctx, String date_field) {
                 String default_date = "0001-01-01T00:00:00Z";
                 String date = ctx._source.getOrDefault(date_field, default_date);
                 return ZonedDateTime.parse(date);
             }
 
             // backup current visit_types field value
             List visit_types = ctx._source.getOrDefault("visit_types", []);
             int nb_visits = ctx._source.getOrDefault("nb_visits", 0);
 
             ZonedDateTime last_visit_date = getDate(ctx, "last_visit_date");
 
             String snapshot_id = ctx._source.getOrDefault("snapshot_id", "");
             ZonedDateTime last_eventful_visit_date =
                 getDate(ctx, "last_eventful_visit_date");
             ZonedDateTime last_revision_date = getDate(ctx, "last_revision_date");
             ZonedDateTime last_release_date = getDate(ctx, "last_release_date");
 
             // update origin document with new field values
             ctx._source.putAll(params);
 
             // restore previous visit types after visit_types field overriding
             if (ctx._source.containsKey("visit_types")) {
                 for (int i = 0; i < visit_types.length; ++i) {
                     if (!ctx._source.visit_types.contains(visit_types[i])) {
                         ctx._source.visit_types.add(visit_types[i]);
                     }
                 }
             }
 
             // Undo overwrite if incoming nb_visits is smaller
             if (ctx._source.containsKey("nb_visits")) {
                 int incoming_nb_visits = ctx._source.getOrDefault("nb_visits", 0);
                 if(incoming_nb_visits < nb_visits){
                     ctx._source.nb_visits = nb_visits;
                 }
             }
 
             // Undo overwrite if incoming last_visit_date is older
             if (ctx._source.containsKey("last_visit_date")) {
                 ZonedDateTime incoming_last_visit_date = getDate(ctx, "last_visit_date");
                 int difference =
                     // returns -1, 0 or 1
                     incoming_last_visit_date.compareTo(last_visit_date);
                 if(difference < 0){
                     ctx._source.last_visit_date = last_visit_date;
                 }
             }
 
             // Undo update of last_eventful_date and snapshot_id if
             // snapshot_id hasn't changed OR incoming_last_eventful_visit_date is older
             if (ctx._source.containsKey("snapshot_id")) {
                 String incoming_snapshot_id = ctx._source.getOrDefault("snapshot_id", "");
                 ZonedDateTime incoming_last_eventful_visit_date =
                     getDate(ctx, "last_eventful_visit_date");
                 int difference =
                     // returns -1, 0 or 1
                     incoming_last_eventful_visit_date.compareTo(last_eventful_visit_date);
                 if(snapshot_id == incoming_snapshot_id || difference < 0){
                     ctx._source.snapshot_id = snapshot_id;
                     ctx._source.last_eventful_visit_date = last_eventful_visit_date;
                 }
             }
 
             // Undo overwrite if incoming last_revision_date is older
             if (ctx._source.containsKey("last_revision_date")) {
                 ZonedDateTime incoming_last_revision_date =
                     getDate(ctx, "last_revision_date");
                 int difference =
                     // returns -1, 0 or 1
                     incoming_last_revision_date.compareTo(last_revision_date);
                 if(difference < 0){
                     ctx._source.last_revision_date = last_revision_date;
                 }
             }
 
             // Undo overwrite if incoming last_release_date is older
             if (ctx._source.containsKey("last_release_date")) {
                 ZonedDateTime incoming_last_release_date =
                     getDate(ctx, "last_release_date");
                 // returns -1, 0 or 1
                 int difference = incoming_last_release_date.compareTo(last_release_date);
                 if(difference < 0){
                     ctx._source.last_release_date = last_release_date;
                 }
             }
             """  # noqa
         )
 
         actions = [
             {
                 "_op_type": "update",
                 "_id": sha1,
                 "_index": write_index,
                 "scripted_upsert": True,
                 "upsert": {**document, "sha1": sha1,},
                 "retry_on_conflict": 10,
                 "script": {
                     "source": update_script,
                     "lang": "painless",
                     "params": document,
                 },
             }
             for (sha1, document) in documents_with_sha1
         ]
 
         indexed_count, errors = helpers.bulk(self._backend, actions, index=write_index)
         assert isinstance(errors, List)  # Make mypy happy
 
         send_metric("document:index", count=indexed_count, method_name="origin_update")
         send_metric(
             "document:index_error", count=len(errors), method_name="origin_update"
         )
 
     @timed
     def origin_search(
         self,
         *,
         query: str = "",
         url_pattern: Optional[str] = None,
         metadata_pattern: Optional[str] = None,
         with_visit: bool = False,
         visit_types: Optional[List[str]] = None,
         min_nb_visits: int = 0,
         min_last_visit_date: str = "",
         min_last_eventful_visit_date: str = "",
         min_last_revision_date: str = "",
         min_last_release_date: str = "",
         min_date_created: str = "",
         min_date_modified: str = "",
         min_date_published: str = "",
         programming_languages: Optional[List[str]] = None,
         licenses: Optional[List[str]] = None,
         keywords: Optional[List[str]] = None,
         sort_by: Optional[List[str]] = None,
         page_token: Optional[str] = None,
         limit: int = 50,
     ) -> PagedResult[MinimalOriginDict]:
         query_clauses: List[Dict[str, Any]] = []
 
         query_filters = []
         if url_pattern:
-            query_filters.append(f"origin = {escape(url_pattern)}")
+            query_filters.append(f"origin : {escape(url_pattern)}")
 
         if metadata_pattern:
-            query_filters.append(f"metadata = {escape(metadata_pattern)}")
+            query_filters.append(f"metadata : {escape(metadata_pattern)}")
 
         # if not query_clauses:
         #     raise ValueError(
         #         "At least one of url_pattern and metadata_pattern must be provided."
         #     )
 
         if with_visit:
             query_filters.append(f"visited = {'true' if with_visit else 'false'}")
         if min_nb_visits:
             query_filters.append(f"visits >= {min_nb_visits}")
         if min_last_visit_date:
             query_filters.append(
                 f"last_visit >= {min_last_visit_date.replace('Z', '+00:00')}"
             )
         if min_last_eventful_visit_date:
             query_filters.append(
                 "last_eventful_visit >= "
                 f"{min_last_eventful_visit_date.replace('Z', '+00:00')}"
             )
         if min_last_revision_date:
             query_filters.append(
                 f"last_revision >= {min_last_revision_date.replace('Z', '+00:00')}"
             )
         if min_last_release_date:
             query_filters.append(
                 f"last_release >= {min_last_release_date.replace('Z', '+00:00')}"
             )
         if keywords:
             query_filters.append(f"keyword in {escape(keywords)}")
         if licenses:
             query_filters.append(f"license in {escape(licenses)}")
 
         if programming_languages:
             query_filters.append(f"language in {escape(programming_languages)}")
 
         if min_date_created:
             query_filters.append(
                 f"created >= {min_date_created.replace('Z', '+00:00')}"
             )
         if min_date_modified:
             query_filters.append(
                 f"modified >= {min_date_modified.replace('Z', '+00:00')}"
             )
         if min_date_published:
             query_filters.append(
                 f"published >= {min_date_published.replace('Z', '+00:00')}"
             )
 
         if visit_types is not None:
             query_filters.append(f"visit_type = {escape(visit_types)}")
 
         combined_filters = " and ".join(query_filters)
         if combined_filters and query:
             query = f"{combined_filters} and {query}"
         else:
             query = combined_filters or query
         parsed_query = self._translator.parse_query(query)
         query_clauses.append(parsed_query["filters"])
 
         field_map = {
             "visits": "nb_visits",
             "last_visit": "last_visit_date",
             "last_eventful_visit": "last_eventful_visit_date",
             "last_revision": "last_revision_date",
             "last_release": "last_release_date",
             "created": "date_created",
             "modified": "date_modified",
             "published": "date_published",
         }
 
         if "sortBy" in parsed_query:
             if sort_by is None:
                 sort_by = []
             for sort_by_option in parsed_query["sortBy"]:
                 if sort_by_option[0] == "-":
                     sort_by.append("-" + field_map[sort_by_option[1:]])
                 else:
                     sort_by.append(field_map[sort_by_option])
         if parsed_query.get("limit", 0):
             limit = parsed_query["limit"]
 
         sorting_params: List[Dict[str, Any]] = []
 
         if sort_by:
             for field in sort_by:
                 order = "asc"
                 if field and field[0] == "-":
                     field = field[1:]
                     order = "desc"
 
                 if field in ["date_created", "date_modified", "date_published"]:
                     sorting_params.append(
                         {
                             get_expansion(field, "."): {
                                 "nested_path": "intrinsic_metadata",
                                 "order": order,
                             }
                         }
                     )
                 elif field in SORT_BY_OPTIONS:
                     sorting_params.append({field: order})
 
         sorting_params.extend(
             [{"_score": "desc"}, {"sha1": "asc"},]
         )
 
         body = {
             "query": {
                 "bool": {
                     "must": query_clauses,
                     "must_not": [{"term": {"blocklisted": True}}],
                 }
             },
             "sort": sorting_params,
         }
 
         if page_token:
             # TODO: use ElasticSearch's scroll API?
             page_token_content = token_decode(page_token)
             body["search_after"] = [
                 page_token_content[b"score"],
                 page_token_content[b"sha1"].decode("ascii"),
             ]
 
         if logger.isEnabledFor(logging.DEBUG):
             formatted_body = pprint.pformat(body)
             logger.debug("Search query body: %s", formatted_body)
 
         res = self._backend.search(
             index=self._get_origin_read_alias(), body=body, size=limit
         )
 
         hits = res["hits"]["hits"]
 
         next_page_token: Optional[str] = None
 
         if len(hits) == limit:
             # There are more results after this page; return a pagination token
             # to get them in a future query
             last_hit = hits[-1]
             next_page_token_content = {
                 b"score": last_hit["_score"],
                 b"sha1": last_hit["_source"]["sha1"],
             }
             next_page_token = token_encode(next_page_token_content)
 
         assert len(hits) <= limit
 
         return PagedResult(
             results=[{"url": hit["_source"]["url"]} for hit in hits],
             next_page_token=next_page_token,
         )
 
     def visit_types_count(self) -> Counter:
         body = {
             "aggs": {
                 "not_blocklisted": {
                     "filter": {"bool": {"must_not": [{"term": {"blocklisted": True}}]}},
                     "aggs": {
                         "visit_types": {"terms": {"field": "visit_types", "size": 1000}}
                     },
                 }
             }
         }
 
         res = self._backend.search(
             index=self._get_origin_read_alias(), body=body, size=0
         )
 
         buckets = (
             res.get("aggregations", {})
             .get("not_blocklisted", {})
             .get("visit_types", {})
             .get("buckets", [])
         )
         return Counter({bucket["key"]: bucket["doc_count"] for bucket in buckets})
diff --git a/swh/search/query_language/grammar.js b/swh/search/query_language/grammar.js
index 594a934..eb901f1 100644
--- a/swh/search/query_language/grammar.js
+++ b/swh/search/query_language/grammar.js
@@ -1,192 +1,193 @@
 // Copyright (C) 2021  The Software Heritage developers
 // See the AUTHORS file at the top-level directory of this distribution
 // License: GNU General Public License version 3, or any later version
 // See top-level LICENSE file for more information
 
 const { visitTypeField, sortByField, limitField } = require("./tokens.js");
 const {
     patternFields,
     booleanFields,
     numericFields,
     listFields,
     dateFields
 } = require("./tokens.js");
-const { equalOp, rangeOp, choiceOp } = require("./tokens.js");
+const { equalOp, containOp, rangeOp, choiceOp } = require("./tokens.js");
 const { sortByOptions, visitTypeOptions } = require("./tokens.js");
 const { OR, AND, TRUE, FALSE } = require("./tokens.js");
 
 const PRECEDENCE = {
     or: 2,
     and: 3,
     bracket: 4,
 }
 
 module.exports = grammar({
     name: 'swh_search_ql',
 
     rules: {
         query: $ => seq(
             $.filters,
             optional(seq(
                 optional($.and),
                 choice(
                     seq($.sortBy, optional($.and), optional($.limit)),
                     seq($.limit, optional($.and), optional($.sortBy)),
                 ),
             ))
         ),
 
         filters: $ => choice(
             prec.left(PRECEDENCE.and,
                 seq(
                     field('left', $.filters),
                     field('operator', $.and),
                     field('right', $.filters),
                 )
             ),
             prec.left(PRECEDENCE.or,
                 seq(
                     field('left', $.filters),
                     field('operator', $.or),
                     field('right', $.filters),
                 )
             ),
             prec.left(PRECEDENCE.bracket,
                 seq("(", $.filters, ")"),
             ),
             $.filter
         ),
 
         sortBy: $ => annotateFilter($.sortByField, $.sortByOp, $.sortByVal),
         sortByField: $ => token(sortByField),
         sortByOp: $ => $.equalOp,
         sortByVal: $ => createArray(optionalWrapWith($.sortByOptions, ["'", '"'])),
         sortByOptions: $ => seq(
             optional('-'),
             choice(...sortByOptions)
         ),
 
         limit: $ => annotateFilter($.limitField, $.equalOp, $.number),
         limitField: $ => token(limitField),
 
         filter: $ => field('category', choice(
             $.patternFilter,
             $.booleanFilter,
             $.numericFilter,
             $.boundedListFilter,
             $.unboundedListFilter,
             $.dateFilter
         )),
 
         patternFilter: $ => annotateFilter($.patternField, $.patternOp, $.patternVal),
         patternField: $ => token(choice(...patternFields)),
-        patternOp: $ => $.equalOp,
+        patternOp: $ => $.containOp,
         patternVal: $ => $.string,
 
         booleanFilter: $ => annotateFilter($.booleanField, $.booleanOp, $.booleanVal),
         booleanField: $ => token(choice(...booleanFields)),
         booleanOp: $ => $.equalOp,
         booleanVal: $ => choice($.booleanTrue, $.booleanFalse),
 
         numericFilter: $ => annotateFilter($.numericField, $.numericOp, $.numberVal),
         numericField: $ => token(choice(...numericFields)),
         numericOp: $ => $.rangeOp,
         numberVal: $ => $.number,
 
         // Array members must be from the given options
         boundedListFilter: $ => choice($.visitTypeFilter),
 
         visitTypeFilter: $ => annotateFilter($.visitTypeField, $.visitTypeOp, $.visitTypeVal),
         visitTypeField: $ => token(visitTypeField),
         visitTypeOp: $ => $.equalOp,
         visitTypeVal: $ => createArray(optionalWrapWith($.visitTypeOptions, ["'", '"'])),
         visitTypeOptions: $ => choice(...visitTypeOptions),
         // TODO: fetch visitTypeOptions choices dynamically from other swh services?
 
         // Array members can be any string
         unboundedListFilter: $ => annotateFilter($.listField, $.listOp, $.listVal),
         listField: $ => token(choice(...listFields)),
         listOp: $ => $.choiceOp,
         listVal: $ => createArray($.string),
 
         dateFilter: $ => annotateFilter($.dateField, $.dateOp, $.dateVal),
         dateField: $ => token(choice(...dateFields)),
         dateOp: $ => $.rangeOp,
         dateVal: $ => $.isoDateTime,
 
         rangeOp: $ => token(choice(...rangeOp)),
         equalOp: $ => token(choice(...equalOp)),
+        containOp: $ => token(choice(...containOp)),
         choiceOp: $ => token(choice(...choiceOp)),
 
         isoDateTime: $ => {
             const dateRegex = (/\d{4}[-]\d{2}[-]\d{2}/).source
             const dateTimeSepRegex = (/(\s|T)*/).source
             const timeRegex = (/(\d{2}:\d{2}(:\d{2}(\.\d{6})?)?)?/).source
             const timezoneRegex = (/(\+\d{2}:\d{2}|Z)?/).source
             return new RegExp(dateRegex + dateTimeSepRegex + timeRegex + timezoneRegex)
         },
 
         string: $ => choice(wrapWith($.stringContent, ["'", '"']), $.singleWord),
         number: $ => /\d+/,
         booleanTrue: $ => TRUE,
         booleanFalse: $ => FALSE,
 
         or: $ => OR,
         and: $ => AND,
 
         singleWord: $ => /[^\s"'\[\]\(\),]+/,
 
         // Based on tree-sitter-json grammar:
         stringContent: $ => repeat1(choice(
             token.immediate(/[^\\'"\n]+/),
             $.escape_sequence
         )),
         escape_sequence: $ => token.immediate(seq(
             '\\',
             /(\"|\'|\\|\/|b|n|r|t|u)/
         )),
 
     }
 });
 
 
 function joinBySep1(rule, sep) {
     // At least one repetition of the rule separated by `sep`
     return seq(rule, repeat(seq(sep, optional(rule))))
 }
 
 function joinBySep(rule, sep = ",") {
     // Any number of repetitions of the rule separated by `sep`
     return optional(joinBySep1(rule, sep))
 }
 
 function createArray(rule) {
     // An array having `rule` as its member
     return seq(
         "[",
         joinBySep(
             field('array_member', rule),
             ","
         ),
         "]"
     )
 }
 
 function wrapWith(rule, wrappers = ["'", '"']) {
     // The rule must be wrapped with one of the wrappers
     const wrappedRules = wrappers.map(wrapper => seq(wrapper, rule, wrapper))
     return choice(...wrappedRules)
 }
 
 function optionalWrapWith(rule, wrappers = ["'", '"']) {
     // The rule may or may not be wrapped with the wrappers
     return choice(wrapWith(rule, wrappers), rule)
 }
 
 function annotateFilter(filterField, filterOp, filterVal) {
     return seq(
         field('field', filterField),
         field('op', filterOp),
         field('value', filterVal)
     );
 }
diff --git a/swh/search/query_language/sample_query b/swh/search/query_language/sample_query
index 3d8c08d..1ea3ad5 100644
--- a/swh/search/query_language/sample_query
+++ b/swh/search/query_language/sample_query
@@ -1,6 +1,6 @@
-(origin = django/django and language in ["python"] or visits >= 5) or
+(origin : django/django and language in ["python"] or visits >= 5) or
 (last_revision > 2020-01-01 and limit = 10) or
 (last_visit > 2021-01-01 or last_visit < 2020-01-01) or
-(visited = false and metadata = "gitlab") or
+(visited = false and metadata : "gitlab") or
 (keyword in ["orchestration", "kubectl"] and language in ["go", "rust"]) or
 (visit_type = [deb] and license in ["GPL-3"])
diff --git a/swh/search/query_language/test/corpus/combinations.txt b/swh/search/query_language/test/corpus/combinations.txt
index 999ed76..7953e96 100644
--- a/swh/search/query_language/test/corpus/combinations.txt
+++ b/swh/search/query_language/test/corpus/combinations.txt
@@ -1,82 +1,82 @@
 ==============================
 Empty query (should throw error)
 ==============================
 
 ---
 
 (ERROR)
 
 
 ==================
 Origins with django as keyword, python language, and more than 5 visits
 ==================
 
-origin = django and language in ["python"] and visits >= 5
+origin : django and language in ["python"] and visits >= 5
 
 ---
-(query (filters (filters (filters (filter (patternFilter (patternField) (patternOp (equalOp)) (patternVal (string (singleWord)))))) (and) (filters (filter (unboundedListFilter (listField) (listOp (choiceOp)) (listVal (string (stringContent))))))) (and) (filters (filter (numericFilter (numericField) (numericOp (rangeOp)) (numberVal (number)))))))
+(query (filters (filters (filters (filter (patternFilter (patternField) (patternOp (containOp)) (patternVal (string (singleWord)))))) (and) (filters (filter (unboundedListFilter (listField) (listOp (choiceOp)) (listVal (string (stringContent))))))) (and) (filters (filter (numericFilter (numericField) (numericOp (rangeOp)) (numberVal (number)))))))
 
 ==================
 10 origins with latest revision after 2020-01-01
 ==================
 last_revision > 2020-01-01 limit = 10
 ---
 (query (filters (filter (dateFilter (dateField) (dateOp (rangeOp)) (dateVal (isoDateTime))))) (limit (limitField) (equalOp) (number)))
 
 ==================
 Origins with last visit date not in 2020-2021 (sorted by number of visits)
 ==================
 
 last_visit > 2021-01-01 or last_visit < 2020-01-01 sort_by = ["visits"]
 ---
 (query (filters (filters (filter (dateFilter (dateField) (dateOp (rangeOp)) (dateVal (isoDateTime))))) (or) (filters (filter (dateFilter (dateField) (dateOp (rangeOp)) (dateVal (isoDateTime)))))) (sortBy (sortByField) (sortByOp (equalOp)) (sortByVal (sortByOptions))))
 
 ==================
 Unvisited origins with kubernetes in metadata or minikube in url
 ==================
 
-visited = false and metadata = "kubernetes" or origin = "minikube"
+visited = false and metadata : "kubernetes" or origin : "minikube"
 
 ---
-(query (filters (filters (filters (filter (booleanFilter (booleanField) (booleanOp (equalOp)) (booleanVal (booleanFalse))))) (and) (filters (filter (patternFilter (patternField) (patternOp (equalOp)) (patternVal (string (stringContent))))))) (or) (filters (filter (patternFilter (patternField) (patternOp (equalOp)) (patternVal (string (stringContent))))))))
+(query (filters (filters (filters (filter (booleanFilter (booleanField) (booleanOp (equalOp)) (booleanVal (booleanFalse))))) (and) (filters (filter (patternFilter (patternField) (patternOp (containOp)) (patternVal (string (stringContent))))))) (or) (filters (filter (patternFilter (patternField) (patternOp (containOp)) (patternVal (string (stringContent))))))))
 
 ==================
 Origins with "orchestration" or "kubectl" as keywords and language as "go" or "rust"
 ==================
 
 keyword in ["orchestration", "kubectl"] and language in ["go", "rust"]
 
 ---
 (query (filters (filters (filter (unboundedListFilter (listField) (listOp (choiceOp)) (listVal (string (stringContent)) (string (stringContent)))))) (and) (filters (filter (unboundedListFilter (listField) (listOp (choiceOp)) (listVal (string (stringContent)) (string (stringContent))))))))
 
 ==================
 Origins with a GPL-3 license that have "debian" in their url or have visit type as "deb"
 ==================
-(origin = debian or visit_type = ["deb"]) and license in ["GPL-3"]
+(origin : debian or visit_type = ["deb"]) and license in ["GPL-3"]
 ---
 
-(query (filters (filters (filters (filters (filter (patternFilter (patternField) (patternOp (equalOp)) (patternVal (string (singleWord)))))) (or) (filters (filter (boundedListFilter (visitTypeFilter (visitTypeField) (visitTypeOp (equalOp)) (visitTypeVal (visitTypeOptions)))))))) (and) (filters (filter (unboundedListFilter (listField) (listOp (choiceOp)) (listVal (string (stringContent))))))))
+(query (filters (filters (filters (filters (filter (patternFilter (patternField) (patternOp (containOp)) (patternVal (string (singleWord)))))) (or) (filters (filter (boundedListFilter (visitTypeFilter (visitTypeField) (visitTypeOp (equalOp)) (visitTypeVal (visitTypeOptions)))))))) (and) (filters (filter (unboundedListFilter (listField) (listOp (choiceOp)) (listVal (string (stringContent))))))))
 
 ==================
 Origins with `and` and `or` inside filter values
 ==================
-(origin = "foo and bar or baz")
+(origin : "foo and bar or baz")
 ---
 
-(query (filters (filters (filter (patternFilter (patternField) (patternOp (equalOp)) (patternVal (string (stringContent))))))))
+(query (filters (filters (filter (patternFilter (patternField) (patternOp (containOp)) (patternVal (string (stringContent))))))))
 
 
 ==================
 Origins with `'` and `"` inside filter values
 ==================
-(origin = "foo \\ \'bar\' \"baz\" ")
+(origin : "foo \\ \'bar\' \"baz\" ")
 ---
 
-(query (filters (filters (filter (patternFilter (patternField) (patternOp (equalOp)) (patternVal (string (stringContent (escape_sequence) (escape_sequence) (escape_sequence) (escape_sequence) (escape_sequence)))))))))
+(query (filters (filters (filter (patternFilter (patternField) (patternOp (containOp)) (patternVal (string (stringContent (escape_sequence) (escape_sequence) (escape_sequence) (escape_sequence) (escape_sequence)))))))))
 
 ==================
 Incomplete conjunction operators should throw error
 ==================
 visits > 5 and
 ---
 (query (filters (filter (numericFilter (numericField) (numericOp (rangeOp)) (numberVal (number))))) (ERROR (and)))
diff --git a/swh/search/query_language/tokens.js b/swh/search/query_language/tokens.js
index 9b29164..c8859cb 100644
--- a/swh/search/query_language/tokens.js
+++ b/swh/search/query_language/tokens.js
@@ -1,108 +1,110 @@
 // Copyright (C) 2021  The Software Heritage developers
 // See the AUTHORS file at the top-level directory of this distribution
 // License: GNU General Public License version 3, or any later version
 // See top-level LICENSE file for more information
 
 // Field tokens
 const visitTypeField = 'visit_type';
 const sortByField = 'sort_by';
 const limitField = 'limit';
 
 // Field categories
 const patternFields = ['origin', 'metadata'];
 const booleanFields = ['visited'];
 const numericFields = ['visits'];
 const boundedListFields = [visitTypeField];
 const listFields = ['language', 'license', 'keyword'];
 const dateFields = [
     'last_visit',
     'last_eventful_visit',
     'last_revision',
     'last_release',
     'created',
     'modified',
     'published'
 ];
 
 const fields = [].concat(
     patternFields,
     booleanFields,
     numericFields,
     boundedListFields,
     listFields,
     dateFields
 );
 
 // Operators
 const equalOp = ['='];
+const containOp = [':'];
 const rangeOp = ['<', '<=', '=', '!=', '>=', '>'];
 const choiceOp = ['in', 'not in'];
 
 
 // Values
 const sortByOptions = [
     'visits',
     'last_visit',
     'last_eventful_visit',
     'last_revision',
     'last_release',
     'created',
     'modified',
     'published'
 ];
 
 const visitTypeOptions = [
     "any",
     "bzr",
     "cran",
     "cvs",
     "deb",
     "deposit",
     "ftp",
     "hg",
     "git",
     "nixguix",
     "npm",
     "opam",
     "pypi",
     "svn",
     "tar"
 ];
 
 // Extra tokens
 const OR = "or";
 const AND = "and";
 
 const TRUE = "true";
 const FALSE = "false";
 
 module.exports = {
     // Field tokens
     visitTypeField,
     sortByField,
     limitField,
 
     // Field categories
     patternFields,
     booleanFields,
     numericFields,
     boundedListFields,
     listFields,
     dateFields,
     fields,
 
     // Operators
     equalOp,
+    containOp,
     rangeOp,
     choiceOp,
 
     // Values
     sortByOptions,
     visitTypeOptions,
 
     // Extra tokens
     OR,
     AND,
     TRUE,
     FALSE
 }
diff --git a/swh/search/tests/test_elasticsearch.py b/swh/search/tests/test_elasticsearch.py
index 9dc8dd5..b152b34 100644
--- a/swh/search/tests/test_elasticsearch.py
+++ b/swh/search/tests/test_elasticsearch.py
@@ -1,192 +1,192 @@
 # Copyright (C) 2019-2022  The Software Heritage developers
 # See the AUTHORS file at the top-level directory of this distribution
 # License: GNU General Public License version 3, or any later version
 # See top-level LICENSE file for more information
 
 from datetime import datetime, timedelta, timezone
 from textwrap import dedent
 import types
 import unittest
 
 from elasticsearch.helpers.errors import BulkIndexError
 import pytest
 
 from swh.search.exc import SearchQuerySyntaxError
 from swh.search.metrics import OPERATIONS_METRIC
 
 from .test_search import CommonSearchTest
 
 now = datetime.now(tz=timezone.utc).isoformat()
 now_minus_5_hours = (datetime.now(tz=timezone.utc) - timedelta(hours=5)).isoformat()
 now_plus_5_hours = (datetime.now(tz=timezone.utc) + timedelta(hours=5)).isoformat()
 
 ORIGINS = [
     {
         "url": "http://foobar.1.com",
         "nb_visits": 1,
         "last_visit_date": now_minus_5_hours,
         "last_eventful_visit_date": now_minus_5_hours,
     },
     {
         "url": "http://foobar.2.com",
         "nb_visits": 2,
         "last_visit_date": now,
         "last_eventful_visit_date": now,
     },
     {
         "url": "http://foobar.3.com",
         "nb_visits": 3,
         "last_visit_date": now_plus_5_hours,
         "last_eventful_visit_date": now_minus_5_hours,
     },
     {
         "url": "http://barbaz.4.com",
         "nb_visits": 3,
         "last_visit_date": now_plus_5_hours,
         "last_eventful_visit_date": now_minus_5_hours,
     },
 ]
 
 
 class BaseElasticsearchTest(unittest.TestCase):
     @pytest.fixture(autouse=True)
     def _instantiate_search(self, swh_search, elasticsearch_host, mocker):
         self._elasticsearch_host = elasticsearch_host
         self.search = swh_search
         self.mocker = mocker
 
         # override self.search.origin_update to catch painless script errors
         # and pretty print them
         origin_update = self.search.origin_update
 
         def _origin_update(self, *args, **kwargs):
             script_error = False
             error_detail = ""
             try:
                 origin_update(*args, **kwargs)
             except BulkIndexError as e:
                 error = e.errors[0].get("update", {}).get("error", {}).get("caused_by")
                 if error and "script_stack" in error:
                     script_error = True
                     error_detail = dedent(
                         f"""
                         Painless update script failed ({error.get('reason')}).
                         error type: {error.get('caused_by', {}).get('type')}
                         error reason: {error.get('caused_by', {}).get('reason')}
                         script stack:
 
                         """
                     )
                     error_detail += "\n".join(error["script_stack"])
                 else:
                     raise e
             assert script_error is False, error_detail[1:]
 
         self.search.origin_update = types.MethodType(_origin_update, self.search)
 
     def reset(self):
         self.search.deinitialize()
         self.search.initialize()
 
 
 class TestElasticsearchSearch(CommonSearchTest, BaseElasticsearchTest):
     def test_metrics_update_duration(self):
         mock = self.mocker.patch("swh.search.metrics.statsd.timing")
 
         for url in ["http://foobar.bar", "http://foobar.baz"]:
             self.search.origin_update([{"url": url}])
 
         assert mock.call_count == 2
 
     def test_metrics_search_duration(self):
         mock = self.mocker.patch("swh.search.metrics.statsd.timing")
 
         for url_pattern in ["foobar", "foobaz"]:
             self.search.origin_search(url_pattern=url_pattern, with_visit=True)
 
         assert mock.call_count == 2
 
     def test_metrics_indexation_counters(self):
         mock_es = self.mocker.patch("elasticsearch.helpers.bulk")
         mock_es.return_value = 2, ["error"]
 
         mock_metrics = self.mocker.patch("swh.search.metrics.statsd.increment")
 
         self.search.origin_update([{"url": "http://foobar.baz"}])
 
         assert mock_metrics.call_count == 2
 
         mock_metrics.assert_any_call(
             OPERATIONS_METRIC,
             2,
             tags={
                 "endpoint": "origin_update",
                 "object_type": "document",
                 "operation": "index",
             },
         )
         mock_metrics.assert_any_call(
             OPERATIONS_METRIC,
             1,
             tags={
                 "endpoint": "origin_update",
                 "object_type": "document",
                 "operation": "index_error",
             },
         )
 
     def test_write_alias_usage(self):
         mock = self.mocker.patch("elasticsearch.helpers.bulk")
         mock.return_value = 2, ["result"]
 
         self.search.origin_update([{"url": "http://foobar.baz"}])
 
         assert mock.call_args[1]["index"] == "test-write"
 
     def test_read_alias_usage(self):
         mock = self.mocker.patch("elasticsearch.Elasticsearch.search")
         mock.return_value = {"hits": {"hits": []}}
 
         self.search.origin_search(url_pattern="foobar.baz")
 
         assert mock.call_args[1]["index"] == "test-read"
 
     def test_sort_by_and_limit_query(self):
 
         self.search.origin_update(ORIGINS)
         self.search.flush()
 
         def _check_results(query, origin_indices):
             page = self.search.origin_search(url_pattern="foobar", query=query)
             results = [r["url"] for r in page.results]
             assert results == [ORIGINS[index]["url"] for index in origin_indices]
 
         _check_results("sort_by = [-visits]", [2, 1, 0])
         _check_results("sort_by = [last_visit]", [0, 1, 2])
         _check_results("sort_by = [-last_eventful_visit, visits]", [1, 0, 2])
         _check_results("sort_by = [last_eventful_visit,-last_visit]", [2, 0, 1])
 
         _check_results("sort_by = [-visits] limit = 1", [2])
         _check_results("sort_by = [last_visit] and limit = 2", [0, 1])
         _check_results("sort_by = [-last_eventful_visit, visits] limit = 3", [1, 0, 2])
 
     def test_search_ql_simple(self):
         self.search.origin_update(ORIGINS)
         self.search.flush()
 
         results = {
             r["url"]
-            for r in self.search.origin_search(query='origin = "foobar"').results
+            for r in self.search.origin_search(query='origin : "foobar"').results
         }
         assert results == {
             "http://foobar.1.com",
             "http://foobar.2.com",
             "http://foobar.3.com",
         }
 
     def test_query_syntax_error(self):
         self.search.origin_update(ORIGINS)
         self.search.flush()
 
         with pytest.raises(SearchQuerySyntaxError):
             self.search.origin_search(query="foobar")
diff --git a/swh/search/tests/test_translator.py b/swh/search/tests/test_translator.py
index 6de5bd5..23bf73d 100644
--- a/swh/search/tests/test_translator.py
+++ b/swh/search/tests/test_translator.py
@@ -1,405 +1,405 @@
 # Copyright (C) 2021  The Software Heritage developers
 # See the AUTHORS file at the top-level directory of this distribution
 # License: GNU General Public License version 3, or any later version
 # See top-level LICENSE file for more information
 
 import pytest
 
 from swh.search.translator import Translator
 from swh.search.utils import get_expansion
 
 
 def _test_results(query, expected):
     output = Translator().parse_query(query)
     assert output == expected
 
 
 def test_empty_query():
     query = ""
     with pytest.raises(Exception):
         _test_results(query, {})
 
 
 def test_conjunction_operators():
     query = "visited = true or visits > 2 and visits < 5"
     expected = {
         "filters": {
             "bool": {
                 "should": [
                     {"term": {"has_visits": True}},
                     {
                         "bool": {
                             "must": [
                                 {"range": {"nb_visits": {"gt": 2}}},
                                 {"range": {"nb_visits": {"lt": 5}}},
                             ]
                         }
                     },
                 ]
             }
         }
     }
     _test_results(query, expected)
 
 
 def test_conjunction_op_precedence_override():
     query = "(visited = false or visits > 2) and visits < 5"
     expected = {
         "filters": {
             "bool": {
                 "must": [
                     {
                         "bool": {
                             "should": [
                                 {"term": {"has_visits": False}},
                                 {"range": {"nb_visits": {"gt": 2}}},
                             ]
                         }
                     },
                     {"range": {"nb_visits": {"lt": 5}}},
                 ]
             }
         }
     }
 
     _test_results(query, expected)
 
 
 def test_limit_and_sortby():
     query = "visited = true sort_by = [-visits,last_visit] limit = 15"
     expected = {
         "filters": {"term": {"has_visits": True}},
         "sortBy": ["-visits", "last_visit"],
         "limit": 15,
     }
 
     _test_results(query, expected)
 
 
 def test_deeply_nested_filters():
     query = "(((visited = true and visits > 0)))"
     expected = {
         "filters": {
             "bool": {
                 "must": [
                     {"term": {"has_visits": True},},
                     {"range": {"nb_visits": {"gt": 0}}},
                 ]
             }
         },
     }
 
     _test_results(query, expected)
 
 
 def test_origin_and_metadata_filters():
-    query = 'origin = django or metadata = "framework and web"'
+    query = 'origin : django or metadata : "framework and web"'
     expected = {
         "filters": {
             "bool": {
                 "should": [
                     {
                         "multi_match": {
                             "query": "django",
                             "type": "bool_prefix",
                             "operator": "and",
                             "fields": [
                                 "url.as_you_type",
                                 "url.as_you_type._2gram",
                                 "url.as_you_type._3gram",
                             ],
                         }
                     },
                     {
                         "nested": {
                             "path": "intrinsic_metadata",
                             "query": {
                                 "multi_match": {
                                     "query": "framework and web",
                                     "type": "cross_fields",
                                     "operator": "and",
                                     "fields": ["intrinsic_metadata.*"],
                                     "lenient": True,
                                 }
                             },
                         }
                     },
                 ]
             }
         }
     }
 
     _test_results(query, expected)
 
 
 def test_visits_not_equal_to_filter():
     query = "visits != 5"
     expected = {
         "filters": {
             "bool": {"must_not": [{"range": {"nb_visits": {"gte": 5, "lte": 5}}},]}
         },
     }
 
     _test_results(query, expected)
 
 
 def test_visit_type_filter():
     query = 'visit_type = [git,"pypi"]'
     expected = {"filters": {"terms": {"visit_types": ["git", "pypi"]}}}
 
     _test_results(query, expected)
 
 
 def test_keyword_filter():
     query = r"""keyword in [word1, "word2 \" \' word3"]"""
     expected = {
         "filters": {
             "nested": {
                 "path": "intrinsic_metadata",
                 "query": {
                     "multi_match": {
                         "query": r"""word1 word2 " ' word3""",
                         "fields": [
                             get_expansion("keywords", ".") + "^2",
                             get_expansion("descriptions", "."),
                         ],
                     }
                 },
             }
         }
     }
 
     _test_results(query, expected)
 
 
 def test_language_filter():
     query = 'language in [python, "go lang", cpp]'
     expected = {
         "filters": {
             "nested": {
                 "path": "intrinsic_metadata",
                 "query": {
                     "bool": {
                         "should": [
                             {
                                 "match": {
                                     get_expansion(
                                         "programming_languages", "."
                                     ): "python"
                                 }
                             },
                             {
                                 "match": {
                                     get_expansion(
                                         "programming_languages", "."
                                     ): "go lang"
                                 }
                             },
                             {
                                 "match": {
                                     get_expansion("programming_languages", "."): "cpp"
                                 }
                             },
                         ]
                     }
                 },
             }
         }
     }
 
     _test_results(query, expected)
 
 
 def test_license_filter():
     query = 'license in ["GPL 3", Apache, MIT]'
     expected = {
         "filters": {
             "nested": {
                 "path": "intrinsic_metadata",
                 "query": {
                     "bool": {
                         "should": [
                             {"match": {get_expansion("licenses", "."): "GPL 3"}},
                             {"match": {get_expansion("licenses", "."): "Apache"}},
                             {"match": {get_expansion("licenses", "."): "MIT"}},
                         ]
                     }
                 },
             }
         }
     }
 
     _test_results(query, expected)
 
 
 def test_date_created_not_equal_to_filter():
     query = "created != 2020-01-01"
     expected = {
         "filters": {
             "nested": {
                 "path": "intrinsic_metadata",
                 "query": {
                     "bool": {
                         "must_not": [
                             {
                                 "range": {
                                     get_expansion("date_created", "."): {
                                         "gte": "2020-01-01",
                                         "lte": "2020-01-01",
                                     }
                                 }
                             }
                         ]
                     }
                 },
             }
         }
     }
 
     _test_results(query, expected)
 
 
 def test_date_created_greater_than_filter():
     query = "created >= 2020-01-01"
     expected = {
         "filters": {
             "nested": {
                 "path": "intrinsic_metadata",
                 "query": {
                     "bool": {
                         "must": [
                             {
                                 "range": {
                                     get_expansion("date_created", "."): {
                                         "gte": "2020-01-01",
                                     }
                                 }
                             }
                         ]
                     }
                 },
             }
         }
     }
 
     _test_results(query, expected)
 
 
 def test_last_eventful_visit_not_equal_to_filter():
     query = "last_visit != 2020-01-01"
     expected = {
         "filters": {
             "bool": {
                 "must_not": [
                     {
                         "range": {
                             "last_visit_date": {
                                 "gte": "2020-01-01",
                                 "lte": "2020-01-01",
                             }
                         }
                     }
                 ]
             }
         }
     }
 
     _test_results(query, expected)
 
 
 def test_last_eventful_visit_less_than_to_filter():
     query = "last_visit < 2020-01-01"
     expected = {"filters": {"range": {"last_visit_date": {"lt": "2020-01-01"}}}}
 
     _test_results(query, expected)
 
 
 def test_keyword_no_escape_inside_filter():
     # any keyword (filter name/operator/value) inside a filter
     # must be considered a string.
-    query = r'''origin = "language in [\'go lang\', python]"'''
+    query = r'''origin : "language in [\'go lang\', python]"'''
     expected = {
         "filters": {
             "multi_match": {
                 "query": r"""language in ['go lang', python]""",
                 "type": "bool_prefix",
                 "operator": "and",
                 "fields": [
                     "url.as_you_type",
                     "url.as_you_type._2gram",
                     "url.as_you_type._3gram",
                 ],
             }
         }
     }
     _test_results(query, expected)
 
 
 def test_escaped_punctuation_parsing():
     query = r"""keyword in ["foo \'\" bar"]"""
     expected = {
         "filters": {
             "nested": {
                 "path": "intrinsic_metadata",
                 "query": {
                     "multi_match": {
                         "query": r"""foo '" bar""",
                         "fields": [
                             get_expansion("keywords", ".") + "^2",
                             get_expansion("descriptions", "."),
                         ],
                     }
                 },
             }
         }
     }
     _test_results(query, expected)
 
 
 def test_nonascii():
     query = r"""keyword in ["café"]"""
     expected = {
         "filters": {
             "nested": {
                 "path": "intrinsic_metadata",
                 "query": {
                     "multi_match": {
                         "query": r"""café""",
                         "fields": [
                             get_expansion("keywords", ".") + "^2",
                             get_expansion("descriptions", "."),
                         ],
                     }
                 },
             }
         }
     }
     _test_results(query, expected)
 
 
 def test_nonascii_before_operator():
     query = r"""keyword in ["🐍"] and visited = true"""
     expected = {
         "filters": {
             "bool": {
                 "must": [
                     {
                         "nested": {
                             "path": "intrinsic_metadata",
                             "query": {
                                 "multi_match": {
                                     "query": r"""🐍""",
                                     "fields": [
                                         get_expansion("keywords", ".") + "^2",
                                         get_expansion("descriptions", "."),
                                     ],
                                 }
                             },
                         },
                     },
                     {"term": {"has_visits": True,},},
                 ],
             }
         }
     }
     _test_results(query, expected)