diff --git a/swh/web/ui/templates/includes/apidoc-header.html b/swh/web/ui/templates/includes/apidoc-header.html
index ab66caff3..ba6213ff3 100644
--- a/swh/web/ui/templates/includes/apidoc-header.html
+++ b/swh/web/ui/templates/includes/apidoc-header.html
@@ -1,103 +1,104 @@
 <p>This document describes the <strong>Software Heritage Web API</strong>.</p>
 <ul>
 <li>
 <a href="{{ url_for('browse_api_doc') }}#endpoint-index">Endpoint index</a>
 </li>
 <li>
 <a href="{{ url_for('browse_api_doc') }}#data-model">Data model</a>
 </li>
 <li>
 <a href="{{ url_for('browse_api_doc') }}#version">Version</a>
 </li>
 <li>
 <a href="{{ url_for('browse_api_doc') }}#schema">Schema</a>
 </li>
 <li>
 <a href="{{ url_for('browse_api_doc') }}#parameters">Parameters</a>
 </li>
 <li>
 <a href="{{ url_for('browse_api_doc') }}#errors">Errors</a>
 </li>
 <li>
 <a href="{{ url_for('browse_api_doc') }}#pagination">Pagination</a>
 </li>
 <li>
 <a href="{{ url_for('browse_api_doc') }}#errors">Rate limiting</a>
 </li>
 </ul>
 <h3 id="endpoint-index">Endpoint index</h3>
 <p>You can jump directly to the <strong><a href="/api/1/">endpoint index</a></strong>, which lists all available API functionalities, or read on for more general information about the API.</p>
 <h3 id="data-model">Data model</h3>
 <p>The <a href="https://www.softwareheritage.org/">Software Heritage</a> project harvests publicly available source code by tracking software distribution channels such as version control systems, tarball releases, and distribution packages.</p>
 <p>All retrieved source code and related metadata are stored in the Software Heritage archive, that is conceptually a <a href="https://en.wikipedia.org/wiki/Merkle_tree">Merkle DAG</a>. All nodes in the graph are content-addressable, i.e., their node identifiers are computed by hashing their content and, transitively, that of all nodes reachable from them; and no node or edge is ever removed from the graph: the Software Heritage archive is an append-only data structure.</p>
 <p>The following types of objects (i.e., graph nodes) can be found in the Software Heritage archive <small>(for more information see the <a href="https://wiki.softwareheritage.org/index.php?title=Glossary">Software Heritage glossary</a>)</small>:</p>
 <ul>
 <li><strong>Content</strong>: a specific version of a file stored in the archive, identified by its cryptographic hashes (currently: SHA1, Git-like &quot;salted&quot; SHA1, SHA256). Note that content objects are nameless; their names are context-dependent and stored as part of directory entries (see below).<br /> <em>Also known as:</em> &quot;blob&quot;</li>
 <li><strong>Directory</strong>: a list of directory entries, where each entry can point to content objects (&quot;file entries&quot;), revisions (&quot;revision entries&quot;), or transitively to other directories (&quot;directory entries&quot;). All entries are associated to the local name of the entry (i.e., a relative path without any path separator) and permission metadata (e.g., chmod value or equivalent).</li>
 <li><strong>Revision</strong>: a point in time snapshot of the content of a directory, together with associated development metadata (e.g., author, timestamp, log message, etc).<br /> <em>Also known as:</em> &quot;commit&quot;.</li>
 <li><strong>Release</strong>: a revision that has been marked as noteworthy with a specific name (e.g., a version number), together with associated development metadata (e.g., author, timestamp, etc).<br /> <em>Also known as:</em> &quot;tag&quot;</li>
 <li><strong>Origin</strong>: an Internet-based location from which a coherent set of objects (contents, revisions, releases, etc.) archived by Software Heritage has been obtained. Origins are currently identified by URLs.</li>
 <li><strong>Visit</strong>: the passage of Software Heritage on a given origin, to retrieve all source code and metadata available there at the time. A visit object stores the state of all visible branches (if any) available at the origin at visit time; each of them points to a revision object in the archive. Future visits of the same origin will create new visit objects, without removing previous ones.</li>
+<li><strong>Person</strong>: an entity referenced by a revision as either the author or the committer of the corresponding change. A person is associated to a full name and/or an email address.</li>
 </ul>
 <h3 id="version">Version</h3>
 <p>The current version of the API is <strong>v1</strong>.</p>
 <h3 id="schema">Schema</h3>
 <p>API access is over HTTPS.</p>
 <p>All API endpoints are rooted at <a href="https://archive.softwareheritage.org/api/1/" class="uri">https://archive.softwareheritage.org/api/1/</a>.</p>
 <p>Data is sent and received as JSON by default.</p>
 <p>Example:</p>
 <ul>
 <li><p>from the command line:</p>
 <pre class="shell"><code>curl -i https://archive.softwareheritage.org/api/1/stat/counters/</code></pre></li>
 </ul>
 <h4 id="response-format-override">Response format override</h4>
 <p>The response format can be overridden using the <code>Accept</code> request header. In particular, <code>Accept: text/html</code> (that web browsers send by default) requests HTML pretty-printing, whereas <code>Accept: application/yaml</code> requests YAML-encoded responses.</p>
 <p>Example:</p>
 <ul>
 <li><a href="/api/1/stat/counters/" class="uri">/api/1/stat/counters/</a></li>
 <li><p>from the command line:</p>
 <pre class="shell"><code>curl -i -H &#39;Accept: application/yaml&#39; https://archive.softwareheritage.org/api/1/stat/counters/</code></pre></li>
 </ul>
 <h3 id="parameters">Parameters</h3>
 <p>Some API endpoints can be tweaked by passing optional parameters. For GET requests, optional parameters can be passed as an HTTP query string.</p>
 <p>The optional parameter <code>fields</code> is accepted by all endpoints that return dictionaries and can be used to restrict the list of fields returned by the API, in case you are not interested in all of them. By default, all available fields are returned.</p>
 <p>Example:</p>
 <ul>
 <li><a href="/api/1/stat/counters/?fields=content,directory,revision" class="uri">/api/1/stat/counters/?fields=content,directory,revision</a></li>
 <li><p>from the command line:</p>
 <pre class="shell"><code>curl https://archive.softwareheritage.org/api/1/stat/counters/?fields=content,directory,revision</code></pre></li>
 </ul>
 <h3 id="errors">Errors</h3>
 <p>While API endpoints will return different kinds of errors depending on their own semantics, some error patterns are common across all endpoints.</p>
 <p>Sending malformed data, including syntactically incorrect object identifiers, will result in a <code>400 Bad Request</code> HTTP response. Example:</p>
 <ul>
 <li><a href="/api/1/content/deadbeef/" class="uri">/api/1/content/deadbeef/</a> (client error: &quot;deadbeef&quot; is too short to be a syntactically valid object identifier)</li>
 <li><p>from the command line:</p>
 <pre class="shell"><code>curl -i https://archive.softwareheritage.org/api/1/content/deadbeef/</code></pre></li>
 </ul>
 <p>Requesting non existent resources will result in a <code>404 Not Found</code> HTTP response. Example:</p>
 <ul>
 <li><a href="/api/1/content/0123456789abcdef0123456789abcdef01234567/" class="uri">/api/1/content/0123456789abcdef0123456789abcdef01234567/</a> (error: no object with that identifier is available [yet?])</li>
 <li><p>from the command line:</p>
 <pre class="shell"><code>curl -i https://archive.softwareheritage.org/api/1/content/04740277a81c5be6c16f6c9da488ca073b770d7f/</code></pre></li>
 </ul>
 <h3 id="pagination">Pagination</h3>
 <p>Requests that might potentially return many items will be paginated.</p>
 <p>Page size is set to a default (usually: 10 items), but might be overridden with the <code>per_page</code> query parameter up to a maximum (usually: 50 items). Example:</p>
 <pre class="shell"><code>curl https://archive.softwareheritage.org/api/1/origin/1/visits/?per_page=2</code></pre>
 <p>To navigate through paginated results, a <code>Link</code> HTTP response header is available to link the current result page to the next one. Example:</p>
 <pre><code>curl -i https://archive.softwareheritage.org/api/1/origin/1/visits/?per_page=2 | grep ^Link:
 Link: &lt;/api/1/origin/1/visits/?last_visit=2&amp;per_page=2&gt;; rel=&quot;next&quot;,</code></pre>
 <h3 id="rate-limiting">Rate limiting</h3>
 <p>Due to limited resource availability on the back end side, API usage is currently rate limited. Furthermore, as API usage is currently entirely anonymous (i.e., without any authentication), API &quot;users&quot; are currently identified by their origin IP address.</p>
 <p>Three HTTP response fields will inform you about the current state of limits that apply to your current rate limiting bucket:</p>
 <ul>
 <li><code>X-RateLimit-Limit</code>: maximum number of permitted requests per hour</li>
 <li><code>X-RateLimit-Remaining</code>: number of permitted requests remaining before the next reset</li>
 <li><code>X-RateLimit-Reset</code>: the time (expressed in <a href="https://en.wikipedia.org/wiki/Unix_time">Unix time</a> seconds) at which the current rate limiting will expire, resetting to a fresh <code>X-RateLimit-Limit</code></li>
 </ul>
 <p>Example:</p>
 <pre><code>curl -i https://archive.softwareheritage.org/api/1/stat/counters/ | grep ^X-RateLimit
 X-RateLimit-Limit: 60
 X-RateLimit-Remaining: 54
 X-RateLimit-Reset: 1485794532</code></pre>
diff --git a/swh/web/ui/templates/includes/apidoc-header.md b/swh/web/ui/templates/includes/apidoc-header.md
index 1d9c495bd..d744a1b61 100644
--- a/swh/web/ui/templates/includes/apidoc-header.md
+++ b/swh/web/ui/templates/includes/apidoc-header.md
@@ -1,189 +1,192 @@
 This document describes the <strong>Software Heritage Web API</strong>.
 
 <ul>
   <li><a href="{{ url_for('browse_api_doc') }}#endpoint-index">Endpoint index</a></li>
   <li><a href="{{ url_for('browse_api_doc') }}#data-model">Data model</a></li>
   <li><a href="{{ url_for('browse_api_doc') }}#version">Version</a></li>
   <li><a href="{{ url_for('browse_api_doc') }}#schema">Schema</a></li>
   <li><a href="{{ url_for('browse_api_doc') }}#parameters">Parameters</a></li>
   <li><a href="{{ url_for('browse_api_doc') }}#errors">Errors</a></li>
   <li><a href="{{ url_for('browse_api_doc') }}#pagination">Pagination</a></li>
   <li><a href="{{ url_for('browse_api_doc') }}#errors">Rate limiting</a></li>
 </ul>
 
 ### Endpoint index
 
 You can jump directly to the <strong><a href="/api/1/">endpoint
 index</a></strong>, which lists all available API functionalities, or read on
 for more general information about the API.
 
 
 ### Data model
 
 The [Software Heritage](https://www.softwareheritage.org/) project harvests
 publicly available source code by tracking software distribution channels such
 as version control systems, tarball releases, and distribution packages.
 
 All retrieved source code and related metadata are stored in the Software
 Heritage archive, that is conceptually
 a [Merkle DAG](https://en.wikipedia.org/wiki/Merkle_tree). All nodes in the
 graph are content-addressable, i.e., their node identifiers are computed by
 hashing their content and, transitively, that of all nodes reachable from them;
 and no node or edge is ever removed from the graph: the Software Heritage
 archive is an append-only data structure.
 
 The following types of objects (i.e., graph nodes) can be found in the Software
 Heritage archive <small>(for more information see
 the
 [Software Heritage glossary](https://wiki.softwareheritage.org/index.php?title=Glossary))</small>:
 
 - **Content**: a specific version of a file stored in the archive, identified
   by its cryptographic hashes (currently: SHA1, Git-like "salted" SHA1,
   SHA256). Note that content objects are nameless; their names are
   context-dependent and stored as part of directory entries (see below).<br />
   *Also known as:* "blob"
 - **Directory**: a list of directory entries, where each entry can point to
   content objects ("file entries"), revisions ("revision entries"), or
   transitively to other directories ("directory entries"). All entries are
   associated to the local name of the entry (i.e., a relative path without any
   path separator) and permission metadata (e.g., chmod value or equivalent).
 - **Revision**: a point in time snapshot of the content of a directory,
   together with associated development metadata (e.g., author, timestamp, log
   message, etc).<br />
   *Also known as:* "commit".
 - **Release**: a revision that has been marked as noteworthy with a specific
   name (e.g., a version number), together with associated development metadata
   (e.g., author, timestamp, etc).<br />
   *Also known as:* "tag"
 - **Origin**: an Internet-based location from which a coherent set of objects
   (contents, revisions, releases, etc.) archived by Software Heritage has been
   obtained. Origins are currently identified by URLs.
 - **Visit**: the passage of Software Heritage on a given origin, to retrieve
   all source code and metadata available there at the time. A visit object
   stores the state of all visible branches (if any) available at the origin at
   visit time; each of them points to a revision object in the archive. Future
   visits of the same origin will create new visit objects, without removing
   previous ones.
+- **Person**: an entity referenced by a revision as either the author or the
+  committer of the corresponding change. A person is associated to a full name
+  and/or an email address.
 
 
 ### Version
 
 The current version of the API is **v1**.
 
 
 ### Schema
 
 API access is over HTTPS.
 
 All API endpoints are rooted at <https://archive.softwareheritage.org/api/1/>.
 
 Data is sent and received as JSON by default.
 
 Example:
 
 - from the command line:
 ``` shell
 curl -i https://archive.softwareheritage.org/api/1/stat/counters/
 ```
 
 #### Response format override
 
 The response format can be overridden using the `Accept` request header. In
 particular, `Accept: text/html` (that web browsers send by default) requests
 HTML pretty-printing, whereas `Accept: application/yaml` requests YAML-encoded
 responses.
 
 Example:
 
 - [/api/1/stat/counters/](/api/1/stat/counters/)
 - from the command line:
 ``` shell
 curl -i -H 'Accept: application/yaml' https://archive.softwareheritage.org/api/1/stat/counters/
 ```
 
 ### Parameters
 
 Some API endpoints can be tweaked by passing optional parameters. For GET
 requests, optional parameters can be passed as an HTTP query string.
 
 The optional parameter `fields` is accepted by all endpoints that return
 dictionaries and can be used to restrict the list of fields returned by the
 API, in case you are not interested in all of them. By default, all available
 fields are returned.
 
 Example:
 
 - [/api/1/stat/counters/\?fields\=content,directory,revision](/api/1/stat/counters/?fields=content,directory,revision)
 - from the command line:
 ``` shell
 curl https://archive.softwareheritage.org/api/1/stat/counters/?fields=content,directory,revision
 ```
 
 
 ### Errors
 
 While API endpoints will return different kinds of errors depending on their
 own semantics, some error patterns are common across all endpoints.
 
 Sending malformed data, including syntactically incorrect object identifiers,
 will result in a `400 Bad Request` HTTP response. Example:
 
 - [/api/1/content/deadbeef/](/api/1/content/deadbeef/) (client error:
   "deadbeef" is too short to be a syntactically valid object identifier)
 - from the command line:
 ``` shell
 curl -i https://archive.softwareheritage.org/api/1/content/deadbeef/
 ```
 
 Requesting non existent resources will result in a `404 Not Found` HTTP
 response. Example:
 
 - [/api/1/content/0123456789abcdef0123456789abcdef01234567/](/api/1/content/0123456789abcdef0123456789abcdef01234567/)
   (error: no object with that identifier is available [yet?])
 - from the command line:
 ``` shell
 curl -i https://archive.softwareheritage.org/api/1/content/04740277a81c5be6c16f6c9da488ca073b770d7f/
 ```
 
 
 ### Pagination
 
 Requests that might potentially return many items will be paginated.
 
 Page size is set to a default (usually: 10 items), but might be overridden with
 the `per_page` query parameter up to a maximum (usually: 50 items). Example:
 
 ``` shell
 curl https://archive.softwareheritage.org/api/1/origin/1/visits/?per_page=2
 ```
 
 To navigate through paginated results, a `Link` HTTP response header is
 available to link the current result page to the next one. Example:
 
     curl -i https://archive.softwareheritage.org/api/1/origin/1/visits/?per_page=2 | grep ^Link:
     Link: </api/1/origin/1/visits/?last_visit=2&per_page=2>; rel="next",
 
 
 ### Rate limiting
 
 Due to limited resource availability on the back end side, API usage is
 currently rate limited.  Furthermore, as API usage is currently entirely
 anonymous (i.e., without any authentication), API "users" are currently
 identified by their origin IP address.
 
 Three HTTP response fields will inform you about the current state of limits
 that apply to your current rate limiting bucket:
 
 - `X-RateLimit-Limit`: maximum number of permitted requests per hour
 - `X-RateLimit-Remaining`: number of permitted requests remaining before the
   next reset
 - `X-RateLimit-Reset`: the time (expressed
   in [Unix time](https://en.wikipedia.org/wiki/Unix_time) seconds) at which the
   current rate limiting will expire, resetting to a fresh `X-RateLimit-Limit`
 
 Example:
 
     curl -i https://archive.softwareheritage.org/api/1/stat/counters/ | grep ^X-RateLimit
     X-RateLimit-Limit: 60
     X-RateLimit-Remaining: 54
     X-RateLimit-Reset: 1485794532
diff --git a/swh/web/ui/views/api.py b/swh/web/ui/views/api.py
index 4e8c1b0e3..4d0bf1d59 100644
--- a/swh/web/ui/views/api.py
+++ b/swh/web/ui/views/api.py
@@ -1,1081 +1,1095 @@
 # Copyright (C) 2015-2017  The Software Heritage developers
 # See the AUTHORS file at the top-level directory of this distribution
 # License: GNU Affero General Public License version 3, or any later version
 # See top-level LICENSE file for more information
 
 from types import GeneratorType
 
 from flask import request, url_for
 
 from swh.web.ui import service, utils, apidoc as doc
 from swh.web.ui.exc import NotFoundExc
 from swh.web.ui.main import app
 
 
 # canned doc string snippets that are used in several doc strings
 
 _doc_arg_content_id = """A "[HASH_TYPE:]HASH" content identifier, where
    HASH_TYPE is one of "sha1" (the default), "sha1_git", "sha256", and HASH is
    a checksum obtained with the HASH_TYPE hashing algorithm."""
 
 _doc_exc_bad_id = 'syntax error in the given identifier(s)'
 _doc_exc_id_not_found = 'no object matching the given criteria could be found'
 
 _doc_header_link = """indicates that a subsequent result page is available,
     pointing to it"""
 
 
 @app.route('/api/1/stat/counters/')
 @doc.route('/api/1/stat/counters/', noargs=True)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="A dictionary of SWH's most important statistics")
 def api_stats():
     """Return statistics on SWH storage.
 
     """
     return service.stat_counters()
 
 
 @app.route('/api/1/origin/<int:origin_id>/visits/')
 @doc.route('/api/1/origin/visits/')
 @doc.arg('origin_id',
          default=1,
          argtype=doc.argtypes.int,
-         argdoc='A software origin identifier')
+         argdoc='software origin identifier')
 @doc.header('Link', doc=_doc_header_link)
 @doc.param('last_visit', default=None,
            argtype=doc.argtypes.int,
            doc='visit to start listing from, for pagination purposes')
 @doc.param('per_page', default=10,
            argtype=doc.argtypes.int,
            doc='number of visits to list, for pagination purposes')
 @doc.returns(rettype=doc.rettypes.list,
              retdoc="""a list of dictionaries describing individual visits.
              For each visit, its identifier, timestamp (as UNIX time), outcome,
              and visit-specific URL for more information are given.""")
 def api_origin_visits(origin_id):
     """Get information about all visits of a given software origin.
 
     """
     result = {}
     per_page = int(request.args.get('per_page', '10'))
     last_visit = request.args.get('last_visit')
     if last_visit:
         last_visit = int(last_visit)
 
     def _lookup_origin_visits(
             origin_id, last_visit=last_visit, per_page=per_page):
         return service.lookup_origin_visits(
             origin_id, last_visit=last_visit, per_page=per_page)
 
     def _enrich_origin_visit(origin_visit):
         ov = origin_visit.copy()
         ov['origin_visit_url'] = url_for('api_origin_visit',
                                          origin_id=ov['origin'],
                                          visit_id=ov['visit'])
         return ov
 
     r = _api_lookup(
         origin_id,
         _lookup_origin_visits,
         error_msg_if_not_found='No origin %s found' % origin_id,
         enrich_fn=_enrich_origin_visit)
 
     if r:
         l = len(r)
         if l == per_page:
             new_last_visit = r[-1]['visit']
             params = {
                 'origin_id': origin_id,
                 'last_visit': new_last_visit
             }
 
             if request.args.get('per_page'):
                 params['per_page'] = per_page
 
             result['headers'] = {
                 'link-next': url_for('api_origin_visits', **params)
             }
 
     result.update({
         'results': r
     })
 
     return result
 
 
 @app.route('/api/1/origin/<int:origin_id>/visit/<int:visit_id>/')
 @doc.route('/api/1/origin/visit/')
 @doc.arg('origin_id',
          default=1,
          argtype=doc.argtypes.int,
-         argdoc='A software origin identifier')
+         argdoc='software origin identifier')
 @doc.arg('visit_id',
          default=1,
          argtype=doc.argtypes.int,
-         argdoc='A visit identifier, for origin identified by ORIGIN_ID')
+         argdoc="""visit identifier, relative to the origin identified by
+         origin_id""")
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
-@doc.returns(rettype=doc.rettypes.list,
-             retdoc="""A dictionary containing both metadata for the entire
+@doc.returns(rettype=doc.rettypes.dict,
+             retdoc="""dictionary containing both metadata for the entire
              visit (e.g., timestamp as UNIX time, visit outcome, etc.) and what
              was at the software origin during the visit (i.e., a mapping from
              branches to other archive objects""")
 def api_origin_visit(origin_id, visit_id):
     """Get information about a specific visit of a software origin.
 
     """
     def _enrich_origin_visit(origin_visit):
         ov = origin_visit.copy()
         ov['origin_url'] = url_for('api_origin', origin_id=ov['origin'])
         if 'occurrences' in ov:
             ov['occurrences'] = {
                 k: utils.enrich_object(v)
                 for k, v in ov['occurrences'].items()
             }
         return ov
 
     return _api_lookup(
         origin_id,
         service.lookup_origin_visit,
         'No visit %s for origin %s found' % (visit_id, origin_id),
         _enrich_origin_visit,
         visit_id)
 
 
 @app.route('/api/1/content/symbol/', methods=['POST'])
 @app.route('/api/1/content/symbol/<string:q>/')
 @doc.route('/api/1/content/symbol/', tags=['upcoming'])
 @doc.arg('q',
          default='hello',
          argtype=doc.argtypes.str,
          argdoc="""An expression string to lookup in swh's raw content""")
 @doc.header('Link', doc=_doc_header_link)
 @doc.param('last_sha1', default=None,
            argtype=doc.argtypes.str,
            doc="""Optional parameter to start returning page results from.""")
 @doc.param('per_page', default=10,
            argtype=doc.argtypes.int,
            doc="""Optional parameter to limit the number of data per page
                   to retrieve on a per request basis.
                   The default is 10, up to 50 max.""")
 @doc.returns(rettype=doc.rettypes.list,
              retdoc="""A list of dict whose content matches the expression.
              Each dict has the following keys:
              - id (bytes): identifier of the content
              - name (text): symbol whose content match the expression
              - kind (text): kind of the symbol that matched
              - lang (text): Language for that entry
              - line (int): Number line for the symbol
 
              """)
 def api_content_symbol(q=None):
     """Search content objects by `Ctags <http://ctags.sourceforge.net/>`_-style
     symbol (e.g., function name, data type, method, etc.)
 
     """
     result = {}
     last_sha1 = request.args.get('last_sha1', None)
     per_page = int(request.args.get('per_page', '10'))
 
     def lookup_exp(exp, last_sha1=last_sha1, per_page=per_page):
         return service.lookup_expression(exp, last_sha1, per_page)
 
     symbols = _api_lookup(
         q,
         lookup_fn=lookup_exp,
         error_msg_if_not_found='No indexed raw content match expression \''
         '%s\'.' % q,
         enrich_fn=lambda x: utils.enrich_content(x, top_url=True))
 
     if symbols:
         l = len(symbols)
 
         if l == per_page:
             new_last_sha1 = symbols[-1]['sha1']
             params = {
                 'q': q,
                 'last_sha1': new_last_sha1,
             }
 
             if request.args.get('per_page'):
                 params['per_page'] = per_page
 
             result['headers'] = {
                 'link-next': url_for('api_content_symbol', **params),
             }
 
     result.update({
         'results': symbols
     })
 
     return result
 
 
 @app.route('/api/1/content/known/', methods=['POST'])
 @app.route('/api/1/content/known/<string:q>/')
 @doc.route('/api/1/content/known/')
 @doc.arg('q',
          default='adc83b19e793491b1c6ea0fd8b46cd9f32e592fc',
          argtype=doc.argtypes.algo_and_hash,
          argdoc=_doc_arg_content_id)
 @doc.param('q', default=None,
            argtype=doc.argtypes.str,
            doc="""(POST request) An algo_hash:hash string, where algo_hash
                   is one of sha1, sha1_git or sha256 and hash is the hash to
                   search for in SWH""")
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""A dict with keys:
 
              - search_res: a list of dicts corresponding to queried content
                with key 'found' to True if found, 'False' if not
              - search_stats: a dict containing number of files searched and
                percentage of files found
              """)
 def api_check_content_known(q=None):
-    """Check whether some content (AKA "blob") is present in the archive
+    """Check whether some content (AKA "blob") is present in the archive.
 
     **TODO** pending review
 
     Lookup can be performed by various means:
 
     - a GET request with many hashes (limited to the size of parameter
       we can pass in url) a POST request with many hashes, with the
     - request body containing identifiers (typically filenames) as
     - keys and corresponding hashes as values.
 
     """
 
     response = {'search_res': None,
                 'search_stats': None}
     search_stats = {'nbfiles': 0, 'pct': 0}
     search_res = None
 
     queries = []
     # GET: Many hash separated values request
     if q:
         hashes = q.split(',')
         for v in hashes:
             queries.append({'filename': None, 'sha1': v})
 
     # POST: Many hash requests in post form submission
     elif request.method == 'POST':
         data = request.form
         # Remove potential inputs with no associated value
         for k, v in data.items():
             if v is not None:
                 if k == 'q' and len(v) > 0:
                     queries.append({'filename': None, 'sha1': v})
                 elif v != '':
                     queries.append({'filename': k, 'sha1': v})
 
     if queries:
         lookup = service.lookup_multiple_hashes(queries)
         result = []
         l = len(queries)
         for el in lookup:
             result.append({'filename': el['filename'],
                            'sha1': el['sha1'],
                            'found': el['found']})
             search_res = result
             nbfound = len([x for x in lookup if x['found']])
             search_stats['nbfiles'] = l
             search_stats['pct'] = (nbfound / l) * 100
 
     response['search_res'] = search_res
     response['search_stats'] = search_stats
     return response
 
 
 def _api_lookup(criteria,
                 lookup_fn,
                 error_msg_if_not_found,
                 enrich_fn=lambda x: x,
                 *args):
     """Capture a redundant behavior of:
     - looking up the backend with a criteria (be it an identifier or checksum)
     passed to the function lookup_fn
     - if nothing is found, raise an NotFoundExc exception with error
     message error_msg_if_not_found.
     - Otherwise if something is returned:
         - either as list, map or generator, map the enrich_fn function to it
         and return the resulting data structure as list.
         - either as dict and pass to enrich_fn and return the dict enriched.
 
     Args:
         - criteria: discriminating criteria to lookup
         - lookup_fn: function expects one criteria and optional supplementary
         *args.
         - error_msg_if_not_found: if nothing matching the criteria is found,
         raise NotFoundExc with this error message.
         - enrich_fn: Function to use to enrich the result returned by
         lookup_fn. Default to the identity function if not provided.
         - *args: supplementary arguments to pass to lookup_fn.
 
     Raises:
         NotFoundExp or whatever `lookup_fn` raises.
 
     """
     res = lookup_fn(criteria, *args)
     if not res:
         raise NotFoundExc(error_msg_if_not_found)
     if isinstance(res, (map, list, GeneratorType)):
         return [enrich_fn(x) for x in res]
     return enrich_fn(res)
 
 
 @app.route('/api/1/origin/<int:origin_id>/')
 @app.route('/api/1/origin/<string:origin_type>/url/<path:origin_url>')
 @doc.route('/api/1/origin/')
 @doc.arg('origin_id',
          default=1,
          argtype=doc.argtypes.int,
-         argdoc='A origin identifier (when looking up by ID)')
+         argdoc='origin identifier (when looking up by ID)')
 @doc.arg('origin_type',
          default='git',
          argtype=doc.argtypes.str,
-         argdoc='A origin type (when looking up by type+URL)')
+         argdoc='origin type (when looking up by type+URL)')
 @doc.arg('origin_url',
          default='https://github.com/hylang/hy',
          argtype=doc.argtypes.path,
-         argdoc='A origin URL (when looking up by type+URL')
+         argdoc='origin URL (when looking up by type+URL')
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""The metadata of the origin corresponding to the given
              criteria""")
 def api_origin(origin_id=None, origin_type=None, origin_url=None):
     """Get information about a software origin
 
     Software origins might be looked up by origin type and canonical URL (e.g.,
     "git" + a "git clone" URL), or by their unique (but otherwise meaningless)
     identifier.
 
     """
     ori_dict = {
         'id': origin_id,
         'type': origin_type,
         'url': origin_url
     }
     ori_dict = {k: v for k, v in ori_dict.items() if ori_dict[k]}
     if 'id' in ori_dict:
         error_msg = 'Origin with id %s not found.' % ori_dict['id']
     else:
         error_msg = 'Origin with type %s and URL %s not found' % (
             ori_dict['type'], ori_dict['url'])
 
     def _enrich_origin(origin):
         if 'id' in origin:
             o = origin.copy()
             o['origin_visits_url'] = url_for('api_origin_visits',
                                              origin_id=o['id'])
             return o
 
         return origin
 
     return _api_lookup(
         ori_dict, lookup_fn=service.lookup_origin,
         error_msg_if_not_found=error_msg,
         enrich_fn=_enrich_origin)
 
 
 @app.route('/api/1/person/<int:person_id>/')
 @doc.route('/api/1/person/')
 @doc.arg('person_id',
          default=1,
          argtype=doc.argtypes.int,
-         argdoc="The person's SWH identifier")
+         argdoc='person identifier')
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc='The metadata of the person identified by person_id')
 def api_person(person_id):
-    """Return information about person with identifier person_id.
+    """Get information about a person.
+
     """
     return _api_lookup(
         person_id, lookup_fn=service.lookup_person,
         error_msg_if_not_found='Person with id %s not found.' % person_id)
 
 
 @app.route('/api/1/release/<string:sha1_git>/')
 @doc.route('/api/1/release/')
 @doc.arg('sha1_git',
          default='97d8dcd0c589b1d94a5d26cf0c1e8f2f44b92bfd',
          argtype=doc.argtypes.sha1_git,
-         argdoc="The release's sha1_git identifier")
+         argdoc='release identifier')
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc='The metadata of the release identified by sha1_git')
 def api_release(sha1_git):
-    """Return information about release with id sha1_git.
+    """Get information about a release.
+
+    Releases are identified by SHA1 checksums, compatible with Git tag
+    identifiers. See ``release_identifier`` in our `data model module
+    <https://forge.softwareheritage.org/source/swh-model/browse/master/swh/model/identifiers.py>`_
+    for details about how they are computed.
+
     """
     error_msg = 'Release with sha1_git %s not found.' % sha1_git
     return _api_lookup(
         sha1_git,
         lookup_fn=service.lookup_release,
         error_msg_if_not_found=error_msg,
         enrich_fn=utils.enrich_release)
 
 
 def _revision_directory_by(revision, path, request_path,
                            limit=100, with_data=False):
     """Compute the revision matching criterion's directory or content data.
 
     Args:
         revision: dictionary of criterions representing a revision to lookup
         path: directory's path to lookup
         request_path: request path which holds the original context to
         limit: optional query parameter to limit the revisions log
         (default to 100). For now, note that this limit could impede the
         transitivity conclusion about sha1_git not being an ancestor of
         with_data: indicate to retrieve the content's raw data if path resolves
         to a content.
 
     """
     def enrich_directory_local(dir, context_url=request_path):
         return utils.enrich_directory(dir, context_url)
 
     rev_id, result = service.lookup_directory_through_revision(
         revision, path, limit=limit, with_data=with_data)
 
     content = result['content']
     if result['type'] == 'dir':  # dir_entries
         result['content'] = list(map(enrich_directory_local, content))
     else:  # content
         result['content'] = utils.enrich_content(content)
 
     return result
 
 
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>'
            '/directory/')
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>'
            '/directory/<path:path>/')
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>'
            '/branch/<path:branch_name>'
            '/directory/')
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>'
            '/branch/<path:branch_name>'
            '/directory/<path:path>/')
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>'
            '/branch/<path:branch_name>'
            '/ts/<string:ts>'
            '/directory/')
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>'
            '/branch/<path:branch_name>'
            '/ts/<string:ts>'
            '/directory/<path:path>/')
 @doc.route('/api/1/revision/origin/directory/', tags=['hidden'])
 @doc.arg('origin_id',
          default=1,
          argtype=doc.argtypes.int,
          argdoc="The revision's origin's SWH identifier")
 @doc.arg('branch_name',
          default='refs/heads/master',
          argtype=doc.argtypes.path,
          argdoc="""The optional branch for the given origin (default
          to master""")
 @doc.arg('ts',
          default='2000-01-17T11:23:54+00:00',
          argtype=doc.argtypes.ts,
          argdoc="""Optional timestamp (default to the nearest time
          crawl of timestamp)""")
 @doc.arg('path',
          default='Dockerfile',
          argtype=doc.argtypes.path,
          argdoc='The path to the directory or file to display')
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""The metadata of the revision corresponding to the
              given criteria""")
 def api_directory_through_revision_origin(origin_id,
                                           branch_name="refs/heads/master",
                                           ts=None,
                                           path=None,
                                           with_data=False):
     """Display directory or content information through a revision identified
     by origin/branch/timestamp.
     """
     if ts:
         ts = utils.parse_timestamp(ts)
 
     return _revision_directory_by(
         {
             'origin_id': origin_id,
             'branch_name': branch_name,
             'ts': ts
         },
         path,
         request.path,
         with_data=with_data)
 
 
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>/')
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>'
            '/branch/<path:branch_name>/')
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>'
            '/branch/<path:branch_name>'
            '/ts/<string:ts>/')
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>'
            '/ts/<string:ts>/')
 @doc.route('/api/1/revision/origin/')
 @doc.arg('origin_id',
          default=1,
          argtype=doc.argtypes.int,
          argdoc="The queried revision's origin identifier in SWH")
 @doc.arg('branch_name',
          default='refs/heads/master',
          argtype=doc.argtypes.path,
          argdoc="""The optional branch for the given origin (default
          to master)""")
 @doc.arg('ts',
          default='2000-01-17T11:23:54+00:00',
          argtype=doc.argtypes.ts,
          argdoc="The time at which the queried revision should be constrained")
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""The metadata of the revision identified by the given
              criteria""")
 def api_revision_with_origin(origin_id,
                              branch_name="refs/heads/master",
                              ts=None):
     """Display revision information through its identification by
     origin/branch/timestamp.
     """
     if ts:
         ts = utils.parse_timestamp(ts)
 
     return _api_lookup(
         origin_id,
         service.lookup_revision_by,
         'Revision with (origin_id: %s, branch_name: %s'
         ', ts: %s) not found.' % (origin_id,
                                   branch_name,
                                   ts),
         utils.enrich_revision,
         branch_name,
         ts)
 
 
 @app.route('/api/1/revision/<string:sha1_git>/prev/<path:context>/')
 @doc.route('/api/1/revision/prev/', tags=['hidden'])
 @doc.arg('sha1_git',
          default='ec72c666fb345ea5f21359b7bc063710ce558e39',
          argtype=doc.argtypes.sha1_git,
          argdoc="The revision's sha1_git identifier")
 @doc.arg('context',
          default='6adc4a22f20bbf3bbc754f1ec8c82be5dfb5c71a',
          argtype=doc.argtypes.path,
          argdoc='The navigation breadcrumbs -- use at your own risk')
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc='The metadata of the revision identified by sha1_git')
 def api_revision_with_context(sha1_git, context):
     """Return information about revision with id sha1_git.
     """
     def _enrich_revision(revision, context=context):
         return utils.enrich_revision(revision, context)
 
     return _api_lookup(
         sha1_git,
         service.lookup_revision,
         'Revision with sha1_git %s not found.' % sha1_git,
         _enrich_revision)
 
 
 @app.route('/api/1/revision/<string:sha1_git>/')
 @doc.route('/api/1/revision/')
 @doc.arg('sha1_git',
          default='ec72c666fb345ea5f21359b7bc063710ce558e39',
          argtype=doc.argtypes.sha1_git,
          argdoc="The revision's sha1_git identifier")
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc='The metadata of the revision identified by sha1_git')
 def api_revision(sha1_git):
-    """Return information about revision with id sha1_git.
+    """Get information about a revision.
+
+    Revisions are identified by SHA1 checksums, compatible with Git commit
+    identifiers. See ``revision_identifier`` in our `data model module
+    <https://forge.softwareheritage.org/source/swh-model/browse/master/swh/model/identifiers.py>`_
+    for details about how they are computed.
+
     """
     return _api_lookup(
         sha1_git,
         service.lookup_revision,
         'Revision with sha1_git %s not found.' % sha1_git,
         utils.enrich_revision)
 
 
 @app.route('/api/1/revision/<string:sha1_git>/raw/')
 @doc.route('/api/1/revision/raw/', tags=['hidden'])
 @doc.arg('sha1_git',
          default='ec72c666fb345ea5f21359b7bc063710ce558e39',
          argtype=doc.argtypes.sha1_git,
          argdoc="The queried revision's sha1_git identifier")
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.octet_stream,
              retdoc="""The message of the revision identified by sha1_git
              as a downloadable octet stream""")
 def api_revision_raw_message(sha1_git):
     """Return the raw data of the message of revision identified by sha1_git
     """
     raw = service.lookup_revision_message(sha1_git)
     return app.response_class(raw['message'],
                               headers={'Content-disposition': 'attachment;'
                                        'filename=rev_%s_raw' % sha1_git},
                               mimetype='application/octet-stream')
 
 
 @app.route('/api/1/revision/<string:sha1_git>/directory/')
 @app.route('/api/1/revision/<string:sha1_git>/directory/<path:dir_path>/')
 @doc.route('/api/1/revision/directory/')
 @doc.arg('sha1_git',
          default='ec72c666fb345ea5f21359b7bc063710ce558e39',
          argtype=doc.argtypes.sha1_git,
          argdoc="The revision's sha1_git identifier.")
 @doc.arg('dir_path',
          default='Documentation/BUG-HUNTING',
          argtype=doc.argtypes.path,
          argdoc='The path from the top level directory')
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""The metadata of the directory pointed by revision id
              sha1-git and dir_path""")
 def api_revision_directory(sha1_git,
                            dir_path=None,
                            with_data=False):
     """Return information on directory pointed by revision with sha1_git.
     If dir_path is not provided, display top level directory.
     Otherwise, display the directory pointed by dir_path (if it exists).
     """
     return _revision_directory_by(
         {
             'sha1_git': sha1_git
         },
         dir_path,
         request.path,
         with_data=with_data)
 
 
 @app.route('/api/1/revision/<string:sha1_git>/log/')
 @app.route('/api/1/revision/<string:sha1_git>/prev/<path:prev_sha1s>/log/')
 @doc.route('/api/1/revision/log/')
 @doc.arg('sha1_git',
          default='ec72c666fb345ea5f21359b7bc063710ce558e39',
          argtype=doc.argtypes.sha1_git,
          argdoc="The revision's identifier queried")
 @doc.arg('prev_sha1s',
          default='6adc4a22f20bbf3bbc754f1ec8c82be5dfb5c71a',
          argtype=doc.argtypes.path,
          argdoc="""(Optional) Navigation breadcrumbs (descendant revisions
 previously visited).  If multiple values, use / as delimiter.  """)
 @doc.header('Link', doc=_doc_header_link)
 @doc.param('per_page', default=10,
            argtype=doc.argtypes.int,
            doc="""Default parameter used to group result by page of the
 specified number.""")
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""The log data starting at the revision identified by
              sha1_git, completed with the navigation breadcrumbs,
              if any""")
 def api_revision_log(sha1_git, prev_sha1s=None):
     """Return all revisions (~ git log) starting from sha1_git.  The first
     element returned is the given sha1_git, or the first breadcrumb,
     if any.
 
     """
     result = {}
     per_page = int(request.args.get('per_page', '10'))
 
     def lookup_revision_log_with_limit(s, limit=per_page+1):
         return service.lookup_revision_log(s, limit)
 
     error_msg = 'Revision with sha1_git %s not found.' % sha1_git
     rev_get = _api_lookup(sha1_git,
                           lookup_fn=lookup_revision_log_with_limit,
                           error_msg_if_not_found=error_msg,
                           enrich_fn=utils.enrich_revision)
 
     l = len(rev_get)
     if l == per_page+1:
         rev_backward = rev_get[:-1]
         new_last_sha1 = rev_get[-1]['id']
         params = {
             'sha1_git': new_last_sha1,
         }
 
         if request.args.get('per_page'):
             params['per_page'] = per_page
 
         result['headers'] = {
             'link-next': url_for('api_revision_log', **params)
         }
 
     else:
         rev_backward = rev_get
 
     if not prev_sha1s:  # no nav breadcrumbs, so we're done
         revisions = rev_backward
 
     else:
         rev_forward_ids = prev_sha1s.split('/')
         rev_forward = _api_lookup(rev_forward_ids,
                                   lookup_fn=service.lookup_revision_multiple,
                                   error_msg_if_not_found=error_msg,
                                   enrich_fn=utils.enrich_revision)
         revisions = rev_forward + rev_backward
 
     result.update({
         'results': revisions
     })
     return result
 
 
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>/log/')
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>'
            '/branch/<path:branch_name>/log/')
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>'
            '/branch/<path:branch_name>'
            '/ts/<string:ts>/log/')
 @app.route('/api/1/revision'
            '/origin/<int:origin_id>'
            '/ts/<string:ts>/log/')
 @doc.route('/api/1/revision/origin/log/')
 @doc.arg('origin_id',
          default=1,
          argtype=doc.argtypes.int,
          argdoc="The revision's SWH origin identifier")
 @doc.arg('branch_name',
          default='refs/heads/master',
          argtype=doc.argtypes.path,
          argdoc="""(Optional) The revision's branch name within the origin specified.
 Defaults to 'refs/heads/master'.""")
 @doc.arg('ts',
          default='2000-01-17T11:23:54+00:00',
          argtype=doc.argtypes.ts,
          argdoc="""(Optional) A time or timestamp string to parse""")
 @doc.header('Link', doc=_doc_header_link)
 @doc.param('per_page', default=10,
            argtype=doc.argtypes.int,
            doc="""Default parameter used to group result by page of the specified
 number.""")
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""The metadata of the revision log starting at the revision
              matching the given criteria.""")
 def api_revision_log_by(origin_id,
                         branch_name='refs/heads/master',
                         ts=None):
     """Show all revisions (~ git log) starting from the revision targeted
     by the origin_id provided and optionally a branch name or/and a
     timestamp.
 
     """
     result = {}
     per_page = int(request.args.get('per_page', '10'))
 
     if ts:
         ts = utils.parse_timestamp(ts)
 
     def lookup_revision_log_by_with_limit(o_id, br, ts, limit=per_page+1):
         return service.lookup_revision_log_by(o_id, br, ts, limit)
 
     error_msg = 'No revision matching origin %s ' % origin_id
     error_msg += ', branch name %s' % branch_name
     error_msg += (' and time stamp %s.' % ts) if ts else '.'
 
     rev_get = _api_lookup(origin_id,
                           lookup_revision_log_by_with_limit,
                           error_msg,
                           utils.enrich_revision,
                           branch_name,
                           ts)
     l = len(rev_get)
     if l == per_page+1:
         revisions = rev_get[:-1]
         last_sha1_git = rev_get[-1]['id']
 
         params = {
             'origin_id': origin_id,
             'branch_name': branch_name,
             'ts': ts,
             'sha1_git': last_sha1_git,
         }
 
         if request.args.get('per_page'):
             params['per_page'] = per_page
 
         result['headers'] = {
             'link-next': url_for('api_revision_log_by', **params),
         }
 
     else:
         revisions = rev_get
 
     result.update({'results': revisions})
 
     return result
 
 
 @app.route('/api/1/directory/<string:sha1_git>/')
 @app.route('/api/1/directory/<string:sha1_git>/<path:path>/')
 @doc.route('/api/1/directory/')
 @doc.arg('sha1_git',
          default='1bd0e65f7d2ff14ae994de17a1e7fe65111dcad8',
          argtype=doc.argtypes.sha1_git,
          argdoc='directory identifier')
 @doc.arg('path',
          default='codec/demux',
          argtype=doc.argtypes.path,
          argdoc='path relative to directory identified by SHA1_GIT')
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""either a list of directory entries with their metadata,
              or the metadata of a single directory entry""")
 def api_directory(sha1_git,
                   path=None):
     """Get information about directory or directory entry objects.
 
     Directories are identified by SHA1 checksums, compatible with Git directory
     identifiers. See ``directory_identifier`` in our `data model module
     <https://forge.softwareheritage.org/source/swh-model/browse/master/swh/model/identifiers.py>`_
     for details about how they are computed.
 
     When given only a directory identifier, this endpoint returns information
     about the directory itself, returning its content (usually a list of
     directory entries). When given a directory identifier and a path, this
     endpoint returns information about the directory entry pointed by the
     relative path, starting path resolution from the given directory.
 
     """
     if path:
         error_msg_path = ('Entry with path %s relative to directory '
                           'with sha1_git %s not found.') % (path, sha1_git)
         return _api_lookup(
             sha1_git,
             service.lookup_directory_with_path,
             error_msg_path,
             utils.enrich_directory,
             path)
     else:
         error_msg_nopath = 'Directory with sha1_git %s not found.' % sha1_git
         return _api_lookup(
             sha1_git,
             service.lookup_directory,
             error_msg_nopath,
             utils.enrich_directory)
 
 
 @app.route('/api/1/provenance/<string:q>/')
 @doc.route('/api/1/provenance/', tags=['hidden'])
 @doc.arg('q',
          default='sha1_git:88b9b366facda0b5ff8d8640ee9279bed346f242',
          argtype=doc.argtypes.algo_and_hash,
          argdoc=_doc_arg_content_id)
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""List of provenance information (dict) for the matched
 content.""")
 def api_content_provenance(q):
     """Return content's provenance information if any.
 
     """
     def _enrich_revision(provenance):
         p = provenance.copy()
         p['revision_url'] = url_for('api_revision',
                                     sha1_git=provenance['revision'])
         p['content_url'] = url_for('api_content_metadata',
                                    q='sha1_git:%s' % provenance['content'])
         p['origin_url'] = url_for('api_origin',
                                   origin_id=provenance['origin'])
         p['origin_visits_url'] = url_for('api_origin_visits',
                                          origin_id=provenance['origin'])
         p['origin_visit_url'] = url_for('api_origin_visit',
                                         origin_id=provenance['origin'],
                                         visit_id=provenance['visit'])
         return p
 
     return _api_lookup(
         q,
         lookup_fn=service.lookup_content_provenance,
         error_msg_if_not_found='Content with %s not found.' % q,
         enrich_fn=_enrich_revision)
 
 
 @app.route('/api/1/filetype/<string:q>/')
 @doc.route('/api/1/filetype/', tags=['upcoming'])
 @doc.arg('q',
          default='sha1:1fc6129a692e7a87b5450e2ba56e7669d0c5775d',
          argtype=doc.argtypes.algo_and_hash,
          argdoc=_doc_arg_content_id)
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""Filetype information (dict) for the matched
 content.""")
 def api_content_filetype(q):
     """Get information about the detected MIME type of a content object
 
     """
     return _api_lookup(
         q,
         lookup_fn=service.lookup_content_filetype,
         error_msg_if_not_found='No filetype information found '
         'for content %s.' % q,
         enrich_fn=utils.enrich_metadata_endpoint)
 
 
 @app.route('/api/1/language/<string:q>/')
 @doc.route('/api/1/language/', tags=['upcoming'])
 @doc.arg('q',
          default='sha1:1fc6129a692e7a87b5450e2ba56e7669d0c5775d',
          argtype=doc.argtypes.algo_and_hash,
          argdoc=_doc_arg_content_id)
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""Language information (dict) for the matched
 content.""")
 def api_content_language(q):
     """Get information about the detected (programming) language of a content
     object
 
     """
     return _api_lookup(
         q,
         lookup_fn=service.lookup_content_language,
         error_msg_if_not_found='No language information found '
         'for content %s.' % q,
         enrich_fn=utils.enrich_metadata_endpoint)
 
 
 @app.route('/api/1/license/<string:q>/')
 @doc.route('/api/1/license/', tags=['upcoming'])
 @doc.arg('q',
          default='sha1:1fc6129a692e7a87b5450e2ba56e7669d0c5775d',
          argtype=doc.argtypes.algo_and_hash,
          argdoc=_doc_arg_content_id)
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""License information (dict) for the matched
 content.""")
 def api_content_license(q):
     """Get information about the detected license of a content object
 
     """
     return _api_lookup(
         q,
         lookup_fn=service.lookup_content_license,
         error_msg_if_not_found='No license information found '
         'for content %s.' % q,
         enrich_fn=utils.enrich_metadata_endpoint)
 
 
 @app.route('/api/1/ctags/<string:q>/')
 @doc.route('/api/1/ctags/', tags=['upcoming'])
 @doc.arg('q',
          default='sha1:1fc6129a692e7a87b5450e2ba56e7669d0c5775d',
          argtype=doc.argtypes.algo_and_hash,
          argdoc=_doc_arg_content_id)
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""Ctags symbol (dict) for the matched
 content.""")
 def api_content_ctags(q):
     """Get information about all `Ctags <http://ctags.sourceforge.net/>`_-style
     symbols defined in a content object
 
     """
     return _api_lookup(
         q,
         lookup_fn=service.lookup_content_ctags,
         error_msg_if_not_found='No ctags symbol found '
         'for content %s.' % q,
         enrich_fn=utils.enrich_metadata_endpoint)
 
 
 @app.route('/api/1/content/<string:q>/raw/')
 @doc.route('/api/1/content/raw/', tags=['upcoming'])
 @doc.arg('q',
          default='adc83b19e793491b1c6ea0fd8b46cd9f32e592fc',
          argtype=doc.argtypes.algo_and_hash,
          argdoc=_doc_arg_content_id)
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.octet_stream,
              retdoc='The raw content data as an octet stream')
 def api_content_raw(q):
     """Get the raw content of a content object (AKA "blob"), as a byte sequence
 
     """
     def generate(content):
         yield content['data']
 
     content = service.lookup_content_raw(q)
     if not content:
         raise NotFoundExc('Content with %s not found.' % q)
 
     return app.response_class(generate(content),
                               headers={'Content-disposition': 'attachment;'
                                        'filename=content_%s_raw' % q},
                               mimetype='application/octet-stream')
 
 
 @app.route('/api/1/content/<string:q>/')
 @doc.route('/api/1/content/')
 @doc.arg('q',
          default='adc83b19e793491b1c6ea0fd8b46cd9f32e592fc',
          argtype=doc.argtypes.algo_and_hash,
          argdoc=_doc_arg_content_id)
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc="""known metadata for content identified by q""")
 def api_content_metadata(q):
     """Get information about a content (AKA "blob") object.
 
     """
     return _api_lookup(
         q,
         lookup_fn=service.lookup_content,
         error_msg_if_not_found='Content with %s not found.' % q,
         enrich_fn=utils.enrich_content)
 
 
 @app.route('/api/1/entity/<string:uuid>/')
 @doc.route('/api/1/entity/', tags=['hidden'])
 @doc.arg('uuid',
          default='5f4d4c51-498a-4e28-88b3-b3e4e8396cba',
          argtype=doc.argtypes.uuid,
          argdoc="The entity's uuid identifier")
 @doc.raises(exc=doc.excs.badinput, doc=_doc_exc_bad_id)
 @doc.raises(exc=doc.excs.notfound, doc=_doc_exc_id_not_found)
 @doc.returns(rettype=doc.rettypes.dict,
              retdoc='The metadata of the entity identified by uuid')
 def api_entity_by_uuid(uuid):
     """Return content information if content is found.
 
     """
     return _api_lookup(
         uuid,
         lookup_fn=service.lookup_entity_by_uuid,
         error_msg_if_not_found="Entity with uuid '%s' not found." % uuid,
         enrich_fn=utils.enrich_entity)