Changeset View
Changeset View
Standalone View
Standalone View
docs/glossary.rst
- This file was added.
:orphan: | |||||
.. _glossary: | |||||
Glossary | |||||
======== | |||||
.. glossary:: | |||||
archive | |||||
An instance of the |swh| data store. | |||||
archiver | |||||
A component dedicated at replicating an :term:`archive` and ensure there | |||||
are enough copies of each element to ensure resiliency. | |||||
ark | |||||
`Archival Resource Key`_ (ARK) is a Uniform Resource Locator (URL) that is | |||||
a multi-purpose persistent identifier for information objects of any type. | |||||
artifact | |||||
software artifact | |||||
An artifact is one of many kinds of tangible by-products produced during | |||||
the development of software. | |||||
content | |||||
blob | |||||
A (specific version of a) file stored in the archive, identified by its | |||||
vlorentz: maybe explain the difference between a SHA1 and “git-like” SHA1 in that glossary as well? | |||||
cryptographic hashes (SHA1, "git-like" SHA1, SHA256) and its size. Also | |||||
known as: :term:`blob`. Note: it is incorrect to refer to Contents as | |||||
"files", because files are usually considered to be named, whereas | |||||
Contents are nameless. It is only in the context of specific | |||||
:term:`directories <directory>` that :term:`contents <content>` acquire | |||||
(local) names. | |||||
directory | |||||
A set of named pointers to contents (file entries), directories (directory | |||||
entries) and revisions (revision entries). All entries are associated to | |||||
the local name of the entry (i.e., a relative path without any path | |||||
separator) and permission metadata (e.g., ``chmod`` value or equivalent). | |||||
doi | |||||
A Digital Object Identifier or DOI_ is a persistent identifier or handle | |||||
used to uniquely identify objects, standardized by the International | |||||
Organization for Standardization (ISO). | |||||
journal | |||||
The :ref:`journal <swh-journal>` is the persistent logger of the |swh| architecture in charge | |||||
of logging changes of the archive, with publish-subscribe_ support. | |||||
lister | |||||
A :ref:`lister <swh-lister>` is a component of the |swh| architecture that is in charge of | |||||
enumerating the :term:`software origin` (e.g., VCS, packages, etc.) | |||||
available at a source code distribution place. | |||||
loader | |||||
A :ref:`loader <swh-loader-core>` is a component of the |swh| architecture | |||||
responsible for reading a source code :term:`origin` (typically a git | |||||
reposiitory) and import or update its content in the :term:`archive` (ie. | |||||
add new file contents int :term:`object storage` and repository structure | |||||
in the :term:`storage database`). | |||||
hash | |||||
cryptographic hash | |||||
checksum | |||||
digest | |||||
A fixed-size "summary" of a stream of bytes that is easy to compute, and | |||||
hard to reverse. (Cryptographic hash function Wikipedia article) also | |||||
known as: :term:`checksum`, :term:`digest`. | |||||
indexer | |||||
A component of the |swh| architecture dedicated to producing metadata | |||||
linked to the known :term:`blobs <blob>` in the :term:`archive`. | |||||
objstore | |||||
objstorage | |||||
object store | |||||
object storage | |||||
Content-addressable object storage. It is the place where actual object | |||||
:term:`blobs <blob>` objects are stored. | |||||
origin | |||||
software origin | |||||
data source | |||||
A location from which a coherent set of sources has been obtained, like a | |||||
git repository, a directory containing tarballs, etc. | |||||
person | |||||
An entity referenced by a revision as either the author or the committer | |||||
of the corresponding change. A person is associated to a full name and/or | |||||
an email address. | |||||
release | |||||
tag | |||||
milestone | |||||
a revision that has been marked as noteworthy with a specific name (e.g., | |||||
a version number), together with associated development metadata (e.g., | |||||
author, timestamp, etc). | |||||
revision | |||||
commit | |||||
changeset | |||||
A point in time snapshot of the content of a directory, together with | |||||
associated development metadata (e.g., author, timestamp, log message, | |||||
etc). | |||||
Not Done Inline Actionsprioritization vlorentz: prioritization | |||||
scheduler | |||||
The component of the |swh| architecture dedicated to the management and | |||||
the prioritization of the many tasks. | |||||
snapshot | |||||
the state of all visible branches during a specific visit of an origin | |||||
storage | |||||
storage database | |||||
The main database of the |swh| platform in which the all the elements of | |||||
the :ref:`data-model` but the :term:`content` are stored as a :ref:`Merkle | |||||
Not Done Inline ActionsDon't forget this :) vlorentz: Don't forget this :) | |||||
DAG <swh-merkle-dag>`. | |||||
type of origin | |||||
Information about the kind of hosting, e.g., whether it is a forge, a | |||||
collection of repositories, an homepage publishing tarball, or a one shot | |||||
source code repository. For all kind of repositories please specify which | |||||
VCS system is in use (Git, SVN, CVS, etc.) object. | |||||
vault | |||||
vault service | |||||
User-facing service that allows to retrieve parts of the :term:`archive` | |||||
as self-contained bundles (e.g., individual releases, entire repository | |||||
snapshots, etc.) | |||||
visit | |||||
The passage of |swh| on a given :term:`origin`, to retrieve all source | |||||
code and metadata available there at the time. A visit object stores the | |||||
state of all visible branches (if any) available at the origin at visit | |||||
time; each of them points to a revision object in the archive. Future | |||||
visits of the same origin will create new visit objects, without removing | |||||
previous ones. | |||||
.. _blob: https://en.wikipedia.org/wiki/Binary_large_object | |||||
.. _DOI: https://www.doi.org | |||||
.. _`persistent identifier`: https://docs.softwareheritage.org/devel/swh-model/persistent-identifiers.html#persistent-identifiers | |||||
.. _`Archival Resource Key`: http://n2t.net/e/ark_ids.html | |||||
.. _publish-subscribe: https://en.wikipedia.org/wiki/Publish%E2%80%93subscribe_pattern |
maybe explain the difference between a SHA1 and “git-like” SHA1 in that glossary as well?