  • class _ConcurrentCsvWritingTask(_BaseTask):
    """Base class for tasks writing a CSV using asyncio.
    asyncio is only used for gRPC requests to swh-graph; file writes are synchronous
    to keep the code simpler, as performance improvements from making them async
    ...
    • Jan 4 2023, 10:00 AM
    • 132 Lines
    • Python
  • git clone https://github.com/grpc/grpc
    mkdir -p cmake/build
    cmake -DgRPC_BUILD_TESTS=ON ../..
    make grpc_cli
    cmake ../..
    ...
    • Jan 3 2023, 6:52 PM
    • 21 Lines
  • ORIGIN_CONTRIBUTORS = """\
    origin_id,person_id
    2,0
    2,2
    0,0
    ...
    • Dec 19 2022, 5:56 PM
    • 39 Lines
    • Python
  • pytest swh
    Traceback (most recent call last):
    File "/home/tony/work/inria/repo/swh/.direnv/python-3.7.14/lib/python3.7/site-packages/_pytest/config/__init__.py", line 774, in import_plugin
    __import__(importspec)
    File "/home/tony/work/inria/repo/swh/.direnv/python-3.7.14/lib/python3.7/site-packages/_pytest/assertion/rewrite.py", line 168, in exec_module
    ...
    • Dec 8 2022, 11:46 AM
    • 84 Lines
  • dev@desktop5 ~/swh-environment/swh-graph (master) $ cat logging.properties
    logback.configurationFile=logback.xml
    handlers = java.util.logging.ConsoleHandler
    java.util.logging.ConsoleHandler.level = CRITICAL
    java.util.logging.ConsoleHandler.formatter = java.util.logging.SimpleFormatter
    ...
    • Dec 7 2022, 2:31 PM
    • 35 Lines
  • swhscheduler@scheduler0:~/addforgenow$ ./addforge-now-schedule-with-url-lister-id-and-optional-visit-type-and-queue.sh git.afpy.org d07d1c90-5016-4ab6-91ac-3300f8eb4fc6
    Tue Dec 6 15:58:36 UTC 2022 scheduling git origins with policy never_visited_oldest_update_first to queue add_forge_now:swh.loader.git.tasks.UpdateGitRepository for lister git.afpy.org (tablesample 1)
    swh scheduler -C /etc/softwareheritage/scheduler/listener-runner.yml origin send-to-celery --lister-uuid d07d1c90-5016-4ab6-91ac-3300f8eb4fc6 --queue add_forge_now:swh.loader.git.tasks.UpdateGitRepository --policy never_visited_oldest_update_first git --only-disabled
    Tue Dec 6 15:58:36 UTC 2022 sleep 60
    Tue Dec 6 15:58:36 UTC 2022 scheduling git origins with policy origins_without_last_update to queue add_forge_now:swh.loader.git.tasks.UpdateGitRepository for lister git.afpy.org (tablesample 1)
    ...
    • Dec 6 2022, 5:06 PM
    • 43 Lines
  • swhscheduler@scheduler0:~/addforgenow$ ./addforge-now-schedule-with-url-lister-id-and-optional-visit-type-and-queue.sh git.afpy.org d07d1c90-5016-4ab6-91ac-3300f8eb4fc6
    Tue Dec 6 11:27:05 UTC 2022 scheduling git origins with policy never_visited_oldest_update_first to queue add_forge_now:swh.loader.git.tasks.UpdateGitRepository for lister git.afpy.org (tablesample 1)
    swh scheduler -C /etc/softwareheritage/scheduler/listener-runner.yml origin send-to-celery --lister-uuid d07d1c90-5016-4ab6-91ac-3300f8eb4fc6 --queue add_forge_now:swh.loader.git.tasks.UpdateGitRepository --policy never_visited_oldest_update_first git --only-disabled
    Tue Dec 6 11:27:05 UTC 2022 sleep 60
    Tue Dec 6 11:27:05 UTC 2022 scheduling git origins with policy origins_without_last_update to queue add_forge_now:swh.loader.git.tasks.UpdateGitRepository for lister git.afpy.org (tablesample 1)
    ...
    • Dec 6 2022, 12:27 PM
    • 7 Lines
  • diff --git a/swh/loader/svn/__init__.py b/swh/loader/svn/__init__.py
    index 0204bc7..ac42897 100644
    --- a/swh/loader/svn/__init__.py
    +++ b/swh/loader/svn/__init__.py
    @@ -7,9 +7,9 @@ from typing import Any, Dict
    ...
    • Dec 5 2022, 10:47 AM
    • 1,744 Lines
  • name,instance_name,visit_type,origins_known,origins_enabled,origins_never_visited,origins_visited,origins_with_pending_changes
    CRAN,cran,cran,21295,21295,0,21295,0
    GNU,GNU,tar,393,393,39,354,107
    bitbucket,bitbucket,git,3156952,2995242,1061468,2095484,0
    cgit,alexschroeder.ch,git,81,81,0,81,2
    ...
    • Nov 30 2022, 3:03 PM
    • 78 Lines
  • => select name, instance_name, scheduler_metrics.* from scheduler_metrics inner join listers on id=lister_id where name='save-code-now';
    ─[ RECORD 1 ]────────────────┬─────────────────────────────────────
    name │ save-code-now
    instance_name │ archive.softwareheritage.org
    lister_id │ 860d41f8-d0c0-4733-a4d8-437c386bc31f
    ...
    • Nov 30 2022, 3:01 PM
    • 41 Lines
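  • # Open temporary output for writes as CSV
    with tmp_output_path.open("wb") as output_fd:
        with ZstdCompressor(level=19).stream_writer(output_fd) as zstd_writer:
            # csv.writer needs a text stream and is not a context manager
            with io.TextIOWrapper(zstd_writer, encoding="utf-8", newline="") as text_fd:
                csv_writer = csv.writer(text_fd)
                # write header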
    ...
    • Nov 30 2022, 1:39 PM
    • 19 Lines
    • Python
  • softwareheritage=> select *, encode(name, 'escape') from snapshot_branch where target_type='revision' and target='\x7076613b79f425dc8bc3600a3253e9c0c9a3397e';
    object_id | name | target | target_type | encode
    -----------+----------------------------------------------+--------------------------------------------+-------------+-----------------------
    133325445 | \x6275737465722f6d61696e2f322e372e302d32 | \x7076613b79f425dc8bc3600a3253e9c0c9a3397e | revision | buster/main/2.7.0-2
    133383513 | \x756e737461626c652f6d61696e2f322e372e302d32 | \x7076613b79f425dc8bc3600a3253e9c0c9a3397e | revision | unstable/main/2.7.0-2
    ...
    • Nov 24 2022, 1:36 PM
    • 21 Lines
  • ~/s/f/deploy $ docker compose run --rm api python manage.py migrate
    WARN[0000] Found orphan containers ([deploy-nginx-1]) for this project. If you removed or renamed this service in your compose file, you can run this command with the --remove-orphans flag to clean it up.
    [+] Running 2/0
    ⠿ Container deploy-redis-1 Created 0.0s
    ⠿ Container deploy-postgres-1 Running 0.0s
    ...
    • Nov 22 2022, 11:35 AM
    • 41 Lines
  • -- Import the Software Heritage License Dataset into a SQLite database
    -- Sample usage: "sqlite3 licenses.sqlite '.read import-dataset.sql'"
    -- Related: https://forge.softwareheritage.org/T4683
    ...
    • Nov 14 2022, 4:47 PM
    • 41 Lines
    • SQL
  • diff --git a/jobs/swh-docker-dev.yaml b/jobs/swh-docker-dev.yaml
    index 44b2ef6..1fe48e3 100644
    --- a/jobs/swh-docker-dev.yaml
    +++ b/jobs/swh-docker-dev.yaml
    @@ -1,4 +1,10 @@
    ...
    • Nov 14 2022, 4:36 PM
    • 16 Lines
  • client = <django.test.client.Client object at 0x7faac8648eb0>
    def test_vault_view(client):
    url = reverse("vault")
    > check_html_get_response(client, url, status_code=200, template_used="vault-ui.html")
    ...
    • Nov 14 2022, 12:38 PM
    • 90 Lines
  • $ python manage_projects.py --gitlab staging-swh projects.yml --do-it | jq .
    {
    "nb_projects": 153,
    "nb_updated_projects": 128,
    "actions": {
    ...
    • Nov 10 2022, 2:59 PM
    • 1,171 Lines
  • Info: Using configured environment 'staging'
    Info: Retrieving pluginfacts
    Info: Retrieving plugin
    Info: Retrieving locales
    Info: Loading facts
    ...
    • Nov 8 2022, 4:53 PM
    • 28 Lines
  • $ curl -s http://deb.debian.org/debian/dists/experimental/main/source/Sources.xz | unxz | grep golang-opentelemetry-otel_1.10.0-1.dsc
    c3fb6776d3a15f43021e8adc60136ec2 2735 golang-opentelemetry-otel_1.10.0-1.dsc
    021348f1ada88dfdcfdbf22c96bcc4c8f0d839cb29f0825269aeac8bdda13418 2735 golang-opentelemetry-otel_1.10.0-1.dsc
    $ curl -s http://deb.debian.org/debian/pool/main/g/golang-opentelemetry-otel/golang-opentelemetry-otel_1.10.0-1.dsc | md5sum
    5c235253462a0eaf067992e6c34fc789 -
    ...
    • Nov 7 2022, 1:42 PM
    • 7 Lines
  • (swh) anlambert@carnavalet:/tmp$ git clone https://github.com/abaoa/SerialTool
    Cloning into 'SerialTool'...
    remote: Enumerating objects: 1301, done.
    remote: Total 1301 (delta 0), reused 0 (delta 0), pack-reused 1301
    Receiving objects: 100% (1301/1301), 16.34 MiB | 2.95 MiB/s, done.
    ...
    • Nov 4 2022, 4:26 PM
    • 33 Lines
  • 17:43:05.187741 trace.c:311 setup: git_dir: .git
    17:43:05.187784 trace.c:312 setup: git_common_dir: .git
    17:43:05.187789 trace.c:313 setup: worktree: /home/nicolasd/work/upstream/sentry/sentry
    17:43:05.187793 trace.c:314 setup: cwd: /home/nicolasd/work/upstream/sentry/sentry
    17:43:05.187796 trace.c:315 setup: prefix: (null)
    ...
    • Nov 3 2022, 5:47 PM
    • 8,427 Lines
  • storage:
    cls: pipeline
    steps:
    - cls: buffer
    - cls: filter
    ...
    • Nov 3 2022, 2:34 PM
    • 10 Lines
    • YAML
  • Python 3.9.2 (default, Feb 28 2021, 17:03:44)
    [GCC 10.2.1 20210110] on linux
    Type "help", "copyright", "credits" or "license" for more information.
    >>> import dulwich.repo
    >>> r = dulwich.repo.Repo(".")
    ...
    • Nov 2 2022, 8:11 AM
    • 27 Lines
    • Python
  • query getRevisionParents {
    revision(swhid: "swh:1:rev:db44dc9cf7a6af7b56d8ebda8c75be3375c89282") {
    message {
    text
    }
    ...
    • Oct 26 2022, 2:58 PM
    • 25 Lines
  • query resolveSwhid {
    resolveSwhid(swhid: "swh:1:dir:ec88e5b901c034d5a91aa133e824d65cff3788a3") {
    edges {
    node {
    targetType
    ...
    • Oct 26 2022, 2:58 PM
    • 28 Lines
  • query getDirEntry {
    directoryEntry(
    directorySwhid: "swh:1:dir:ec88e5b901c034d5a91aa133e824d65cff3788a3"
    path: "codemeta.json"
    ) {
    ...
    • Oct 26 2022, 2:57 PM
    • 16 Lines
  • query checkFileExists {
    origin(url: "https://github.com/rdicosmo/parmap") {
    url
    latestVisit(requireSnapshot: true) {
    date
    ...
    • Oct 26 2022, 2:56 PM
    • 41 Lines
  • query getOriginVisits {
    origins(first: 2) {
    pageInfo {
    hasNextPage
    endCursor
    ...
    • Oct 26 2022, 2:55 PM
    • 24 Lines
  • query getLatestStatus {
    origin(url: "https://github.com/rdicosmo/parmap") {
    url
    latestVisit {
    date
    ...
    • Oct 26 2022, 2:54 PM
    • 12 Lines
  • 11:25 $ curl -LI https://github.com/lihaoyi/Ammonite/releases/download/2.4.0/2.12-2.4.0
    HTTP/2 301
    server: GitHub.com
    date: Wed, 26 Oct 2022 09:25:24 GMT
    content-type: text/html; charset=utf-8
    ...
    • Oct 26 2022, 11:29 AM
    • 58 Lines
  • vlorentz@kafka4 /opt/kafka % ./bin/kafka-configs.sh --bootstrap-server kafka4.internal.softwareheritage.org:9092 --describe --entity-type topics
    Dynamic configs for topic swh.journal.indexed.content_mimetype are:
    cleanup.policy=compact sensitive=false synonyms={DYNAMIC_TOPIC_CONFIG:cleanup.policy=compact, DEFAULT_CONFIG:log.cleanup.policy=delete}
    Dynamic configs for topic swh.journal.indexed.revision_intrinsic_metadata are:
    cleanup.policy=compact sensitive=false synonyms={DYNAMIC_TOPIC_CONFIG:cleanup.policy=compact, DEFAULT_CONFIG:log.cleanup.policy=delete}
    ...
    • Oct 26 2022, 10:07 AM
    • 52 Lines
  • 15:22 $ curl -I https://codeload.github.com/fifengine/fifechan/tar.gz/0.1.5
    HTTP/2 200
    access-control-allow-origin: https://render.githubusercontent.com
    content-disposition: attachment; filename=fifechan-0.1.5.tar.gz
    content-security-policy: default-src 'none'; style-src 'unsafe-inline'; sandbox
    ...
    • Oct 25 2022, 3:32 PM
    • 14 Lines
  • {
    "category": "Permissive",
    "end_line": 21,
    "homepage_url": "http://opensource.org/licenses/mit-license.php",
    "is_exception": false,
    ...
    • Oct 25 2022, 10:26 AM
    • 38 Lines
    • JSON
  • {
    "category": "Permissive",
    "end_line": 21,
    "homepage_url": "http://opensource.org/licenses/mit-license.php",
    "is_exception": false,
    ...
    • Oct 25 2022, 10:25 AM
    • 38 Lines
    • JSON
  • {
    "license_expressions": [
    "mit"
    ],
    "licenses": [
    ...
    • Oct 25 2022, 10:22 AM
    • 47 Lines
    • JSON
  • {
    "headers": [
    {
    "tool_name": "scancode-toolkit",
    "tool_version": "31.2.1",
    ...
    • Oct 25 2022, 10:21 AM
    • 85 Lines
    • JSON
  • cat test.json | jq '.files[0] | {(.path): .licenses[] | [.spdx_license_key, .score, (.matched_rule | .is_license_intro, .is_license_notice, .is_license_reference, .is_license_tag, .is_license_text)]}'
    {
    "blobs/00/18/0018b4debcb94a83e43060fde0f8c88baa8ca476": [
    "MIT",
    100,
    ...
    • Oct 24 2022, 4:46 PM
    • 36 Lines
  • {
    "files" : [
    {
    "authors" : [],
    "copyrights" : [
    ...
    • Oct 24 2022, 4:45 PM
    • 107 Lines
    • JSON
  • $ grep -C3 '.war"' *.json
    guix-sources.json- },
    guix-sources.json- {
    guix-sources.json- "type": "git",
    guix-sources.json: "git_url": "https://github.com/a-nikolaev/curseofwar",
    ...
    • Oct 24 2022, 3:18 PM
    • 26,903 Lines
  • from google.protobuf.field_mask_pb2 import FieldMask
    from swh.graph.grpc.swhgraph_pb2 import (
    NodeFilter,
    StatsRequest,
    TraversalRequest,
    ...
    • Oct 20 2022, 5:13 PM
    • 26 Lines
    • Python
  • pytest --log-level DEBUG --durations=0 swh -m kafka -k revision_from_journal_client -vv -s
    [...]
    12.20s call swh/provenance/tests/test_journal_client.py::test_cli_revision_from_journal_client <=== NOTE THIS
    # note that test_cli_origin_from_journal_client comes before test_cli_revision_from_journal_client
    ...
    • Oct 18 2022, 11:53 AM
    • 16 Lines
  • dev-nix@desktop5:~/go-ipfs-swh-plugin$ result/bin/ipfs dag get --output-codec=git-raw f0178111494a9ed024d3859793618152ea559a168bbcbb5e2
    $ GOLOG_LOG_LEVEL="swh-bridge=info" result/bin/ipfs daemon
    Initializing daemon...
    ...
    • Oct 17 2022, 12:40 PM
    • 43 Lines
  • # https://archive.softwareheritage.org/api/1/graph/leaves/swh:1:cnt:f6c3b58820006e337b3cb55467ea735c975ce9ef/?direction=backward&resolve_origins=true
    swh:1:rel:752d77d398999ac50d000194fe975e2b8fa81147
    swh:1:rel:7e5ff01587de558391a10737bd5ea42b486a97c9
    swh:1:rel:26c079af4b779d1e070a52eb40fe17cd8db71109
    ...
    • Oct 16 2022, 11:26 AM
    • 11 Lines
  • ;; dirty hack for "forgerie"
    (ql:quickload :cffi)
    ;; FIXME: Find a way to make libmysqlclient, libcrypto found by nixified quicklisp...
    (defun prepare-ldpath ()
    ...
    • Oct 13 2022, 7:51 PM
    • 25 Lines
  • ```
    (setf forgerie-phabricator:*project-assignment-overrides*
    '((:KEY 14 :NAME "Git cloner" :ACTION :ASSIGN :REPOSITORY "swh-cloner-git")
    (:KEY 15 :NAME "Storage manager" :ACTION :ASSIGN :REPOSITORY "swh-storage")
    (:KEY 16 :NAME "Core & foundations" :ACTION :ASSIGN :REPOSITORY "swh-core")
    ...
    • Oct 13 2022, 12:09 PM
    • 446 Lines
  • ("infrastructure/puppet" "puppet-environment")
    ("modules" "swh-cloner-git")
    ("modules" "swh-core")
    ("modules" "swh-environment")
    ("modules" "swh-loader-debian")
    ...
    • Oct 12 2022, 4:38 PM
    • 224 Lines
  • (defun from-cache (wdir dir id)
    "Read cache information from *working-directory*/<DIR>/<ID>"
    (let ((cache-path (format nil "~A/~A/~A" wdir dir id)))
    (when (probe-file cache-path)
    (with-open-file (stream cache-path)
    ...
    • Oct 12 2022, 1:12 PM
    • 11 Lines
  • ```
    apiVersion: v1
    kind: Pod
    metadata:
    name: loader-cvs-manual
    ...
    • Oct 12 2022, 11:38 AM
    • 106 Lines
    • YAML
  • ; A function that takes an argument of a forgerie-core:merge-request and
    ; returns a string that will be appended to the description of created merge requests.
    ;
    ; Useful to create backlinks to the previous system, or additional migration information
    ...
    • Oct 11 2022, 5:04 PM
    • 71 Lines
  • as to why, coverage in this diff D8636 is not showing up properly:
    After building the images locally:
    extract friday evening:
    ...
    • Oct 10 2022, 9:31 AM
    • 42 Lines
  • $ git clone https://github.com/nix-community/nixpkgs-swh && cd nixpkgs-swh
    $ ./scripts/generate.sh build/ unstable
    ...
    substitution of path '/nix/store/r2jd6ygnmirm2g803mksqqjm4y39yi6i-git-2.33.1' succeeded
    ** Generate sources.json and README for release unstable
    ...
    • Oct 6 2022, 4:53 PM
    • 22 Lines
  • Definition: https://github.com/NixOS/nixpkgs/blob/350fd0044447ae8712392c6b212a18bdf2433e71/pkgs/development/tools/misc/remarkable/remarkable-toolchain/default.nix
    which presents an "executable" option that is not propagated to the manifest, so the basic hash computation (nix-store --dump) cannot work without running `chmod +x` on the file.
    ```
    ...
    • Oct 6 2022, 2:27 PM
    • 39 Lines
  • cat /var/tmp/sources-unstable-full.json | jq . | grep -C6 'https://www.unicode.org/Public/emoji/12.1/emoji-zwj-sequences.txt'
    {
    "outputHash": "0s2mvy1nr2v1x0rr1fxlsv8ly1vyf9978rb4hwry5vnr678ls522",
    "outputHashAlgo": "sha256",
    "outputHashMode": "recursive",
    ...
    • Oct 6 2022, 10:43 AM
    • 12 Lines
  • https://texlive.info/tlnet-archive/2021/04/08/tlnet/archive/subeqn.r15878.tar.xz
    https://github.com/etu/pass-checkup/archive/0.2.1.tar.gz
    https://www.artsoft.org/RELEASES/unix/rocksndiamonds/rocksndiamonds-4.1.1.0.tar.gz
    https://github.com/pymanopt/pymanopt/archive/0.2.5.tar.gz
    https://github.com/rhysd/clever-f.vim/archive/fd370f27cca93918184a8043220cef1aa440a1fd.tar.gz
    ...
    • Oct 6 2022, 8:47 AM
    • 34,303 Lines
  • https://repo1.maven.org/maven2/org/apache/maven/shared/maven-shared-components/17/maven-shared-components-17.pom
    https://salsa.debian.org/qt-kde-team/qt/qt4-x11/raw/0d4a3dd61ccb156dee556c214dbe91c04d44a717/debian/patches/gcc9-qforeach.patch
    https://sources.debian.net/data/main/g/gpsbabel/1.5.3-2/debian/patches/use_minizip
    https://repo1.maven.org/maven2/org/apache/maven/shared/maven-common-artifact-filters/1.2/maven-common-artifact-filters-1.2.pom
    https://github.com/rski/plyer/commit/f803697a1fe4fb5e9c729ee6ef1997b8d64f3ccd.patch
    ...
    • Oct 6 2022, 8:46 AM
    • 2,355 Lines
  • swh-lister_1 | [2022-10-05 14:37:14,083: WARNING/ForkPoolWorker-1] url <https://github.com/JorjBauer/lua-cyrussasl>: detected as 'file' with 'recursive' outputHashMode <{'outputHash': '14kzm3vk96k2i1m9f5zvpvq4pnzaf7s91h5g4h4x2bq1mynzw2s1', 'outputHashAlgo': 'sha256', 'outputHashMode': 'recursive', 'type': 'url', 'urls': ['https://github.com/JorjBauer/lua-cyrussasl'], 'integrity': 'sha256-QQv+ra8BL9EJJK/AkPRx6ttL8L77F5dqiGKaNPeof5I=', 'inferredFetcher': 'unclassified'}>
    swh-lister_1 | [2022-10-05 14:37:14,141: WARNING/ForkPoolWorker-1] Cannot detect extension for <https://crates.io/api/v1/crates/unicase/1.4.2/download>. Fallback to http head query
    swh-lister_1 | [2022-10-05 14:37:14,838: WARNING/ForkPoolWorker-1] Cannot detect extension for <https://crates.io/api/v1/crates/maplit/0.1.6/download>. Fallback to http head query
    swh-lister_1 | [2022-10-05 14:37:15,055: WARNING/ForkPoolWorker-1] Cannot detect extension for <https://crates.io/api/v1/crates/env_logger/0.6.1/download>. Fallback to http head query
    ...
    • Oct 5 2022, 4:56 PM
    • 31 Lines
  • P1485 Prior run
    swh-lister_1 | [2022-10-04 18:00:06,848: INFO/MainProcess] sync with loader@fe1e7dbc4e19
    swh-lister_1 | [2022-10-04 18:06:21,916: INFO/MainProcess] Task swh.lister.nixguix.tasks.NixGuixListerTask[1b46f6fb-baca-4028-9e7d-c61d04171146] received
    swh-lister_1 | [2022-10-04 18:06:22,902: WARNING/ForkPoolWorker-1] Skipping url <https://downloads.sourceforge.net/project/urjtag/urjtag/2021.03/urjtag-2021.03.tar.xz>: missing integrity field
    swh-lister_1 | [2022-10-04 18:06:23,196: WARNING/ForkPoolWorker-1] Skipping url <https://github.com/91861/wayst/archive/e72ca78ef72c7b1e92473a98d435a3c85d7eab98.tar.gz>: missing integrity field
    swh-lister_1 | [2022-10-04 18:06:24,150: WARNING/ForkPoolWorker-1] Skipping url <https://github.com/PyO3/maturin/archive/v0.11.3.tar.gz>: missing integrity field
    ...
    • Oct 5 2022, 4:18 PM
    • 292 Lines
  • Fetch and uncompress the tarball [1]
    Then check outputHash matches:
    ```
    cat /var/tmp/sources-unstable-full.json | jq . | grep -C6 https://github.com/aws/amazon-ssm-agent/archive/3.0.755.0.tar.gz
    ...
    • Oct 5 2022, 3:35 PM
    • 10,020 Lines
  • swh-loader_1 | [2022-10-05 13:01:57,524: INFO/MainProcess] Task swh.loader.core.tasks.LoadDirectory[017d809c-bc5e-44ff-bc0e-dffeec9c7d85] received
    swh-loader_1 | [2022-10-05 13:01:57,526: DEBUG/ForkPoolWorker-1] Loading config file /loader.yml
    swh-loader_1 | [2022-10-05 13:01:57,530: INFO/MainProcess] Task swh.loader.core.tasks.LoadDirectory[9bbc2020-3373-442b-8548-ef73fc9865ca] received
    swh-loader_1 | [2022-10-05 13:01:57,539: DEBUG/ForkPoolWorker-1] Loader checksums computation: standard
    swh-loader_1 | [2022-10-05 13:01:57,564: INFO/ForkPoolWorker-1] Load origin 'http://cran.r-project.org/src/contrib/sampling_2.9.tar.gz' with type 'directory'
    ...
    • Oct 5 2022, 3:05 PM
    • 1,832 Lines
  • swh-loader_1 | [2022-10-05 11:31:55,219: INFO/MainProcess] Task swh.loader.core.tasks.LoadContent[d87fc047-8856-4f38-b4b5-e987255f3739] received
    swh-loader_1 | [2022-10-05 11:31:55,224: DEBUG/ForkPoolWorker-1] Loading config file /loader.yml
    swh-loader_1 | [2022-10-05 11:31:55,225: INFO/MainProcess] Task swh.loader.core.tasks.LoadContent[6d1172e2-3676-4a77-a393-79b1136333f0] received
    swh-loader_1 | [2022-10-05 11:31:55,244: DEBUG/ForkPoolWorker-1] Loader checksums computation: standard
    swh-loader_1 | [2022-10-05 11:31:58,335: INFO/ForkPoolWorker-1] Load origin 'https://informationelle-selbstbestimmung-im-internet.de/emacs/jl-encrypt4.4/jl-encrypt.el' with type 'content'
    ...
    • Oct 5 2022, 3:00 PM
    • 1,223 Lines
  • diff --git a/swh/lister/nixguix/lister.py b/swh/lister/nixguix/lister.py
    index 0471b91..cd07caa 100644
    --- a/swh/lister/nixguix/lister.py
    +++ b/swh/lister/nixguix/lister.py
    @@ -19,10 +19,9 @@ import base64
    ...
    • Oct 5 2022, 11:42 AM
    • 48 Lines
  • (swh) tony@yavin4 /var/tmp [ERROR] % http head "http://git.linux-nfs.org/?p=steved/libtirpc.git;a=snapshot;h=5ca4ca92f629d9d83e83544b9239abaaacf0a527;sf=tgz"
    HTTP/1.1 200 OK
    Connection: Keep-Alive
    Content-Type: application/x-gzip; charset=ISO-8859-1
    Content-disposition: inline; filename="libtirpc-5ca4ca9.tar.gz"
    ...
    • Oct 5 2022, 11:07 AM
    • 210 Lines
  • diff --git a/swh/lister/nixguix/lister.py b/swh/lister/nixguix/lister.py
    index 2623ef2..f1549e7 100644
    --- a/swh/lister/nixguix/lister.py
    +++ b/swh/lister/nixguix/lister.py
    @@ -111,7 +111,7 @@ PageResult = Tuple[ArtifactType, Union[Artifact, VCS]]
    ...
    • Oct 4 2022, 9:21 PM
    • 22 Lines
  • curl -I "http://git.marmaro.de/?p=mmh;a=snapshot;h=431604647f89d5aac7b199a7883e98e56e4ccf9e;sf=tgz"
    HTTP/1.1 200 OK
    Status: 200 OK
    Content-disposition: inline; filename="mmh-4316046.tar.gz"
    Content-Type: application/x-gzip; charset=ISO-8859-1
    ...
    • Oct 4 2022, 9:16 PM
    • 7 Lines
  • swh-loader_1 | [2022-10-04 18:00:23,961: DEBUG/ForkPoolWorker-3] Loading config file /loader.yml
    swh-loader_1 | [2022-10-04 18:00:23,962: INFO/MainProcess] Task swh.loader.core.tasks.LoadContent[c5c9322d-15d6-4a53-af3a-b157f26211a6] received
    swh-loader_1 | [2022-10-04 18:00:23,972: DEBUG/ForkPoolWorker-3] Loader checksums computation: standard
    swh-loader_1 | [2022-10-04 18:00:23,996: INFO/ForkPoolWorker-3] Load origin 'https://github.com/hunspell/hunspell/commit/ac938e2ecb48ab4dd21298126c7921689d60571b.patch' with type 'content'
    swh-loader_1 | [2022-10-04 18:00:24,038: DEBUG/ForkPoolWorker-3] prepare; origin_url=https://github.com/hunspell/hunspell/commit/ac938e2ecb48ab4dd21298126c7921689d60571b.patch fallback=https://github.com/hunspell/hunspell/commit/ac938e2ecb48ab4dd21298126c7921689d60571b.patch scheme=https path=/hunspell/hunspell/commit/ac938e2ecb48ab4dd21298126c7921689d60571b.patch
    ...
    • Oct 4 2022, 8:47 PM
    • 28 Lines
  • swh-loader_1 | [2022-10-04 18:15:27,573: INFO/ForkPoolWorker-7] Task swh.loader.core.tasks.LoadContent[fdb18199-748f-4803-9cf9-2a2e3aaf82b7] succeeded in 29.081356611044612s: {'status': 'eventful'}
    swh-loader_1 | [2022-10-04 18:15:27,574: DEBUG/ForkPoolWorker-7] Loading config file /loader.yml
    swh-loader_1 | [2022-10-04 18:15:27,575: INFO/MainProcess] Task swh.loader.core.tasks.LoadContent[b749733e-c7eb-4d1f-b932-e723eaa1aa64] received
    swh-loader_1 | [2022-10-04 18:15:27,581: DEBUG/ForkPoolWorker-7] Loader checksums computation: standard
    swh-loader_1 | [2022-10-04 18:15:27,594: INFO/ForkPoolWorker-7] Load origin 'http://miniupnp.free.fr/files/download.php?file=minissdpd-1.5.20180223.tar.gz' with type 'content'
    ...
    • Oct 4 2022, 8:44 PM
    • 134 Lines
  • Step 12/24 : RUN curl -L https://nixos.org/nix/install | sh
    ---> Running in 50a5de1d0171
    % Total % Received % Xferd Average Speed Time Time Time Current
    Dload Upload Total Spent Left Speed
    0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0
    ...
    • Oct 4 2022, 7:47 PM
    • 19 Lines
  • {
    "outputHash": "0s7p9swjqjsqddylmgid6cv263ggq7pmb734z4k84yfcrgb6kg4g",
    "outputHashAlgo": "sha256",
    "outputHashMode": "recursive",
    "type": "url",
    ...
    • Oct 4 2022, 2:09 PM
    • 43 Lines
  • {
    "outputHash": "0s7p9swjqjsqddylmgid6cv263ggq7pmb734z4k84yfcrgb6kg4g",
    "outputHashAlgo": "sha256",
    "outputHashMode": "recursive",
    "type": "url",
    ...
    • Oct 4 2022, 1:47 PM
    • 22 Lines
    • JSON
  • [2022-10-04 10:36:19,957: INFO/ForkPoolWorker-7] Task swh.loader.core.tasks.LoadDirectory[2f19995d-2208-473b-8641-5588e7eb4b90] succeeded in 0.8598709789998793s: {'status': 'uneventful'}
    docker-swh-loader-1 | [2022-10-04 10:36:20,127: DEBUG/ForkPoolWorker-8] Closed channel #1
    docker-swh-loader-1 | [2022-10-04 10:36:20,164: INFO/MainProcess] Task swh.loader.core.tasks.LoadDirectory[abda9c31-e148-46e3-98a8-9b01fc8ba29d] received
    docker-swh-loader-1 | [2022-10-04 10:36:20,164: DEBUG/ForkPoolWorker-8] Loading config file /loader.yml
    docker-swh-loader-1 | [2022-10-04 10:36:20,198: INFO/ForkPoolWorker-8] Load origin 'https://github.com/SorkinType/Gelasio/archive/5bced461d54bcf8e900bb3ba69455af35b0d2ff1.tar.gz' with type 'directory'
    ...
    • Oct 4 2022, 1:31 PM
    • 101 Lines
  • swh-lister_1 | [2022-10-04 09:44:12,739: INFO/MainProcess] Task swh.lister.nixguix.tasks.NixGuixListerTask[b5b9956d-8eb6-4e1a-9ac5-b1e89774cba9] received
    swh-lister_1 | [2022-10-04 09:44:13,983: WARNING/ForkPoolWorker-1] Cannot detect extension for 'http://tarballs.nixos.org/sha256/1j1y3cq6ys30m734axc0brdm2q9n2as4h32jws15r7w5fwr991km'. Fallback to http head query
    swh-lister_1 | [2022-10-04 09:44:14,019: WARNING/ForkPoolWorker-1] Still cannot detect extension through location 'http://tarballs.nixos.org/sha256/1j1y3cq6ys30m734axc0brdm2q9n2as4h32jws15r7w5fwr991km'...
    swh-lister_1 | [2022-10-04 09:44:14,019: WARNING/ForkPoolWorker-1] Skipping url 'http://tarballs.nixos.org/sha256/1j1y3cq6ys30m734axc0brdm2q9n2as4h32jws15r7w5fwr991km': undetected remote artifact type
    swh-lister_1 | [2022-10-04 09:44:15,038: WARNING/ForkPoolWorker-1] Skipping url 'https://web.archive.org/web/20210609022835/https://timesnewerroman.com/assets/TimesNewerRoman.zip': missing integrity field
    ...
    • Oct 4 2022, 11:46 AM
    • 44 Lines
  • swh-loader_1 | [2022-10-03 17:03:45,979: INFO/ForkPoolWorker-14] Load origin 'https://sources.debian.net/data/main/g/gpsbabel/1.5.3-2/debian/patches/use_minizip' with type 'content'
    swh-loader_1 | [2022-10-03 17:03:45,983: DEBUG/ForkPoolWorker-14] prepare; origin_url=https://sources.debian.net/data/main/g/gpsbabel/1.5.3-2/debian/patches/use_minizip fallback=https://sources.debian.net/data/main/g/gpsbabel/1.5.3-2/debian/patches/use_minizip scheme=https path=/data/main/g/gpsbabel/1.5.3-2/debian/patches/use_minizip
    swh-loader_1 | [2022-10-03 17:03:47,000: DEBUG/ForkPoolWorker-14] filename: use_minizip
    swh-loader_1 | [2022-10-03 17:03:47,000: DEBUG/ForkPoolWorker-14] filepath: /tmp/tmpbyw8070j/use_minizip
    swh-loader_1 | [2022-10-03 17:03:47,002: ERROR/ForkPoolWorker-14] Loading failure, updating to `failed` status
    ...
    • Oct 4 2022, 10:25 AM
    • 26 Lines
  • function refresh-kubeconfig {
    KUBECONFIG=$(find ~/.kube/configs -type f 2>/dev/null | xargs -I % echo -n "%:") kubectl config view --merge --flatten > ~/.kube/config
    chmod 700 ~/.kube/config
    }
    • Oct 3 2022, 4:03 PM
    • 4 Lines
  • swh-lister_1 | [2022-10-03 07:56:22,047: INFO/MainProcess] Task swh.lister.nixguix.tasks.NixGuixListerTask[f58096ad-af9f-42fa-bc29-e4791f1a24e3] received
    swh-lister_1 | [2022-10-03 07:56:44,468: INFO/ForkPoolWorker-1] tar : https://releases.wildfiregames.com/0ad-0.0.25b-alpha-unix-build.tar.xz -> https://releases.wildfiregames.com/0ad-0.0.25b-alpha-unix-build.tar.xz
    swh-lister_1 | [2022-10-03 07:56:44,473: INFO/ForkPoolWorker-1] tar : https://releases.wildfiregames.com/0ad-0.0.25b-alpha-unix-data.tar.xz -> https://releases.wildfiregames.com/0ad-0.0.25b-alpha-unix-data.tar.xz
    swh-lister_1 | [2022-10-03 07:56:44,483: INFO/ForkPoolWorker-1] tar : https://github.com/389ds/389-ds-base/archive/389-ds-base-1.4.4.17.tar.gz -> https://github.com/389ds/389-ds-base/archive/389-ds-base-1.4.4.17.tar.gz
    swh-lister_1 | [2022-10-03 07:56:44,488: INFO/ForkPoolWorker-1] tar : https://launchpad.net/4dtris/0.4/0.4.3/+download/4dtris_0.4.3.orig.tar.gz -> https://launchpad.net/4dtris/0.4/0.4.3/+download/4dtris_0.4.3.orig.tar.gz
    ...
    • Oct 3 2022, 2:26 PM
    • 17,070 Lines
  • version: '2'
    services:
    swh-scheduler-db:
    ports:
    ...
    • Sep 29 2022, 5:00 PM
    • 130 Lines
  • diff --git a/requirements-swh.txt b/requirements-swh.txt
    index b05e153..f4156a8 100644
    --- a/requirements-swh.txt
    +++ b/requirements-swh.txt
    @@ -1,3 +1,4 @@
    ...
    • Sep 28 2022, 6:03 PM
    • 9 Lines
  • diff --git a/requirements-swh.txt b/requirements-swh.txt
    index b05e153..f4156a8 100644
    --- a/requirements-swh.txt
    +++ b/requirements-swh.txt
    @@ -1,3 +1,4 @@
    ...
    • Sep 28 2022, 6:03 PM
    • 9 Lines
  • # *-. R08 <------------- V5(.../1/) V6(.../2/)
    # |\ \
    # | * | R07 <---------- V4 (http://repo_with_merges/1/)
    # | | |
    # | | * R06 <---- V2 V3 V4
    ...
    • Sep 28 2022, 3:37 PM
    • 115 Lines
  • swh/loader-git-56967877c8-fbzgc[loaders]: [2022-09-28 08:46:50,715: INFO/ForkPoolWorker-3] Load origin 'https://try.gogs.io/Rodion/ex.Kladov.git' with type 'git'
    swh/loader-git-56967877c8-fbzgc[loaders]: [2022-09-28 08:46:51,183: ERROR/ForkPoolWorker-3] Loading failure, updating to `not_found` status
    swh/loader-git-56967877c8-fbzgc[loaders]: Traceback (most recent call last):
    swh/loader-git-56967877c8-fbzgc[loaders]: File "/opt/swh/.local/lib/python3.10/site-packages/swh/loader/git/loader.py", line 297, in fetch_data
    swh/loader-git-56967877c8-fbzgc[loaders]: fetch_info = self.fetch_pack_from_origin(
    ...
    • Sep 28 2022, 10:47 AM
    • 21 Lines
  • ```
    #!/bin/bash
    #sample=0.5;
    sample=1
    ...
    • Sep 22 2022, 5:59 PM
    • 28 Lines
  • ---
    apiVersion: v1
    kind: ConfigMap
    metadata:
    name: test
    ...
    • Sep 22 2022, 2:49 PM
    • 77 Lines
  • GRAPH_VERSION="1.0.1"
    GRAPH_CLASSPATH="$HOME/swh-environment/swh-graph/java/target/swh-graph-$GRAPH_VERSION.jar"
    PERF_OPTS="-Xmx100G -XX:PretenureSizeThreshold=512M -XX:MaxNewSize=4G -XX:+UseLargePages -XX:+UseTransparentHugePages -XX:+UseNUMA -XX:+UseTLAB -XX:+ResizeTLAB"
    GRAPH_PATH="/dev/shm/swh-graph/default/graph"
    ...
    • Sep 21 2022, 2:29 PM
    • 13 Lines
    • Bash Scripting
  • ```
    root@saatchi:/home/swhscheduler# cat git-acdw-net.sh
    #!/bin/bash
    #sample=0.5;
    ...
    • Sep 21 2022, 8:50 AM
    • 29 Lines
  • #!/bin/bash
    #sample=0.5;
    sample=1
    policies=never_visited_oldest_update_first;
    ...
    • Sep 21 2022, 8:43 AM
    • 27 Lines
  • storage:
    cls: pipeline
    steps:
    - cls: buffer
    min_batch_size:
    ...
    • Sep 20 2022, 11:24 AM
    • 21 Lines
  • query GetDirectoryContent {
    directoryEntry(swhid: "swh:1:dir:ec88e5b901c034d5a91aa133e824d65cff3788a3", path: "codemeta.json") {
    name {
    text
    }
    ...
    • Sep 20 2022, 10:04 AM
    • 15 Lines
  • root@getty:~# /usr/local/sbin/create_kafka_users_rocquencourt.sh --consumer-group-prefix "swh.indexer.journal_client." swh-indexer-prod-01
    Creating user swh-indexer-prod-01, with unprivileged access to consumer group prefix swh.indexer.journal_client.
    Password for user swh-indexer-prod-01:
    Setting user credentials
    Warning: --zookeeper is deprecated and will be removed in a future version of Kafka.
    ...
    • Sep 15 2022, 5:19 PM
    • 122 Lines
  • ```
    ---
    apiVersion: apps/v1
    kind: Deployment
    metadata:
    ...
    • Sep 14 2022, 4:55 PM
    • 41 Lines
  • Error: Could not update: Execution of '/usr/bin/apt-get -q -y -o DPkg::Options::=--force-confold install python3-swh.web' returned 100: Reading package lists...
    Building dependency tree...
    Reading state information...
    The following additional packages will be installed:
    python3-swh.auth python3-swh.auth.django
    ...
    • Sep 13 2022, 4:05 PM
    • 206 Lines
  • 2:13 guest@softwareheritage => explain analyze with closest_two_visits as ((
    select ov, (date - '2021-05-03 10:00:00+00'), visit as interval
    from origin_visit ov
    where ov.origin = 2695882
    and ov.date >= '2021-05-03 10:00:00+00'
    ...
    • Sep 13 2022, 1:29 PM
    • 48 Lines
    • PostgreSQL
  • 500:
    ```
    root@moma:/var/log/varnish# cat varnishncsa.log | awk '{print $9,$4}' | grep "50[0-9]" | cut -f-2 -d: | awk '{print $2}' | sort | uniq -c
    28 [13/Sep/2022:00
    22 [13/Sep/2022:01
    ...
    • Sep 13 2022, 11:04 AM
    • 28 Lines
  • ```
    PID DATABASE APP USER CLIENT CPU% MEM% READ/s WRITE/s TIME+ Waiting IOW state Query
    1440708 barman_receive_w barman_streaming 192.168.100.18/3 0.8 0.0 1.35M 0B 641 h WalSenderMain N active START_REPLICATION SLOT "barman" 2859A/46000000 TIMELINE 1
    2677979 softwareheritage softwareheritage postgres 192.168.100.103/ 0.8 0.0 1.44M 0B 49 h WalSenderWaitFor N active START_REPLICATION SLOT "softwareheritage_somerset" LOGICAL 28953/F6811D28 (proto_version '1', publication_names '"softwareheritage"')
    774624 softwareheritage guest 127.0.0.1/32 0.0 0.0 0B 0B 598:53.50 ClientRead N idle in trans WITH dir AS (SELECT dir_entries, file_entries FROM directory WHERE id='\x5eb8379aad4ba330e903f5e63b71da873121f04e'::bytea), ls_d AS (SELECT DISTINCT UNNEST(dir_entries) AS entry_id FROM dir), ls_f AS (SELECT DISTINCT UNNEST(file_entries) AS entry_id FROM dir) (SELECT 'dir' AS type, e.target, e.name FROM ls_d LEFT JOIN
    ...
    • Sep 13 2022, 10:31 AM
    • 64 Lines
  • {
    "_index": "swh_workers-7.15.2-2022.09.09",
    "_type": "_doc",
    "_id": "rNu0IoMBnb_iToLWeLmp",
    "_version": 1,
    ...
    • Sep 9 2022, 5:18 PM
    • 288 Lines
  • │ listers [2022-09-09 10:52:54,708: INFO/ForkPoolWorker-1] Fetching URL https://pub.dev/api/packages/appcarry_sdk with params {} │
    │ listers worker: Warm shutdown (MainProcess) │
    │ listers [2022-09-09 10:52:54,903: INFO/ForkPoolWorker-1] Fetching URL https://pub.dev/api/packages/appcenter with params {} │
    │ listers worker: Warm shutdown (MainProcess) │
    │ listers [2022-09-09 10:52:55,041: INFO/ForkPoolWorker-1] Fetching URL https://pub.dev/api/packages/appcenter_analytics with params {} │
    ...
    • Sep 9 2022, 12:57 PM
    • 8 Lines
  • ```
    object Service "Software Heritage Homepage" {
      import "generic-service"
      host_name = "www.softwareheritage.org"
    ...
    • Sep 8 2022, 5:55 PM
    • 43 Lines
  • | ULID | FROM | UNTIL | RANGE | UNTIL-DOWN | #SERIES | #SAMPLES | #CHUNKS | COMP-LEVEL | COMP-FAILED | LABELS | RESOLUTION | SOURCE |
    |----------------------------|---------------------|---------------------|--------------|---------------|---------|------------|---------|------------|-------------|---------------------------------------------------------------------------------------------------------------------------------|------------|---------|
    | 01GBX4H2N8P5YQC791NBTS6PXS | 01-09-2022 15:20:33 | 01-09-2022 16:00:00 | 39m26.062s | 39h20m33.938s | 81,443 | 6,291,159 | 82,200 | 1 | false | prometheus=cattle-monitoring-system/rancher-monitoring-prometheus,prometheus_replica=prometheus-rancher-monitoring-prometheus-0 | 0s | sidecar |
    | 01GBX6S9XA3QPCBJPY87RKZN9Z | 01-09-2022 16:00:00 | 01-09-2022 18:00:00 | 1h59m59.901s | 38h0m0.099s | 81,126 | 19,517,410 | 164,322 | 1 | false | prometheus=cattle-monitoring-system/rancher-monitoring-prometheus,prometheus_replica=prometheus-rancher-monitoring-prometheus-0 | 0s | sidecar |
    ...
    • Sep 2 2022, 1:50 PM
    • 13 Lines
  • ```
    docker run -ti --rm -v /tmp:/tmp --user root rancher/mirrored-thanos-thanos:v0.17.2 \
    sidecar \
    --prometheus.url=http://127.0.0.1:9090/ \
    --shipper.upload-compacted \
    ...
    • Sep 2 2022, 9:32 AM
    • 12 Lines
  • # Please edit the object below. Lines beginning with a '#' will be ignored,
    # and an empty file will abort the edit. If an error occurs while saving this file will be
    # reopened with the relevant failures.
    #
    apiVersion: v1
    ...
    • Sep 1 2022, 4:03 PM
    • 48 Lines
  • terraform plan -target=rancher2_app_v2.archive-staging-rancher-monitoring
    rancher2_cluster.archive-staging: Refreshing state... [id=c-cx2bq]
    rancher2_app_v2.archive-staging-rancher-monitoring: Refreshing state... [id=c-cx2bq.cattle-monitoring-system/rancher-monitoring]
    Terraform used the selected providers to generate the following execution plan. Resource actions are indicated with the following symbols:
    ...
    • Sep 1 2022, 10:11 AM
    • 79 Lines
  • Aug 31 14:49:20 worker1 swh[2752560]: DEBUG:swh.core.config:Loading config file /etc/softwareheritage/indexer/origin_intrinsic_metadata.yml
    Aug 31 14:49:20 worker1 swh[2752560]: DEBUG:urllib3.util.retry:Converted retries value: 3 -> Retry(total=3, connect=None, read=None, redirect=None, status=None)
    Aug 31 14:49:20 worker1 swh[2752560]: DEBUG:rdflib:RDFLib Version: 4.2.2
    Aug 31 14:49:20 worker1 swh[2752560]: DEBUG:swh.core.config:Loading config file /etc/softwareheritage/indexer/origin_intrinsic_metadata.yml
    Aug 31 14:49:20 worker1 swh[2752560]: DEBUG:urllib3.util.retry:Converted retries value: 3 -> Retry(total=3, connect=None, read=None, redirect=None, status=None)
    ...
    • Aug 31 2022, 4:50 PM
    • 159 Lines