Add the ability to run starting from a specific task (fixes #227) #377
firefoxci-taskcluster / dataset-opus-ELRC-3075-wikipedia_health_v1-ru-en
succeeded
Feb 13, 2024 in 12m 8s
FirefoxCI (pull_request)
Fetch opus dataset
Details
View task in Taskcluster
View logs in Taskcluster
[taskcluster 2024-02-13 19:57:30.679Z] Task ID: M6gd0cTGT7-RejWr1iaI5A
[taskcluster 2024-02-13 19:57:30.679Z] Worker ID: 3179979791804037441
[taskcluster 2024-02-13 19:57:30.679Z] Worker Group: us-central1
[taskcluster 2024-02-13 19:57:30.679Z] Worker Node Type: projects/887720501152/machineTypes/n2-highmem-32
[taskcluster 2024-02-13 19:57:30.679Z] Worker Pool: translations-1/b-linux-large-gcp
[taskcluster 2024-02-13 19:57:30.679Z] Worker Version: 38.0.5
[taskcluster 2024-02-13 19:57:30.679Z] Public IP: 34.122.95.34
[taskcluster 2024-02-13 19:57:30.679Z] Hostname: translations-1-b-linux-large-gcp-hqyisvsorvolrcxzzwk1lw
[taskcluster 2024-02-13 19:57:30.679Z] using cache "translations-level-1-checkouts-v3-7afeb851dd97df8f3607-KTThW1rRQEWlZgGQ0sWCPQ" -> /builds/worker/checkouts
[taskcluster 2024-02-13 19:57:31.186Z] Image 'public/image.tar.zst' from task 'KTThW1rRQEWlZgGQ0sWCPQ' loaded. Using image ID sha256:53facad048ff33f5c58e9d52d6e58e6cd4fcdd5a8e5788c85f46e559dd9deed5.
[taskcluster 2024-02-13 19:57:31.241Z] === Task Starting ===
[setup 2024-02-13T19:57:31.505Z] run-task started in /builds/worker
[setup 2024-02-13T19:57:31.505Z] Invoked by command: --firefox_translations_training-checkout=/builds/worker/checkouts/vcs/ -- bash -c pip3 install --upgrade pip setuptools && pip3 install -r $VCS_PATH/pipeline/data/requirements/data.txt && python3 $VCS_PATH/pipeline/data/dataset_importer.py --type corpus --dataset opus_ELRC-3075-wikipedia_health/v1 --output_prefix $TASK_WORKDIR/artifacts/ELRC-3075-wikipedia_health_v1
[setup 2024-02-13T19:57:31.505Z] Python version: 3.10.12
[cache 2024-02-13T19:57:31.507Z] cache /builds/worker/checkouts exists; requirements: gid=1000 uid=1000 version=1
[volume 2024-02-13T19:57:31.507Z] changing ownership of volume /builds/worker/.cache to 1000:1000
[volume 2024-02-13T19:57:31.507Z] volume /builds/worker/checkouts is a cache
[setup 2024-02-13T19:57:31.507Z] running as worker:worker
[vcs 2024-02-13T19:57:31.507Z] executing ['git', 'config', '--global', '--add', 'safe.directory', '/builds/worker/checkouts/vcs']
[vcs 2024-02-13T19:57:31.509Z] executing ['git', 'fetch', '--no-tags', 'https://github.com/bhearsum/firefox-translations-training', 'start-specific']
[vcs 2024-02-13T19:57:31.705Z] From https://github.com/bhearsum/firefox-translations-training
[vcs 2024-02-13T19:57:31.705Z] * branch start-specific -> FETCH_HEAD
[vcs 2024-02-13T19:57:31.711Z] executing ['git', 'checkout', '-f', '-B', 'start-specific', '37fbf272d7eb316897377144111a3ef057becfd4']
[vcs 2024-02-13T19:57:31.771Z] Reset branch 'start-specific'
[vcs 2024-02-13T19:57:31.772Z] executing ['git', 'submodule', 'init']
[vcs 2024-02-13T19:57:31.793Z] executing ['git', 'submodule', 'update', '--force']
[vcs 2024-02-13T19:57:31.845Z] Submodule path '3rd_party/browsermt-marian-dev': checked out '11c6ae7c46be21ef96ed10c60f28022fa968939f'
[vcs 2024-02-13T19:57:31.857Z] Submodule path '3rd_party/extract-lex': checked out '42fa605b53f32eaf6c6e0b5677255c21c91b3d49'
[vcs 2024-02-13T19:57:31.869Z] Submodule path '3rd_party/fast_align': checked out 'cab1e9aac8d3bb02ff5ae58218d8d225a039fa11'
[vcs 2024-02-13T19:57:31.899Z] Submodule path '3rd_party/kenlm': checked out 'bbf4fc511266c5d4515047055d7bdec659a6e158'
[vcs 2024-02-13T19:57:32.029Z] Submodule path '3rd_party/marian-dev': checked out 'e8a1a2530fb84cbff7383302ebca393e5875c441'
[vcs 2024-02-13T19:57:32.050Z] Submodule path '3rd_party/preprocess': checked out '64307314b4d5a9a0bd529b5c1036b0710d995eec'
[vcs 2024-02-13T19:57:32.051Z] cleaning git checkout...
[vcs 2024-02-13T19:57:32.051Z] executing ['git', 'clean', '-nxdff']
[vcs 2024-02-13T19:57:32.054Z] removing []
[vcs 2024-02-13T19:57:32.054Z] successfully cleaned git checkout!
[vcs 2024-02-13T19:57:32.056Z] TinderboxPrint:<a href='https://github.com/bhearsum/firefox-translations-training/commit/37fbf272d7eb316897377144111a3ef057becfd4' title='Built from firefox-translations-training commit 37fbf272d7eb316897377144111a3ef057becfd4'>37fbf272d7eb316897377144111a3ef057becfd4</a>
[task 2024-02-13T19:57:32.056Z] executing ['bash', '-c', 'pip3 install --upgrade pip setuptools && pip3 install -r $VCS_PATH/pipeline/data/requirements/data.txt && python3 $VCS_PATH/pipeline/data/dataset_importer.py --type corpus --dataset opus_ELRC-3075-wikipedia_health/v1 --output_prefix $TASK_WORKDIR/artifacts/ELRC-3075-wikipedia_health_v1']
[task 2024-02-13T19:57:32.445Z] Defaulting to user installation because normal site-packages is not writeable
[task 2024-02-13T19:57:32.474Z] Requirement already satisfied: pip in /usr/lib/python3/dist-packages (22.0.2)
[task 2024-02-13T19:57:32.651Z] Collecting pip
[task 2024-02-13T19:57:32.835Z] Downloading pip-24.0-py3-none-any.whl (2.1 MB)
[task 2024-02-13T19:57:33.067Z] ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.1/2.1 MB 9.2 MB/s eta 0:00:00
[task 2024-02-13T19:57:33.080Z] Requirement already satisfied: setuptools in /usr/lib/python3/dist-packages (59.6.0)
[task 2024-02-13T19:57:33.413Z] Collecting setuptools
[task 2024-02-13T19:57:33.468Z] Downloading setuptools-69.1.0-py3-none-any.whl (819 kB)
[task 2024-02-13T19:57:33.479Z] ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 819.3/819.3 KB 97.0 MB/s eta 0:00:00
[task 2024-02-13T19:57:33.673Z] Installing collected packages: setuptools, pip
[task 2024-02-13T19:57:35.138Z] Successfully installed pip-24.0 setuptools-69.1.0
[task 2024-02-13T19:57:35.542Z] Defaulting to user installation because normal site-packages is not writeable
[task 2024-02-13T19:57:35.604Z] Collecting opustrainer@ git+https://github.com/hplt-project/OpusTrainer.git@9133e1525c7ee37f53ea14ee6a180152bf7ea192 (from -r /builds/worker/checkouts/vcs/pipeline/data/requirements/data.txt (line 11))
[task 2024-02-13T19:57:35.605Z] Cloning https://github.com/hplt-project/OpusTrainer.git (to revision 9133e1525c7ee37f53ea14ee6a180152bf7ea192) to /tmp/pip-install-jctp_xno/opustrainer_f5a4a1ce0e6a4c18b430a6385a871441
[task 2024-02-13T19:57:35.607Z] Running command git clone --filter=blob:none --quiet https://github.com/hplt-project/OpusTrainer.git /tmp/pip-install-jctp_xno/opustrainer_f5a4a1ce0e6a4c18b430a6385a871441
[task 2024-02-13T19:57:36.356Z] Running command git rev-parse -q --verify 'sha^9133e1525c7ee37f53ea14ee6a180152bf7ea192'
[task 2024-02-13T19:57:36.358Z] Running command git fetch -q https://github.com/hplt-project/OpusTrainer.git 9133e1525c7ee37f53ea14ee6a180152bf7ea192
[task 2024-02-13T19:57:36.582Z] Running command git checkout -q 9133e1525c7ee37f53ea14ee6a180152bf7ea192
[task 2024-02-13T19:57:36.853Z] Resolved https://github.com/hplt-project/OpusTrainer.git to commit 9133e1525c7ee37f53ea14ee6a180152bf7ea192
[task 2024-02-13T19:57:36.858Z] Installing build dependencies: started
[task 2024-02-13T19:57:39.263Z] Installing build dependencies: finished with status 'done'
[task 2024-02-13T19:57:39.264Z] Getting requirements to build wheel: started
[task 2024-02-13T19:57:39.487Z] Getting requirements to build wheel: finished with status 'done'
[task 2024-02-13T19:57:39.489Z] Preparing metadata (pyproject.toml): started
[task 2024-02-13T19:57:39.714Z] Preparing metadata (pyproject.toml): finished with status 'done'
[task 2024-02-13T19:57:39.834Z] Collecting click==8.1.7 (from -r /builds/worker/checkouts/vcs/pipeline/data/requirements/data.txt (line 7))
[task 2024-02-13T19:57:40.001Z] Downloading click-8.1.7-py3-none-any.whl.metadata (3.0 kB)
[task 2024-02-13T19:57:40.053Z] Collecting joblib==1.3.2 (from -r /builds/worker/checkouts/vcs/pipeline/data/requirements/data.txt (line 9))
[task 2024-02-13T19:57:40.096Z] Downloading joblib-1.3.2-py3-none-any.whl.metadata (5.4 kB)
[task 2024-02-13T19:57:40.157Z] Collecting pyyaml==6.0.1 (from -r /builds/worker/checkouts/vcs/pipeline/data/requirements/data.txt (line 13))
[task 2024-02-13T19:57:40.199Z] Downloading PyYAML-6.0.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (2.1 kB)
[task 2024-02-13T19:57:40.605Z] Collecting regex==2023.10.3 (from -r /builds/worker/checkouts/vcs/pipeline/data/requirements/data.txt (line 15))
[task 2024-02-13T19:57:40.649Z] Downloading regex-2023.10.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (40 kB)
[task 2024-02-13T19:57:40.676Z] ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 40.9/40.9 kB 1.5 MB/s eta 0:00:00
[task 2024-02-13T19:57:40.709Z] Collecting sacremoses==0.0.53 (from -r /builds/worker/checkouts/vcs/pipeline/data/requirements/data.txt (line 17))
[task 2024-02-13T19:57:40.752Z] Downloading sacremoses-0.0.53.tar.gz (880 kB)
[task 2024-02-13T19:57:40.893Z] ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 880.6/880.6 kB 6.3 MB/s eta 0:00:00
[task 2024-02-13T19:57:40.935Z] Preparing metadata (setup.py): started
[task 2024-02-13T19:57:41.097Z] Preparing metadata (setup.py): finished with status 'done'
[task 2024-02-13T19:57:41.168Z] Collecting sentencepiece==0.1.99 (from -r /builds/worker/checkouts/vcs/pipeline/data/requirements/data.txt (line 19))
[task 2024-02-13T19:57:41.210Z] Downloading sentencepiece-0.1.99-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.3 MB)
[task 2024-02-13T19:57:41.260Z] ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.3/1.3 MB 27.7 MB/s eta 0:00:00
[task 2024-02-13T19:57:41.266Z] Requirement already satisfied: six==1.16.0 in /usr/local/lib/python3.10/dist-packages (from -r /builds/worker/checkouts/vcs/pipeline/data/requirements/data.txt (line 21)) (1.16.0)
[task 2024-02-13T19:57:41.343Z] Collecting tqdm==4.66.1 (from -r /builds/worker/checkouts/vcs/pipeline/data/requirements/data.txt (line 23))
[task 2024-02-13T19:57:41.386Z] Downloading tqdm-4.66.1-py3-none-any.whl.metadata (57 kB)
[task 2024-02-13T19:57:41.391Z] ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 57.6/57.6 kB 16.8 MB/s eta 0:00:00
[task 2024-02-13T19:57:41.448Z] Collecting typo==0.1.5 (from -r /builds/worker/checkouts/vcs/pipeline/data/requirements/data.txt (line 25))
[task 2024-02-13T19:57:41.491Z] Downloading typo-0.1.5.tar.gz (7.0 kB)
[task 2024-02-13T19:57:41.499Z] Preparing metadata (setup.py): started
[task 2024-02-13T19:57:41.657Z] Preparing metadata (setup.py): finished with status 'done'
[task 2024-02-13T19:57:41.733Z] Downloading click-8.1.7-py3-none-any.whl (97 kB)
[task 2024-02-13T19:57:41.739Z] ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 97.9/97.9 kB 30.3 MB/s eta 0:00:00
[task 2024-02-13T19:57:41.782Z] Downloading joblib-1.3.2-py3-none-any.whl (302 kB)
[task 2024-02-13T19:57:41.789Z] ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 302.2/302.2 kB 75.9 MB/s eta 0:00:00
[task 2024-02-13T19:57:41.831Z] Downloading PyYAML-6.0.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (705 kB)
[task 2024-02-13T19:57:41.840Z] ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 705.5/705.5 kB 99.0 MB/s eta 0:00:00
[task 2024-02-13T19:57:41.883Z] Downloading regex-2023.10.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (773 kB)
[task 2024-02-13T19:57:41.893Z] ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 773.9/773.9 kB 93.0 MB/s eta 0:00:00
[task 2024-02-13T19:57:41.936Z] Downloading tqdm-4.66.1-py3-none-any.whl (78 kB)
[task 2024-02-13T19:57:41.940Z] ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 78.3/78.3 kB 26.9 MB/s eta 0:00:00
[task 2024-02-13T19:57:41.955Z] Building wheels for collected packages: sacremoses, typo, opustrainer
[task 2024-02-13T19:57:41.955Z] Building wheel for sacremoses (setup.py): started
[task 2024-02-13T19:57:42.336Z] Building wheel for sacremoses (setup.py): finished with status 'done'
[task 2024-02-13T19:57:42.340Z] Created wheel for sacremoses: filename=sacremoses-0.0.53-py3-none-any.whl size=895241 sha256=810d926b9aebc322884944e4e209497d89de4f75345eeac49562e6687ae1463b
[task 2024-02-13T19:57:42.340Z] Stored in directory: /builds/worker/.cache/pip/wheels/00/24/97/a2ea5324f36bc626e1ea0267f33db6aa80d157ee977e9e42fb
[task 2024-02-13T19:57:42.343Z] Building wheel for typo (setup.py): started
[task 2024-02-13T19:57:42.570Z] Building wheel for typo (setup.py): finished with status 'done'
[task 2024-02-13T19:57:42.571Z] Created wheel for typo: filename=typo-0.1.5-py3-none-any.whl size=6837 sha256=d5fbccb3ccb5066825ba2cfddd1f51cc85b179b5a519fd561bc083ece2045be6
[task 2024-02-13T19:57:42.571Z] Stored in directory: /builds/worker/.cache/pip/wheels/2e/2f/73/60e0ce42d1375a386b9171a37cd5536e173ad950a98e7dc6b1
[task 2024-02-13T19:57:42.578Z] Building wheel for opustrainer (pyproject.toml): started
[task 2024-02-13T19:57:42.829Z] Building wheel for opustrainer (pyproject.toml): finished with status 'done'
[task 2024-02-13T19:57:42.830Z] Created wheel for opustrainer: filename=opustrainer-0.2-py3-none-any.whl size=39889 sha256=6137f0be33c875ea9c1a03e62cbfd89d0b1d5eccd04be6f78636726dcd352b76
[task 2024-02-13T19:57:42.830Z] Stored in directory: /builds/worker/.cache/pip/wheels/be/de/18/46a4ed14bd505e2a41b3680ce76f5ec719db6334485446d524
[task 2024-02-13T19:57:42.833Z] Successfully built sacremoses typo opustrainer
[task 2024-02-13T19:57:42.999Z] Installing collected packages: typo, sentencepiece, tqdm, regex, pyyaml, joblib, click, sacremoses, opustrainer
[task 2024-02-13T19:57:43.478Z] Successfully installed click-8.1.7 joblib-1.3.2 opustrainer-0.2 pyyaml-6.0.1 regex-2023.10.3 sacremoses-0.0.53 sentencepiece-0.1.99 tqdm-4.66.1 typo-0.1.5
[task 2024-02-13T19:57:46.026Z] Running with arguments: ['/builds/worker/checkouts/vcs/pipeline/data/dataset_importer.py', '--type', 'corpus', '--dataset', 'opus_ELRC-3075-wikipedia_health/v1', '--output_prefix', '/builds/worker/artifacts/ELRC-3075-wikipedia_health_v1']
[task 2024-02-13T19:57:46.026Z] Starting dataset import and augmentation.
[task 2024-02-13T19:57:46.026Z] Downloading parallel dataset
[task 2024-02-13T19:57:46.026Z] + set -euo pipefail
[task 2024-02-13T19:57:46.026Z] + [[ -z ru ]]
[task 2024-02-13T19:57:46.026Z] + [[ -z en ]]
[task 2024-02-13T19:57:46.026Z] + dataset=opus_ELRC-3075-wikipedia_health/v1
[task 2024-02-13T19:57:46.026Z] + output_prefix=/builds/worker/artifacts/ELRC-3075-wikipedia_health_v1
[task 2024-02-13T19:57:46.026Z] + echo '###### Downloading dataset opus_ELRC-3075-wikipedia_health/v1'
[task 2024-02-13T19:57:46.026Z] ###### Downloading dataset opus_ELRC-3075-wikipedia_health/v1
[task 2024-02-13T19:57:46.026Z] ++ dirname /builds/worker/checkouts/vcs/pipeline/data/download-corpus.sh
[task 2024-02-13T19:57:46.026Z] + cd /builds/worker/checkouts/vcs/pipeline/data
[task 2024-02-13T19:57:46.026Z] ++ dirname /builds/worker/artifacts/ELRC-3075-wikipedia_health_v1
[task 2024-02-13T19:57:46.026Z] + dir=/builds/worker/artifacts
[task 2024-02-13T19:57:46.026Z] + mkdir -p /builds/worker/artifacts
[task 2024-02-13T19:57:46.026Z] + name=ELRC-3075-wikipedia_health/v1
[task 2024-02-13T19:57:46.026Z] + type=opus
[task 2024-02-13T19:57:46.026Z] + bash importers/corpus/opus.sh ru en /builds/worker/artifacts/ELRC-3075-wikipedia_health_v1 ELRC-3075-wikipedia_health/v1
[task 2024-02-13T19:57:46.026Z] + set -euo pipefail
[task 2024-02-13T19:57:46.026Z] + echo '###### Downloading opus corpus'
[task 2024-02-13T19:57:46.026Z] ###### Downloading opus corpus
[task 2024-02-13T19:57:46.026Z] + src=ru
[task 2024-02-13T19:57:46.026Z] + trg=en
[task 2024-02-13T19:57:46.026Z] + output_prefix=/builds/worker/artifacts/ELRC-3075-wikipedia_health_v1
[task 2024-02-13T19:57:46.026Z] + dataset=ELRC-3075-wikipedia_health/v1
[task 2024-02-13T19:57:46.026Z] + COMPRESSION_CMD=zstdmt
[task 2024-02-13T19:57:46.026Z] + ARTIFACT_EXT=zst
[task 2024-02-13T19:57:46.026Z] + WGET=wget
[task 2024-02-13T19:57:46.026Z] + name=ELRC-3075-wikipedia_health
[task 2024-02-13T19:57:46.026Z] + name_and_version=ELRC_3075_wikipedia_health_v1
[task 2024-02-13T19:57:46.026Z] ++ dirname /builds/worker/artifacts/ELRC-3075-wikipedia_health_v1
[task 2024-02-13T19:57:46.026Z] + tmp=/builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1
[task 2024-02-13T19:57:46.026Z] + mkdir -p /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1
[task 2024-02-13T19:57:46.026Z] + archive_path=/builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.txt.zip
[task 2024-02-13T19:57:46.027Z] + wget -O /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.txt.zip https://object.pouta.csc.fi/OPUS-ELRC-3075-wikipedia_health/v1/moses/ru-en.txt.zip
[task 2024-02-13T19:57:46.027Z] --2024-02-13 19:57:43-- https://object.pouta.csc.fi/OPUS-ELRC-3075-wikipedia_health/v1/moses/ru-en.txt.zip
[task 2024-02-13T19:57:46.027Z] Resolving object.pouta.csc.fi (object.pouta.csc.fi)... 86.50.254.18, 86.50.254.19
[task 2024-02-13T19:57:46.027Z] Connecting to object.pouta.csc.fi (object.pouta.csc.fi)|86.50.254.18|:443... connected.
[task 2024-02-13T19:57:46.027Z] HTTP request sent, awaiting response... 404 Not Found
[task 2024-02-13T19:57:46.027Z] 2024-02-13 19:57:44 ERROR 404: Not Found.
[task 2024-02-13T19:57:46.027Z]
[task 2024-02-13T19:57:46.027Z] + wget -O /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.txt.zip https://object.pouta.csc.fi/OPUS-ELRC-3075-wikipedia_health/v1/moses/en-ru.txt.zip
[task 2024-02-13T19:57:46.027Z] --2024-02-13 19:57:44-- https://object.pouta.csc.fi/OPUS-ELRC-3075-wikipedia_health/v1/moses/en-ru.txt.zip
[task 2024-02-13T19:57:46.027Z] Resolving object.pouta.csc.fi (object.pouta.csc.fi)... 86.50.254.18, 86.50.254.19
[task 2024-02-13T19:57:46.027Z] Connecting to object.pouta.csc.fi (object.pouta.csc.fi)|86.50.254.18|:443... connected.
[task 2024-02-13T19:57:46.027Z] HTTP request sent, awaiting response... 200 OK
[task 2024-02-13T19:57:46.027Z] Length: 517126 (505K) [application/zip]
[task 2024-02-13T19:57:46.027Z] Saving to: ‘/builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.txt.zip’
[task 2024-02-13T19:57:46.027Z]
[task 2024-02-13T19:57:46.027Z] 0K .......... .......... .......... .......... .......... 9% 184K 2s
[task 2024-02-13T19:57:46.027Z] 50K .......... .......... .......... .......... .......... 19% 366K 2s
[task 2024-02-13T19:57:46.027Z] 100K .......... .......... .......... .......... .......... 29% 366K 1s
[task 2024-02-13T19:57:46.027Z] 150K .......... .......... .......... .......... .......... 39% 157M 1s
[task 2024-02-13T19:57:46.027Z] 200K .......... .......... .......... .......... .......... 49% 368K 1s
[task 2024-02-13T19:57:46.027Z] 250K .......... .......... .......... .......... .......... 59% 133M 0s
[task 2024-02-13T19:57:46.027Z] 300K .......... .......... .......... .......... .......... 69% 171M 0s
[task 2024-02-13T19:57:46.027Z] 350K .......... .......... .......... .......... .......... 79% 175M 0s
[task 2024-02-13T19:57:46.027Z] 400K .......... .......... .......... .......... .......... 89% 369K 0s
[task 2024-02-13T19:57:46.027Z] 450K .......... .......... .......... .......... .......... 99% 94.4M 0s
[task 2024-02-13T19:57:46.027Z] 500K ..... 100% 9.32T=0.8s
[task 2024-02-13T19:57:46.027Z]
[task 2024-02-13T19:57:46.027Z] 2024-02-13 19:57:45 (617 KB/s) - ‘/builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.txt.zip’ saved [517126/517126]
[task 2024-02-13T19:57:46.027Z]
[task 2024-02-13T19:57:46.027Z] + unzip -o /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.txt.zip -d /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1
[task 2024-02-13T19:57:46.027Z] Archive: /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.txt.zip
[task 2024-02-13T19:57:46.027Z] inflating: /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/README
[task 2024-02-13T19:57:46.027Z] inflating: /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/LICENSE
[task 2024-02-13T19:57:46.027Z] inflating: /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.en-ru.en
[task 2024-02-13T19:57:46.027Z] inflating: /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.en-ru.ru
[task 2024-02-13T19:57:46.027Z] inflating: /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.en-ru.xml
[task 2024-02-13T19:57:46.027Z] + for lang in ${src} ${trg}
[task 2024-02-13T19:57:46.027Z] + zstdmt -c /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.ru-en.ru
[task 2024-02-13T19:57:46.027Z] zstd: can't stat /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.ru-en.ru : No such file or directory -- ignored
[task 2024-02-13T19:57:46.027Z] + zstdmt -c /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.en-ru.ru
[task 2024-02-13T19:57:46.027Z] + for lang in ${src} ${trg}
[task 2024-02-13T19:57:46.027Z] + zstdmt -c /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.ru-en.en
[task 2024-02-13T19:57:46.027Z] zstd: can't stat /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.ru-en.en : No such file or directory -- ignored
[task 2024-02-13T19:57:46.027Z] + zstdmt -c /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1/ELRC-3075-wikipedia_health.en-ru.en
[task 2024-02-13T19:57:46.027Z] + rm -rf /builds/worker/artifacts/opus/ELRC_3075_wikipedia_health_v1
[task 2024-02-13T19:57:46.027Z] + echo '###### Done: Downloading opus corpus'
[task 2024-02-13T19:57:46.027Z] ###### Done: Downloading opus corpus
[task 2024-02-13T19:57:46.027Z] + echo '###### Done: Downloading dataset opus_ELRC-3075-wikipedia_health/v1'
[task 2024-02-13T19:57:46.027Z] ###### Done: Downloading dataset opus_ELRC-3075-wikipedia_health/v1
[task 2024-02-13T19:57:46.027Z]
[task 2024-02-13T19:57:46.027Z] Finished dataset import and augmentation.
[taskcluster 2024-02-13 19:57:46.310Z] === Task Finished ===
[taskcluster 2024-02-13 19:57:46.726Z] Successful task run with exit code: 0 completed in 16.049 seconds
Loading