Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Revert CPU generic workers #629

Merged
merged 2 commits into from
May 27, 2024

Trigger CI

9db5284
Select commit
Loading
Failed to load commit list.
Merged

Revert CPU generic workers #629

Trigger CI
9db5284
Select commit
Loading
Failed to load commit list.
firefoxci-taskcluster / merge-corpus-ru-en succeeded May 24, 2024 in 13m 45s

FirefoxCI (pull_request)

merge corpus for ru-en

Details

View task in Taskcluster
View logs in Taskcluster


[taskcluster 2024-05-24 17:02:38.488Z] Task ID: CNkzq1f4TQW7VSM1Md0CAg
[taskcluster 2024-05-24 17:02:38.489Z] Worker ID: 459303397715966052
[taskcluster 2024-05-24 17:02:38.489Z] Worker Group: us-central1-f
[taskcluster 2024-05-24 17:02:38.489Z] Worker Node Type: projects/887720501152/machineTypes/n2-highmem-32
[taskcluster 2024-05-24 17:02:38.489Z] Worker Pool: translations-1/b-linux-large-gcp-300gb
[taskcluster 2024-05-24 17:02:38.489Z] Worker Version: 38.0.5
[taskcluster 2024-05-24 17:02:38.489Z] Public IP: 104.197.135.232
[taskcluster 2024-05-24 17:02:38.489Z] Hostname: translations-1-b-linux-large-gcp-300gb-br0qyfjtr7qh1sdjrxxvfq
[taskcluster 2024-05-24 17:02:38.489Z] using cache "translations-level-1-checkouts-v3-7afeb851dd97df8f3607-Q_deBg9LRdWXU21Qhn7s6Q" -> /builds/worker/checkouts

[taskcluster 2024-05-24 17:02:39.089Z] Image 'public/image.tar.zst' from task 'Q_deBg9LRdWXU21Qhn7s6Q' loaded.  Using image ID sha256:1e62b245ce9b86147f4b5a4cdcf3708c9a763f8cdaa58df95c74f81427d083ab.
[taskcluster 2024-05-24 17:02:39.137Z] === Task Starting ===
[setup 2024-05-24T17:02:39.468Z] run-task started in /builds/worker
[setup 2024-05-24T17:02:39.468Z] Invoked by command: --firefox_translations_training-checkout=/builds/worker/checkouts/vcs/ -- bash -c export BIN=$MOZ_FETCHES_DIR && $VCS_PATH/pipeline/clean/merge-corpus.sh artifacts/corpus $MOZ_FETCHES_DIR/*.zst
[setup 2024-05-24T17:02:39.468Z] Python version: 3.10.12
[cache 2024-05-24T17:02:39.470Z] cache /builds/worker/checkouts exists; requirements: gid=1000 uid=1000 version=1
[volume 2024-05-24T17:02:39.470Z] volume /builds/worker/checkouts is a cache
[setup 2024-05-24T17:02:39.470Z] running as worker:worker
[vcs 2024-05-24T17:02:39.470Z] executing ['git', 'config', '--global', '--add', 'safe.directory', '/builds/worker/checkouts/vcs']
[vcs 2024-05-24T17:02:39.472Z] executing ['git', 'fetch', '--tags', '--force', 'https://github.com/mozilla/firefox-translations-training', 'revert_d2g_pool']
[vcs 2024-05-24T17:02:39.681Z] From https://github.com/mozilla/firefox-translations-training
[vcs 2024-05-24T17:02:39.681Z]  * branch            revert_d2g_pool -> FETCH_HEAD
[vcs 2024-05-24T17:02:39.687Z] executing ['git', 'fetch', '--no-tags', 'https://github.com/mozilla/firefox-translations-training', 'revert_d2g_pool']
[vcs 2024-05-24T17:02:39.884Z] From https://github.com/mozilla/firefox-translations-training
[vcs 2024-05-24T17:02:39.884Z]  * branch            revert_d2g_pool -> FETCH_HEAD
[vcs 2024-05-24T17:02:39.890Z] executing ['git', 'checkout', '-f', '-B', 'revert_d2g_pool', '9db52842e17a9fa0ececcfc30809128fa1c52288']
[vcs 2024-05-24T17:02:39.895Z] Switched to a new branch 'revert_d2g_pool'
[vcs 2024-05-24T17:02:39.896Z] executing ['git', 'submodule', 'init']
[vcs 2024-05-24T17:02:39.916Z] executing ['git', 'submodule', 'update', '--force']
[vcs 2024-05-24T17:02:39.948Z] Submodule path '3rd_party/browsermt-marian-dev': checked out '11c6ae7c46be21ef96ed10c60f28022fa968939f'
[vcs 2024-05-24T17:02:39.958Z] Submodule path '3rd_party/extract-lex': checked out '42fa605b53f32eaf6c6e0b5677255c21c91b3d49'
[vcs 2024-05-24T17:02:39.968Z] Submodule path '3rd_party/fast_align': checked out 'cab1e9aac8d3bb02ff5ae58218d8d225a039fa11'
[vcs 2024-05-24T17:02:39.978Z] Submodule path '3rd_party/kenlm': checked out 'bbf4fc511266c5d4515047055d7bdec659a6e158'
[vcs 2024-05-24T17:02:39.993Z] Submodule path '3rd_party/marian-dev': checked out 'e8a1a2530fb84cbff7383302ebca393e5875c441'
[vcs 2024-05-24T17:02:40.003Z] Submodule path '3rd_party/preprocess': checked out '64307314b4d5a9a0bd529b5c1036b0710d995eec'
[vcs 2024-05-24T17:02:40.003Z] cleaning git checkout...
[vcs 2024-05-24T17:02:40.003Z] executing ['git', 'clean', '-nxdff']
[vcs 2024-05-24T17:02:40.006Z] removing []
[vcs 2024-05-24T17:02:40.006Z] successfully cleaned git checkout!
[vcs 2024-05-24T17:02:40.008Z] TinderboxPrint:<a href='https://github.com/mozilla/firefox-translations-training/commit/9db52842e17a9fa0ececcfc30809128fa1c52288' title='Built from firefox-translations-training commit 9db52842e17a9fa0ececcfc30809128fa1c52288'>9db52842e17a9fa0ececcfc30809128fa1c52288</a>
[setup 2024-05-24T17:02:40.008Z] MOZ_FETCHES_DIR is /builds/worker/fetches
[fetches 2024-05-24T17:02:40.008Z] fetching artifacts
[fetches 2024-05-24T17:02:40.008Z] executing ['/usr/bin/python3', '-u', '/usr/local/bin/fetch-content', 'task-artifacts']
attempt 1/5
Downloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/cEFbQlaUQxmnC9hffjUAaA/artifacts/public/build/ELRC-3075-wikipedia_health_v1.en.zst to /builds/worker/fetches/ELRC-3075-wikipedia_health_v1.en.zst
attempt 1/5Downloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/cEFbQlaUQxmnC9hffjUAaA/artifacts/public/build/ELRC-3075-wikipedia_health_v1.en.zst

Downloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/cEFbQlaUQxmnC9hffjUAaA/artifacts/public/build/ELRC-3075-wikipedia_health_v1.ru.zst to /builds/worker/fetches/ELRC-3075-wikipedia_health_v1.ru.zst
Downloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/cEFbQlaUQxmnC9hffjUAaA/artifacts/public/build/ELRC-3075-wikipedia_health_v1.ru.zstattempt 1/5

Downloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/dllI0BLIT4WW6OOIFPP1ow/artifacts/public/build/ada83_v1.en.zst to /builds/worker/fetches/ada83_v1.en.zst
attempt 1/5
Downloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/dllI0BLIT4WW6OOIFPP1ow/artifacts/public/build/ada83_v1.ru.zst to /builds/worker/fetches/ada83_v1.ru.zst
attempt 1/5
Downloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/dt7X26lwS_Gyv8ni2I08PQ/artifacts/public/build/gcp_pytest-dataset_a0017e.en.zst to /builds/worker/fetches/gcp_pytest-dataset_a0017e.en.zstDownloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/dllI0BLIT4WW6OOIFPP1ow/artifacts/public/build/ada83_v1.ru.zst

Downloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/dt7X26lwS_Gyv8ni2I08PQ/artifacts/public/build/gcp_pytest-dataset_a0017e.en.zstattempt 1/5
attempt 1/5
Downloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/dt7X26lwS_Gyv8ni2I08PQ/artifacts/public/build/gcp_pytest-dataset_a0017e.ru.zst to /builds/worker/fetches/gcp_pytest-dataset_a0017e.ru.zst

Downloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/dllI0BLIT4WW6OOIFPP1ow/artifacts/public/build/ada83_v1.en.zstDownloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/AgPlxUayScSDmqgddgY9pw/artifacts/public/build/dedupe.tar.zst to /builds/worker/fetches/dedupe.tar.zst

Downloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/dt7X26lwS_Gyv8ni2I08PQ/artifacts/public/build/gcp_pytest-dataset_a0017e.ru.zstDownloading https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/AgPlxUayScSDmqgddgY9pw/artifacts/public/build/dedupe.tar.zst

https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/dt7X26lwS_Gyv8ni2I08PQ/artifacts/public/build/gcp_pytest-dataset_a0017e.ru.zst resolved to 755 bytes with sha256 37b8ecdecfaaefbbcc0555a44d8f1201c09d9e0c58e5a1ee8fec159ddee93375 in 0.111s
https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/dt7X26lwS_Gyv8ni2I08PQ/artifacts/public/build/gcp_pytest-dataset_a0017e.en.zst resolved to 539 bytes with sha256 1584216350b567c80f2e92b665881d19b4466596d4c848af460c1c16bfd193ab in 0.117s
https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/dllI0BLIT4WW6OOIFPP1ow/artifacts/public/build/ada83_v1.ru.zst resolved to 164865 bytes with sha256 741811858877b11c10de47ff79ac2278cfa4272793769563a66c3e1786b94e22 in 0.151s
https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/cEFbQlaUQxmnC9hffjUAaA/artifacts/public/build/ELRC-3075-wikipedia_health_v1.ru.zst resolved to 218815 bytes with sha256 a81b233f3e4eb53b599b1f4afab8cd1debcaaf09fedad5af44c9d27ea2dbe7f0 in 0.158s
https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/dllI0BLIT4WW6OOIFPP1ow/artifacts/public/build/ada83_v1.en.zst resolved to 110134 bytes with sha256 b5aa13577484746f222b9a43f4117147420505eb606986d4f3b6aaa64132a920 in 0.162s
https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/cEFbQlaUQxmnC9hffjUAaA/artifacts/public/build/ELRC-3075-wikipedia_health_v1.en.zst resolved to 154187 bytes with sha256 9a5c3cde479c654c86e8635d79003eb1e6e36fa71d88ddcfbbc4d7f3541f6082 in 0.177s
https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/AgPlxUayScSDmqgddgY9pw/artifacts/public/build/dedupe.tar.zst resolved to 133246 bytes with sha256 1e567b7eabfbbac81c949ba165181bf3b4b5a15d4fcd114a0fc1d7ec82cfac07 in 0.237s
Extracting /builds/worker/fetches/dedupe.tar.zst to /builds/worker/fetches
/builds/worker/fetches/dedupe.tar.zst extracted in 0.002s
Removing /builds/worker/fetches/dedupe.tar.zst
PERFHERDER_DATA: {"framework": {"name": "build_metrics"}, "suites": [{"name": "fetch_content", "value": 0.2461546210000165, "lowerIsBetter": true, "shouldAlert": false, "subtests": []}]}
[fetches 2024-05-24T17:02:40.331Z] finished fetching artifacts
[task 2024-05-24T17:02:40.331Z] executing ['bash', '-c', 'export BIN=$MOZ_FETCHES_DIR && $VCS_PATH/pipeline/clean/merge-corpus.sh artifacts/corpus $MOZ_FETCHES_DIR/*.zst']
[task 2024-05-24T17:02:40.333Z] + set -euo pipefail
[task 2024-05-24T17:02:40.333Z] + echo '###### Merging parallel datasets'
[task 2024-05-24T17:02:40.333Z] ###### Merging parallel datasets
[task 2024-05-24T17:02:40.333Z] + test -v SRC
[task 2024-05-24T17:02:40.333Z] + test -v TRG
[task 2024-05-24T17:02:40.333Z] + test -v BIN
[task 2024-05-24T17:02:40.333Z] + output_prefix=artifacts/corpus
[task 2024-05-24T17:02:40.333Z] + inputs=("${@:2}")
[task 2024-05-24T17:02:40.333Z] + COMPRESSION_CMD=zstdmt
[task 2024-05-24T17:02:40.333Z] + ARTIFACT_EXT=zst
[task 2024-05-24T17:02:40.333Z] + tmp=artifacts/corpus/merge
[task 2024-05-24T17:02:40.333Z] + mkdir -p artifacts/corpus/merge
[task 2024-05-24T17:02:40.335Z] + echo '### Merging'
[task 2024-05-24T17:02:40.335Z] ### Merging
[task 2024-05-24T17:02:40.335Z] + [[ /builds/worker/fetches/ada83_v1.en.zst == *.zst ]]
[task 2024-05-24T17:02:40.335Z] ++ tr ' ' '\n'
[task 2024-05-24T17:02:40.335Z] ++ echo /builds/worker/fetches/ada83_v1.en.zst /builds/worker/fetches/ada83_v1.ru.zst /builds/worker/fetches/ELRC-3075-wikipedia_health_v1.en.zst /builds/worker/fetches/ELRC-3075-wikipedia_health_v1.ru.zst /builds/worker/fetches/gcp_pytest-dataset_a0017e.en.zst /builds/worker/fetches/gcp_pytest-dataset_a0017e.ru.zst
[task 2024-05-24T17:02:40.335Z] ++ grep ru.zst
[task 2024-05-24T17:02:40.335Z] ++ tr '\n' ' '
[task 2024-05-24T17:02:40.337Z] + cat /builds/worker/fetches/ada83_v1.ru.zst /builds/worker/fetches/ELRC-3075-wikipedia_health_v1.ru.zst /builds/worker/fetches/gcp_pytest-dataset_a0017e.ru.zst
[task 2024-05-24T17:02:40.338Z] ++ echo /builds/worker/fetches/ada83_v1.en.zst /builds/worker/fetches/ada83_v1.ru.zst /builds/worker/fetches/ELRC-3075-wikipedia_health_v1.en.zst /builds/worker/fetches/ELRC-3075-wikipedia_health_v1.ru.zst /builds/worker/fetches/gcp_pytest-dataset_a0017e.en.zst /builds/worker/fetches/gcp_pytest-dataset_a0017e.ru.zst
[task 2024-05-24T17:02:40.338Z] ++ tr ' ' '\n'
[task 2024-05-24T17:02:40.338Z] ++ grep en.zst
[task 2024-05-24T17:02:40.338Z] ++ tr '\n' ' '
[task 2024-05-24T17:02:40.339Z] + cat /builds/worker/fetches/ada83_v1.en.zst /builds/worker/fetches/ELRC-3075-wikipedia_health_v1.en.zst /builds/worker/fetches/gcp_pytest-dataset_a0017e.en.zst
[task 2024-05-24T17:02:40.340Z] + echo '### Deduplication'
[task 2024-05-24T17:02:40.340Z] ### Deduplication
[task 2024-05-24T17:02:40.341Z] + /builds/worker/fetches/dedupe
[task 2024-05-24T17:02:40.341Z] + zstdmt
[task 2024-05-24T17:02:40.341Z] ++ zstdmt -dc artifacts/corpus/merge/corpus.ru.dup.zst
[task 2024-05-24T17:02:40.341Z] + paste /dev/fd/63 /dev/fd/62
[task 2024-05-24T17:02:40.341Z] ++ zstdmt -dc artifacts/corpus/merge/corpus.en.dup.zst
[task 2024-05-24T17:02:40.342Z] File stdin isn't normal.  Using slower read() instead of mmap().  No progress bar.
[task 2024-05-24T17:02:40.361Z] Kept 7028 / 7181 = 0.978694
[task 2024-05-24T17:02:40.378Z] + zstdmt -dc artifacts/corpus/merge.ruen.zst
[task 2024-05-24T17:02:40.378Z] + cut -f1
[task 2024-05-24T17:02:40.378Z] + zstdmt
[task 2024-05-24T17:02:40.404Z] + zstdmt -dc artifacts/corpus/merge.ruen.zst
[task 2024-05-24T17:02:40.404Z] + cut -f2
[task 2024-05-24T17:02:40.404Z] + zstdmt
[task 2024-05-24T17:02:40.421Z] + rm -rf artifacts/corpus/merge
[task 2024-05-24T17:02:40.422Z] + echo '###### Done: Merging parallel datasets'
[task 2024-05-24T17:02:40.422Z] ###### Done: Merging parallel datasets
[fetches 2024-05-24T17:02:40.422Z] removing /builds/worker/fetches
[fetches 2024-05-24T17:02:40.423Z] finished
[taskcluster 2024-05-24 17:02:40.637Z] === Task Finished ===
[taskcluster 2024-05-24 17:02:41.312Z] Successful task run with exit code: 0 completed in 2.824 seconds