Update master to Sockeye 2 #822

Merged
merged 141 commits into master from sockeye_2_merge_again on Jun 3, 2020

Conversation

@fhieber (Contributor) commented Jun 3, 2020

Merges Sockeye 2 (sockeye_2 branch) into master.

Commits should not be squashed

Pull Request Checklist

  • Changes are complete (if posting work-in-progress code, prefix your pull request title with '[WIP]'
    until you can check this box).
  • Unit tests pass (pytest)
  • Were system tests modified? If so, did you run them at least 5 times to account for the variation across runs?
  • System tests pass (pytest test/system)
  • Passed code style checking (./style-check.sh)
  • You have considered writing a test
  • Updated major/minor version in sockeye/__init__.py. Major version bump if this is a backwards incompatible change.
  • Updated CHANGELOG.md

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

fhieber added 30 commits June 7, 2019 15:10
…nding inference code. Removed weight normalization from OutputLayer (not used)
…th Mixed as it's not needed. Comment out tutorial args tests -> tutorials need updating to transformer models
fhieber and others added 21 commits February 25, 2020 10:30
* Update to MXNET 1.6.0

* Add CUDA 10.2

* changelog
* Fail on empty target validation sentences.
* Option for setting parameters in model

* Unit tests and flags for set_parameters

Co-authored-by: Currey <[email protected]>
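
The set_parameters option mentioned above assigns externally supplied arrays to named model parameters. A rough sketch of that idea for a Gluon-style model follows; the helper and its checks are illustrative only, not Sockeye's actual set_parameters implementation.

import mxnet as mx

def set_parameters(model, new_params):
    # Overwrite selected parameters by name; reject unknown names and
    # shape mismatches instead of silently ignoring them.
    params = model.collect_params()
    for name, value in new_params.items():
        if name not in params:
            raise ValueError("Parameter '%s' does not exist in the model" % name)
        if params[name].shape != value.shape:
            raise ValueError("Shape mismatch for '%s': %s vs. %s"
                             % (name, params[name].shape, value.shape))
        params[name].set_data(value)
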
* fp16 with fp32 accumulation on log_softmax

* Hybrid beam search take, removing encoder takes

* Bulk prepare inference input in CPU before sending all to GPU

* Beam search decoding set to model dtype instead of fp32

* Replaced split-concat with slicing, added and modified comments, and some renaming

* Fixed test failures and errors

* Model state structure and resolved cherry-picking artifacts

* Corrected comments to match correct variables and shapes

* Flat state list, nesting determined by state structure

* Type declarations for ensemble decoding states

* Updated changelog and version

* Convert accumulated scores back to fp32 before argsort
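
Two of the bullets above, fp16 with fp32 accumulation on log_softmax and converting accumulated scores back to fp32 before argsort, address the same precision concern. A hedged illustration of the pattern with MXNet NDArray ops; the shapes and variable names here are invented, not taken from Sockeye's beam search.

import mxnet as mx

# Hypothetical float16 decoder logits for a beam of 4 over a 32k vocabulary.
logits_fp16 = mx.nd.random.uniform(shape=(4, 32000), dtype='float16')

# log_softmax accepts an output dtype, so the reduction can run in float32
# even though the inputs are float16.
log_probs = mx.nd.log_softmax(logits_fp16, axis=-1, dtype='float32')

# Running beam costs kept in float16 are cast back to float32 before
# argsort so the ranking is not distorted by float16 rounding.
costs = -log_probs.astype('float16')   # stand-in for accumulated beam costs
best5 = mx.nd.argsort(costs.astype('float32'), axis=-1)[:, :5]  # lowest cost first
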
* Pad vocab to a multiple of 8 for quantization

* Single codebase using decoding float32 and int8 transformer, except embeddings

* No need for a space change in inference

* Remove logging code

* Undo changes to train.py defaults

* Allow casting to non-int8 types

* Move dtype to model

* Default to FullyConnected

* Remove unnecessary imports

* Comment weight initializer zeros

* Warning on cast

* Copyright on quantization.py, spacing fix

* Tuples as (1,)

* TransformerConfig doesn't have dtype anymore

* More dtype passing

* Output layer quantization

* Fix missing import/logger

* CPU-independent disk format

Works with this quantization program (TODO integrate):
import mxnet as mx
model = mx.nd.load("/home/ubuntu/idid-enus/model.amt.sf-concat/params.best")
# Names of the dense weight matrices to quantize; source embeddings and
# positional embeddings are excluded and stay in float32.
dense = [k[0:-7] for k in model.keys() if k.endswith('.weight') and not k.startswith("embedding_source.")]
dense.remove("encoder.pos_embedding")
dense.remove("decoder.pos_embedding")
for param in dense:
  name = param + ".weight"
  b = model[name]
  # Per-tensor scale is derived from the maximum absolute weight value.
  b_max = mx.nd.contrib.intgemm_maxabsolute(b)
  # The disk format just quantizes (no CPU-specific rearrangement).
  b_prepared = mx.nd.contrib.intgemm_prepare_data(b, b_max)
  model[name] = b_prepared
  model[param + ".scaling"] = b_max / 127.0
mx.nd.save("/home/ubuntu/idid-enus/model.amt.sf-concat.quant/params.best", model)
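
(Judging by the op names and the surrounding commits: intgemm_maxabsolute supplies the per-tensor max|w| used for the scale, and intgemm_prepare_data only quantizes to int8, which is what keeps the saved format CPU-independent; the CPU-dependent layout is produced separately at load time, per the convert_weights_disk_format / convert_weights_cpu_dependent split below.)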

* Update comment

* Version that loads a float32 model and quantizes on the fly
But it doesn't check all parameters are in the provided model

* Disk saving option

* Wrap comment to 80 characters

* C.DTYPE_INT8 and space after #

* No spacing around keyword arguments

* Typing on convert_weights_disk_format

Co-Authored-By: Felix Hieber <[email protected]>

* Typing on convert_weights_cpu_dependent

Co-Authored-By: Felix Hieber <[email protected]>

* Make calls friendly to custom operators

* Hacky way to find custom operator

* Configurable to custom operator

* fhieber's patch to dtypes

* C.DTYPE_FP32 and remove errant ,

* Quantization: minimize mean squared error for parameters

* Use cached quantization scaling

* Quantization: do on-the-fly directly

* Hackily restore model type to saving type

* Quantization: store scaling

* Fix use of existing scaling factors

Co-authored-by: Felix Hieber <[email protected]>
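
The minimize mean squared error and use cached quantization scaling bullets above describe choosing a per-tensor scale once and then reusing it. Below is a rough NumPy sketch of MSE-based scale selection; the candidate grid and search range are illustrative and not the actual search used in this PR.

import numpy as np

def mse_quant_scale(w, num_candidates=32):
    # Scan scales at and below max|w| / 127 and keep the one whose int8
    # round trip has the lowest mean squared error against the original.
    w = w.reshape(-1).astype(np.float32)
    base = max(float(np.abs(w).max()) / 127.0, 1e-12)
    best_scale, best_err = base, float('inf')
    for frac in np.linspace(0.5, 1.0, num_candidates):
        scale = base * frac
        q = np.clip(np.round(w / scale), -127, 127)
        err = float(np.mean((q * scale - w) ** 2))
        if err < best_err:
            best_scale, best_err = scale, err
    return best_scale  # cache per parameter so it is not recomputed on every load
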
* Quantize CLI, Docker build update, version/changelog update.
@fhieber merged commit 88dc440 into master on Jun 3, 2020
@fhieber deleted the sockeye_2_merge_again branch on June 3, 2020 at 09:30