Skip to content

Files

Latest commit

 

History

History
64 lines (55 loc) · 3.96 KB

README.md

File metadata and controls

64 lines (55 loc) · 3.96 KB

Results

  • Setup: fbank80, num_frms200, epoch150, ArcMargin, aug_prob0.6, speed_perturb (no spec_aug)

  • Scoring: cosine (sub mean of vox2_dev)

  • Metric: EER(%)

  • 🔥 UPDATE 2022.07.19: We apply the same setups as the winning system of CNSRC 2022 (see cnceleb recipe for details), and obtain significant performance improvement.

    • LR scheduler warmup from 0
    • Remove one embedding layer in ResNet models
    • Add large margin fine-tuning strategy (LM)
Model Params Flops LM AS-Norm vox1-O-clean vox1-E-clean vox1-H-clean
XVEC-TSTP-emb512 4.61M 0.53G × × 1.989 1.209 3.412
× 1.834 1.846 3.124
× 1.749 1.721 2.944
1.590 1.641 2.726
ECAPA_TDNN_GLOB_c512-ASTP-emb192 6.19M 1.04G × × 1.069 1.209 2.310
× 0.957 1.128 2.105
× 0.878 1.072 2.007
0.782 1.005 1.824
ECAPA_TDNN_GLOB_c1024-ASTP-emb192 14.65M 2.65G × × 0.856 1.072 2.059
× 0.808 0.990 1.874
× 0.798 0.993 1.883
0.728 0.929 1.721
ResNet34-TSTP-emb256 6.63M 4.55G × × 0.867 1.049 1.959
× 0.787 0.964 1.726
× 0.797 0.937 1.695
0.723 0.867 1.532
ResNet221-TSTP-emb256 23.79M 21.29G × × 0.569 0.774 1.464
× 0.479 0.707 1.290
× 0.580 0.729 1.351
0.505 0.676 1.213
ResNet293-TSTP-emb256 28.62M 28.10G × × 0.595 0.756 1.433
× 0.537 0.701 1.276
× 0.532 0.707 1.311
0.447 0.657 1.183
RepVGG_TINY_A0 6.26M 4.65G × × 0.909 1.034 1.943
× 0.824 0.953 1.709
CAM++ 7.18M 1.15G × × 0.803 0.932 1.860
× 0.718 0.879 1.735
x 0.707 0.845 1.664
0.659 0.803 1.569
ERes2Net34_Base 7.88M 3.43G × × 0.914 1.065 1.986
× 0.803 0.976 1.787
x 0.824 0.968 1.776
0.744 0.896 1.603
Res2Net34_Base 4.68M 1.77G × × 1.351 1.347 2.478
× 1.234 1.232 2.162

PLDA results

If you are interested in the PLDA scoring (which is inferior to the simple cosine scoring under the margin based setting), simply run:

local/score_plda.sh --stage 1 --stop-stage 3 --exp_dir exp_name

The results on ResNet34 (large margin, no asnorm) are:

Scoring method vox1-O-clean vox1-E-clean vox1-H-clean
PLDA 1.207 1.350 2.528