How to get the decoding result scores from #42
Comments
Hypothesis is a NamedTuple object, so you can access its attributes directly. See https://github.com/espnet/espnet/blob/master/espnet/nets/beam_search.py#L19-L33
Hi, thanks for your response. Following the link, I modified the code as follows:
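As a minimal sketch of what attribute access on a NamedTuple hypothesis looks like: the class below is a stand-in written for illustration, not the ESPnet class itself. Its field names are assumed to mirror the linked definition, `yseq` is a plain list instead of a tensor, and the token IDs and the `"decoder"`/`"ctc"`/`"lm"` keys are hypothetical. The numbers are the ones quoted in this thread.

```python
from typing import Any, Dict, NamedTuple, Sequence


class Hypothesis(NamedTuple):
    """Stand-in for a beam-search hypothesis (field names are an assumption)."""

    yseq: Sequence[int]            # decoded token IDs
    score: float = 0.0             # total score of the hypothesis
    scores: Dict[str, float] = {}  # per-scorer breakdown
    states: Dict[str, Any] = {}    # scorer states (unused here)


# Hypothetical token IDs; the scores are the values reported in this thread.
hyp = Hypothesis(
    yseq=[1, 7, 3, 2],
    score=-57.1623,
    scores={"decoder": -2.6879, "ctc": -0.8112, "lm": -55.0374},
)

# NamedTuple fields are read as ordinary attributes.
print("total score:", hyp.score)
for name, value in hyp.scores.items():
    print(f"  {name}: {value}")
```

With the real object, the same pattern would apply to whatever hypothesis the decoder returns alongside the text.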
By executing the following:
I got:

I think the number "-57.1623" is the result of log P_encdec(y|x) + log P_ctc(y|x) + log P_lm(y), where log P_encdec(y|x) is -2.6879, log P_ctc(y|x) is -0.8112, and log P_lm(y) is -55.0374, although the numbers do not quite match...

If I denote -57.1623 as nbests[0].score

Thanks a lot
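The mismatch mentioned above can be checked with a few lines of arithmetic. The sketch below compares the plain sum of the three terms with a weighted sum. The weights (`ctc_weight = 0.3`, `lm_weight = 1.0`, with the encoder-decoder term weighted by `1 - ctc_weight`) are an assumption, not something stated in this thread, but with those values the weighted sum reproduces the reported score, which suggests the total may be a weighted combination rather than a plain sum.

```python
# Values quoted in the comment above.
log_p_encdec = -2.6879
log_p_ctc = -0.8112
log_p_lm = -55.0374
reported_score = -57.1623

# Plain (unweighted) sum of the three log-probabilities.
plain_sum = log_p_encdec + log_p_ctc + log_p_lm

# Assumed weights; the actual decoding configuration is not shown in the thread.
ctc_weight = 0.3
lm_weight = 1.0
weighted_sum = (
    (1.0 - ctc_weight) * log_p_encdec
    + ctc_weight * log_p_ctc
    + lm_weight * log_p_lm
)

print(f"plain sum:    {plain_sum:.4f}")     # -58.5365
print(f"weighted sum: {weighted_sum:.4f}")  # -57.1623
print(f"reported:     {reported_score:.4f}")
```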
You can get the arbitrary n-best scores by giving
Thanks for the comment. I currently treat the "score" (i.e., -57.1623) as a rough confidence score that indicates how confident the model is in its prediction for the audio. From my observation, the score of nbests[0] is higher than that of nbests[1]. I guess that is adequate for my purpose.
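If a value between 0 and 1 is easier to work with than a raw log-domain score, one common option is to normalize the n-best scores with a softmax. This is only a sketch: `nbest_posteriors` is a hypothetical helper, not part of ESPnet, and the second score in the example is made up for illustration. Because the scores may combine weighted terms, the result is a relative ranking within the n-best list, not a calibrated probability.

```python
import math
from typing import List


def nbest_posteriors(log_scores: List[float]) -> List[float]:
    """Softmax over n-best log-domain scores (hypothetical helper)."""
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(log_scores)
    exps = [math.exp(s - top) for s in log_scores]
    total = sum(exps)
    return [e / total for e in exps]


# First value is from this thread; the second is a made-up illustration.
nbest_scores = [-57.1623, -58.9]
for rank, p in enumerate(nbest_posteriors(nbest_scores)):
    print(f"nbests[{rank}] relative confidence: {p:.3f}")
```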
Hi,
Thanks for the work. I am trying to use the pre-trained model, but I don't know how to get the decoding score for the corresponding decoding results.
The code above only prints the text. I would like to get the decoding confidence as well.
I checked the speech2text class.
From the code above, I conjecture that the confidence can be obtained from "hyp", but it is not clear to me how to parse "hyp" to get the score.