token level timestep #78
Comments
@Mddct I don't quite understand your question and example. Currently we tokenize a string (aka the label) into a list of characters or a list of subwords.
Output each label together with its start time and end time in the original wav.
@Mddct Oh, we don't support that feature yet, but I'll look into it. Anyway, if you have any ideas for doing that, especially for the RNN transducer, could you post an update here?
I am trying to implement this feature for the transducer now, but I could not find any
related papers or articles.
Nguyễn Lê Huy <[email protected]> wrote on Tue, Dec 15, 2020 at 10:48 PM:
… @Mddct <https://github.com/Mddct> Oh, currently we don't support that
feature yet. But I'll look into it. Anyway, if you have any idea of doing
that, especially for rnn transducer, can you update here?
There's this, but it is for CTC; maybe we can apply it with some modification.
@usimarit It seems to be two steps: I will evaluate the accuracy later. But for other languages or our own corpus, we need to do forced alignment to get the time information.
Is it possible to output token-level timesteps?
e.g.:
hello 100-600
world 712-900
.......
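For reference, here is a rough sketch of the CTC-style idea mentioned in the comments, not anything this project currently provides. It assumes you already have the per-frame greedy (argmax) token ids from a CTC model; the function name `ctc_token_times`, the blank id `0`, and the 40 ms frame duration are all hypothetical choices for illustration. It collapses consecutive identical non-blank frames into one token and reports the span of frames that token covers.

```python
from typing import List, Tuple


def ctc_token_times(
    frame_ids: List[int],
    blank: int = 0,          # assumed blank id; depends on the model's vocabulary
    frame_ms: float = 40.0,  # assumed duration of one output frame in milliseconds
) -> List[Tuple[int, float, float]]:
    """Collapse per-frame argmax ids into (token_id, start_ms, end_ms) spans."""
    spans: List[Tuple[int, float, float]] = []
    prev = blank   # id seen on the previous frame
    start = 0      # frame index where the current run began
    for t, tok in enumerate(frame_ids):
        if tok != prev:
            # A run just ended; keep it only if it was a real token.
            if prev != blank:
                spans.append((prev, start * frame_ms, t * frame_ms))
            start = t
            prev = tok
    # Close a run that lasts until the final frame.
    if prev != blank:
        spans.append((prev, start * frame_ms, len(frame_ids) * frame_ms))
    return spans


# Frames: blank, 5, 5, blank, blank, 7, 7, 7, blank
print(ctc_token_times([0, 5, 5, 0, 0, 7, 7, 7, 0]))
```

CTC outputs tend to be spiky, so the spans from a sketch like this are only approximate, and an RNN transducer would need a different bookkeeping scheme (for example, recording the encoder frame at which each token is emitted).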