After saved, new words created, duplicates. #10
👋 Thanks for flagging this, …
It also happens on save; all of the timecode export options have the same problem.
OK, that makes sense: saving runs the timecode realignment. What do you use for speech-to-text before converting to DPE format?
I can't seem to reproduce it in Storybook. What version of slate-transcript-editor are you on? Update: sorry, I see you said 0.0.15 in the first post.
I am using Azure, Xfyun, etc. This file is from Azure; I converted it to DPE.
@pietrop, we are seeing this too. A student at American University had their transcript rendered pretty much unusable today when some words were replicated thousands of times across different portions of the transcript. We use Google STT before converting to DPE, and we sometimes see the issue upon saving.
Thanks for flagging this @overZellis133, it'd be good to take a closer look at the sample data.
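For anyone wanting to check their own sample data, here is a minimal sketch of one way to spot the duplication: compare how often each word appears before and after a save or export. It assumes both sides are arrays of DPE-style words with a `text` field; `DpeWord`, `countByText` and `findInflatedWords` are hypothetical names for illustration, not part of slate-transcript-editor.

```typescript
// Assumed minimal shape of a DPE word; real files may carry extra fields.
interface DpeWord {
  start: number;
  end: number;
  text: string;
}

interface InflatedWord {
  text: string;
  before: number;
  after: number;
}

// Count occurrences of each word, ignoring case and surrounding whitespace.
function countByText(words: DpeWord[]): Map<string, number> {
  const counts = new Map<string, number>();
  for (const w of words) {
    const key = w.text.trim().toLowerCase();
    counts.set(key, (counts.get(key) || 0) + 1);
  }
  return counts;
}

// Hypothetical helper: list words that occur more often after saving
// than before, largest increase first.
function findInflatedWords(before: DpeWord[], after: DpeWord[]): InflatedWord[] {
  const beforeCounts = countByText(before);
  const afterCounts = countByText(after);
  const inflated: InflatedWord[] = [];
  afterCounts.forEach((afterCount, text) => {
    const beforeCount = beforeCounts.get(text) || 0;
    if (afterCount > beforeCount) {
      inflated.push({ text, before: beforeCount, after: afterCount });
    }
  });
  return inflated.sort((x, y) => (y.after - y.before) - (x.after - x.before));
}
```

A legitimate edit can also raise a word's count, so this only narrows down where to look; a word whose count jumps by hundreds or thousands is the pattern described above.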
To recap our convo: I am not sure if this is caused by the conversion of the data provided to SlateJs. It takes DPE format. For GCP, I made a converter, pietrop/gcp-to-dpe. In the latest v2 this is refactored (removing the intermediate draftJs conversion, as it originally came from @bbc/react-transcript-editor), and it needs/uses GCP speaker diarization to break paragraphs on speaker change. So it's worth trying that to see if the issue still persists.
Thanks. I will try the GCP converter sometime later.
To make @bbc/react-transcript-editor work better with files several hours long, I customized it to paginate the transcript: only a portion is loaded into the editor at a time, so the browser doesn't hang. It works for my project.
I also added a paragraph mode, color changes for edited text, retrieval of the original transcript, and alignment against the original to color the changes (for text that was copied and pasted rather than edited in place).
[image: image.png]
On Wed, 28 Oct 2020 at 7:31 AM, Pietro <[email protected]> wrote:
… To recap our convo
I am not sure if this is caused by the conversion of the data provided to
SlateJs. It takes DPE format.
For GCP, I made a converter, pietrop/gcp-to-dpe
<https://github.com/pietrop/gcp-to-dpe>, in latest v2, this is refactored
(removing intermediate draftJs conversion, as it originally came from
@bbc/react-transcript-editor), and needs/uses GCP Speaker diarization to
break paragraphs on speaker change.
So worth trying using that, and see if issue still persists.
Ah, that's interesting, I wasn't able to see the image though. Are you using @pietrop/slate-transcript-editor or the earlier @bbc/react-transcript-editor?
Earlier one.
On Fri, 6 Nov 2020, 11:34, Pietro wrote:
ah, that's interesting, wasn't able to see the image tho?
you are using @pietrop/slate-transcript-editor or earlier version @bbc/react-transcript-editor?
Cool, yeah, it'd be good to see what the pagination looks like, if you're able to share that as a PR in …
Yeah, no rush, and no worries if the code isn't perfect. It'd just be interesting to see the code/PR to understand the concept/idea behind the pagination in draftJS 😊
Hm, I did it outside of draftJS: the pagination is in the editor component, not in draftJS. I added one more prop to the editor to pass in the whole transcript, but only one page goes to the editor (as DPE, then draftJS) for editing. When the page changes, the current page is saved back into memory (a slice of the array) and the requested page is loaded into the editor. On Save, the whole transcript is saved locally or to the database.
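The approach described above might look roughly like the following. This is only a sketch under stated assumptions, not the commenter's actual code: it assumes a DPE-style transcript with `words` and `paragraphs` arrays carrying `start`/`end` times, pages by paragraph count, and uses hypothetical helper names (`splitIntoPages`, `replacePage`, `mergePages`).

```typescript
// Assumed minimal DPE-style shapes; real transcripts may carry extra fields.
interface DpeWord { start: number; end: number; text: string; }
interface DpeParagraph { start: number; end: number; speaker: string; }
interface DpeTranscript { words: DpeWord[]; paragraphs: DpeParagraph[]; }

// Split the whole transcript into pages of `pageSize` paragraphs.
// Each word goes to exactly one page, chosen by its start time.
function splitIntoPages(t: DpeTranscript, pageSize: number): DpeTranscript[] {
  if (pageSize < 1) {
    throw new Error("pageSize must be at least 1");
  }
  if (t.paragraphs.length === 0) {
    return [{ words: t.words, paragraphs: [] }];
  }
  const pages: DpeTranscript[] = [];
  for (let i = 0; i < t.paragraphs.length; i += pageSize) {
    const paragraphs = t.paragraphs.slice(i, i + pageSize);
    // The first page also takes any words before the first paragraph starts.
    const from = i === 0 ? -Infinity : paragraphs[0].start;
    // A page ends where the next page's first paragraph begins.
    const next = t.paragraphs[i + pageSize];
    const to = next ? next.start : Infinity;
    const words = t.words.filter((w) => w.start >= from && w.start < to);
    pages.push({ words, paragraphs });
  }
  return pages;
}

// On page change: store the edited page back into the in-memory array.
function replacePage(pages: DpeTranscript[], index: number, edited: DpeTranscript): DpeTranscript[] {
  return pages.map((page, i) => (i === index ? edited : page));
}

// On Save: stitch all pages back into one transcript.
function mergePages(pages: DpeTranscript[]): DpeTranscript {
  return {
    words: pages.reduce<DpeWord[]>((all, p) => all.concat(p.words), []),
    paragraphs: pages.reduce<DpeParagraph[]>((all, p) => all.concat(p.paragraphs), []),
  };
}
```

Only the current page would be handed to the editor; the remaining pages stay untouched in memory, which is what keeps a multi-hour transcript from hanging the browser.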
Closing in favor of this, …
Describe the bug
"slate-transcript-editor": "0.0.15",
3cbfe40c-99a9-463a-b58e-39d14e02cdcb.wav.dpe.json.zip
I am using the Slate editor. The export function has a problem: when exporting with timecodes, it changes the transcript and introduces duplicate words.
Changed to: