[FR] About status of text-corpora analysis. #57
Labels
analyzer
cv-tbodataset-analyzer related
compiler
cv-tbox-dataset-compiler related
help wanted
Extra attention is needed
Mozilla Common Voice started to use the database for new text-corpus directly, without exporting newly added (validated) sentences to the public. Therefore, our analysis on text-corpora is outdated (not changed after March 2023 release v13.0).
You can read about the issue and possible solutions on the Common Voice repo:
common-voice/common-voice#4100
It seems until it is fixed, there is nothing we can do about this. Any other idea is most welcome.
The text was updated successfully, but these errors were encountered: