scoring poor NMT outputs #8

keisks · 2018-01-23T03:40:23Z

Hello,

Thank you for developing M2 scorer!

I recently ran into a problem when I use m2 script for poor Neural MT outputs.

e.g., When I evaluated the following poor NMT output (for sentence id 333), the m2script takes very long time to compute.
In my environment, it takes more than 5 hours and is still running...

As it is a genetic risk , the patient force might have a high chance of carrying the risk , hence the need to inform their relatives is important . Hence , you are suffering from a genetic disease that the genetic trait might be passed on to your next generation if you have a child . Hence , there is no legal obligation to disclose to their family members , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there is no legal obligation . Hence , there

Is this an expected behavior and is there a way to work around?

Thank you,

shamilcm · 2018-01-23T05:57:05Z

The original M2 algorithm and implementation is not optimized to cases where the output is very different from the source sentence. We will try to release an optimized implementation soon. Meanwhile, if it is to validate an NMT system, you may try replacing the output sentence with the source sentence itself if the edit distance is very high. The M2 implementation within Moses tries to do something similar by avoiding extremely different sentences compared to the source.

keisks · 2018-01-23T15:14:26Z

Thank you for the suggestion and I look forward to the optimized version :)

shm007g · 2019-12-11T07:40:00Z

m2scorer is relly slow when I evaluate my GEC data.

It takes hours just for evaluating 2000 short(<300) sentence.

By the way, I am using the official version of 3.2.

amal-meer · 2021-04-06T11:04:17Z

Is there a solution to this? I can not evaluate my GEC system although the testing data is 980 sentence with a maximum length of 433. It took more that 7 hours and still running.

amal-meer · 2021-04-07T06:19:48Z

I run the script on a PC with higher specifications and it finished running in less than 6 hours. It is too long but I added this note for those who might have the same problem.

keisks closed this as completed Jan 23, 2018

shamilcm added the enhancement label Mar 6, 2018

shamilcm reopened this Mar 6, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scoring poor NMT outputs #8

scoring poor NMT outputs #8

keisks commented Jan 23, 2018

shamilcm commented Jan 23, 2018 •

edited

Loading

keisks commented Jan 23, 2018

shm007g commented Dec 11, 2019 •

edited

Loading

amal-meer commented Apr 6, 2021

amal-meer commented Apr 7, 2021

scoring poor NMT outputs #8

scoring poor NMT outputs #8

Comments

keisks commented Jan 23, 2018

shamilcm commented Jan 23, 2018 • edited Loading

keisks commented Jan 23, 2018

shm007g commented Dec 11, 2019 • edited Loading

amal-meer commented Apr 6, 2021

amal-meer commented Apr 7, 2021

shamilcm commented Jan 23, 2018 •

edited

Loading

shm007g commented Dec 11, 2019 •

edited

Loading