Skip to content

Commit

Permalink
ara-Amiri traineddata at char train=1.127%
Browse files Browse the repository at this point in the history
  • Loading branch information
Shreeshrii committed Feb 25, 2019
1 parent ba03b08 commit 749c723
Show file tree
Hide file tree
Showing 3 changed files with 30 additions and 4 deletions.
26 changes: 26 additions & 0 deletions Arabic-TOC-ara-Amiri.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,26 @@
الجفا ...... ............................................. ‎٨٧٢‏
‏غرام مشُميت ............................................... ‎٨٧٣‏
‏الفؤاد الكسير .............................................. ‎٨٧٤‏
‏عَقيقَ في عقيق في عقيقي ....................................... ‎٨٧٥‏
‏الباب الحادي عشر: مُتفرزقات .................................... ‎٨٧٧‏
‏الشاي ................................................... ‎٨٧٩‏
‏مَديح الشاي ............................................... ‎٨٨٠٩‏
‏مسك الشاي ............................................... ‎٨٨١‏
‏ليلة الشاي ................................................ ‎٨٨٢‏
‏رجال السر ...... ......................................... ‎٨٨٣‏
‏في فضل الاجتماع ...... ..................................... ‎٨٨٤‏
‏شججة ................................................... ‎٨٨٥‏
‏لله دَبني رَوَاحة ............................................ ‎٨٨٦‏
‏خطة عَبِْيّة ..... .......................................... ‎٨٨٧‏
‏مزايا الزمان ............................................... ‎٨٨٨‏
‏عَشرَاء ................................................... ‎٨٨٩‏
‏قطع علاقة في عتاب ......................................... ‎٨٩١‏
‏مُعاتبة ...... ............................................. ‎٨٩٢‏
‏السمكة .................................................. ‎٨٩٤‏
‏نظرة ...... .............................................. ‎٨٩٥‏
‏القطار ................................................... ‎٨٩٦‏
‏المعالي .................................................. ‎٨٩٨‏
‏المصادر والمراجع ............................................ ‎٩٠٠‏
‏الفهرس ................................................... ‎٩٠٢‏
‎٩١٢‏

Binary file modified ara-Amiri.traineddata
Binary file not shown.
8 changes: 4 additions & 4 deletions build/tesstrain_amiri.sh
Original file line number Diff line number Diff line change
Expand Up @@ -6,8 +6,8 @@
################################################################
# variables to set tasks performed
################################################################
MakeEval=yes
MakePlusMinus=yes
MakeEval=NO
MakePlusMinus=NO
RunPlusTraining=yes
RunEval=yes
################################################################l
Expand Down Expand Up @@ -128,7 +128,7 @@ fi

if [ $RunPlusTraining = "yes" ]; then

for ((PlusMinusIterations=4300; PlusMinusIterations<=7000; PlusMinusIterations+=1000)); do
for ((PlusMinusIterations=6300; PlusMinusIterations<=7300; PlusMinusIterations+=500)); do

echo "#### Plus-Minus training Using Amiri text #####"
lstmtraining \
Expand Down Expand Up @@ -173,7 +173,7 @@ tesseract $eval_output_dir/ara.Amiri.exp0.tif ara.Amiri.exp0-$ModelName-eval --t

wdiff --no-common --statistics ./ara.Amiri.exp0-$ModelName-eval.txt $langdata_dir/$Lang/$Lang.evalnew.training_text

tesseract /home/ubuntu/tessdata_arabic/Arabic-TOC.tif /home/ubuntu/tessdata_arabic/Arabic-TOC-ara-Amiri --tessdata-dir ./ara-Amiri-from-Arabic --oem 1 --psm 6 -l ara-Amiri
tesseract /home/ubuntu/tessdata_arabic/Arabic-TOC.png /home/ubuntu/tessdata_arabic/Arabic-TOC-ara-Amiri --tessdata-dir ./ara-Amiri-from-Arabic --oem 1 --psm 6 -l ara-Amiri

tesseract /home/ubuntu/tessdata_arabic/Arabic-TOC-numbers.png /home/ubuntu/tessdata_arabic/Arabic-TOC-numbers-ara-Amiri --tessdata-dir ./ara-Amiri-from-Arabic --oem 1 --psm 6 -l ara-Amiri

Expand Down

0 comments on commit 749c723

Please sign in to comment.