-
Notifications
You must be signed in to change notification settings - Fork 552
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can't read Data from img #91
Comments
hehehe trying to break captchas my friend ;D Given the original picture: Cleaning You'll need to use imagemagick (or something similar) to clean up the picture noise before sending to tesseract. $ convert -colorspace gray -modulate 120 -contrast-stretch 10%x80% -modulate 140 -gaussian-blur 1 -contrast-stretch 5%x50% +repage -negate -gaussian-blur 4 -negate -modulate 130 original.jpeg clean.jpeg would give you the following image: Recognizing Now pass the clean image to tesseract: echo (new TesseractOCR('clean.jpeg'))->run();
// outputs 655V,3A There is an undesired comma (,) on the output, because the cleaning wasn't 100% perfect. echo (new TesseractOCR('clean.jpeg'))->whitelist(range(0, 9), range('A', 'Z'))->run();
// outputs 655V3A And there you have it ... But I have to tell you, it will not work everytime. So make sure you collect a large number of captchas from this source, build the best cleaning sequence of filters you can, and prepare your code to keep trying new captchas until it succeeds. |
thank you verymuch |
don't read this website verifycode:
http://www.miitbeian.gov.cn/getVerifyCode?73
i try you offer img is OK ,i need you help.please .
you can save this website verify code XXX.JPG
The text was updated successfully, but these errors were encountered: