1
2
mirror of https://github.com/vimagick/dockerfiles synced 2024-06-29 18:21:24 +00:00
dockerfiles/tesseract
ImgBotApp 1c434b2d2e
[ImgBot] Optimize images
*Total -- 181.18kb -> 152.26kb (15.96%)

/krakend/data/krakend.png -- 32.27kb -> 23.93kb (25.84%)
/node-red/screenshot.png -- 80.86kb -> 61.85kb (23.51%)
/rtmp/server/html/img/cctv.jpg -- 60.20kb -> 58.80kb (2.33%)
/tesseract/data/chinese.jpg -- 7.84kb -> 7.67kb (2.19%)

Signed-off-by: ImgBotApp <ImgBotHelp@gmail.com>
2020-08-26 02:52:07 +00:00
..
data [ImgBot] Optimize images 2020-08-26 02:52:07 +00:00
docker-compose.yml update tesseract 2019-12-07 13:15:29 +08:00
Dockerfile update tesseract 2019-12-07 13:15:29 +08:00
README.md update ludwig 2019-12-09 08:33:02 +08:00

tesseract

Tesseract is an Open Source OCR engine, available under the Apache 2.0 license. It can be used directly, or (for programmers) using an API. It supports a wide variety of languages.

Tesseract doesn't have a built-in GUI, but there are several available from the 3rdParty page.

Quick Start

$ alias tesseract='docker run --rm -v `pwd`:/data -w /data vimagick/tesseract'

$ tesseract input.png output -l eng --psm 3
$ cat output.txt
The (quick) [brown] {fox} jumps!

$ tesseract chinese.jpg stdout -l chi_tra --psm 8 --oem 0
學習