1
2
mirror of https://github.com/vimagick/dockerfiles synced 2024-06-28 09:41:20 +00:00
dockerfiles/tesseract/README.md

28 lines
691 B
Markdown
Raw Normal View History

2015-07-16 14:24:25 +00:00
tesseract
=========
![](https://badge.imagelayers.io/vimagick/tesseract:latest.svg)
2016-01-29 16:36:47 +00:00
[Tesseract][1] is an Open Source OCR engine, available under the Apache 2.0
2015-07-16 14:24:25 +00:00
license. It can be used directly, or (for programmers) using an API. It
supports a wide variety of languages.
Tesseract doesn't have a built-in GUI, but there are several available from the
3rdParty page.
Quick Start
-----------
```
2019-12-07 05:00:54 +00:00
$ alias tesseract='docker run --rm -v `pwd`:/data -w /data vimagick/tesseract'
2019-12-07 05:15:29 +00:00
2019-12-07 05:00:54 +00:00
$ tesseract input.png output -l eng --psm 3
$ cat output.txt
2019-12-07 05:15:29 +00:00
The (quick) [brown] {fox} jumps!
2019-12-09 00:33:02 +00:00
$ tesseract chinese.jpg stdout -l chi_tra --psm 8 --oem 0
2019-12-07 05:15:29 +00:00
學習
2015-07-16 14:24:25 +00:00
```
2016-01-29 16:36:47 +00:00
[1]: https://github.com/tesseract-ocr/tesseract