Process batch files from memory #224

Closed
itumeleng-mabote opened this issue Aug 13, 2019 · 4 comments

Comments

@itumeleng-mabote

itumeleng-mabote commented Aug 13, 2019

Is there a way I could batch-feed files held in memory to tesseract to speed it up?

@bozhodimitrov
Collaborator

You can use tesseract's option for a list of files. Just pass a string with the path to a text file that contains the list of images. Check the pytesseract docs for more info and an example.
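A minimal sketch of the list-file approach described above. The helper names `write_image_list` and `ocr_batch` are hypothetical, used only for illustration, as is the temporary `images.txt` file. The sketch assumes the list file holds one image path per line and that both pytesseract and the tesseract binary are installed.

```python
import tempfile
from pathlib import Path


def write_image_list(image_paths, list_path):
    """Write one image path per line to a plain-text list file."""
    list_path = Path(list_path)
    lines = [str(p) for p in image_paths]
    list_path.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return list_path


def ocr_batch(image_paths):
    """OCR several image files with a single tesseract invocation."""
    # Imported lazily so the helper above works without pytesseract installed.
    import pytesseract

    with tempfile.TemporaryDirectory() as tmp:
        list_file = write_image_list(image_paths, Path(tmp) / "images.txt")
        # Passing a string makes pytesseract hand the path to tesseract,
        # which reads the image paths from the text file.
        return pytesseract.image_to_string(str(list_file))
```

Note that the images themselves still have to exist on disk. Only the list of paths is built in Python.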

@itumeleng-mabote
Author

I was hoping to do everything in memory. Is that possible?

@bozhodimitrov
Collaborator

bozhodimitrov commented Aug 14, 2019

pytesseract is just a wrapper around the standalone C++ tesseract program, so it doesn't support processing in-memory data directly (via pipe/IPC). If you have the files in Python memory, pytesseract will save each one as a file on disk before passing it to tesseract.
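To illustrate what this means in practice, here is a small sketch under stated assumptions. The function name `ocr_in_memory` and its `ocr` parameter are hypothetical. The code assumes the images are already decoded objects that `pytesseract.image_to_string` accepts, such as PIL images. It runs from the caller's point of view without any explicit file handling, but per the explanation above, each image is still written to a temporary file behind the scenes.

```python
def ocr_in_memory(images, ocr=None):
    """Run OCR on a sequence of in-memory images, one call per image.

    `ocr` is any callable that maps an image to text. It defaults to
    pytesseract.image_to_string, which saves each image to a temporary
    file on disk and then calls the tesseract program on that file.
    """
    if ocr is None:
        # Imported lazily so a stand-in callable can be used for testing.
        import pytesseract

        ocr = pytesseract.image_to_string
    return [ocr(image) for image in images]
```

So this does not avoid disk I/O. It only hides it from the calling code.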

There is a related open issue: #172.

@bozhodimitrov
Collaborator

Referencing #172 and closing this as a duplicate in favor of it.
