PSM.AUTO_ONLY still includes OCR results #391

Open
olivierpascal opened this issue Jan 5, 2020 · 1 comment
Labels
Dependency Bug Valid bug where fixing is outside the scope of this repo

Comments


olivierpascal commented Jan 5, 2020

Describe the bug
The result of recognize() with PSM.AUTO_ONLY still includes text.
I don't think this is the intended behavior.

To Reproduce
await worker.setParameters({ tessedit_pageseg_mode: PSM.AUTO_ONLY });
const result = await worker.recognize(image);
Returns: { ..., psm: "AUTO_ONLY", oem: "DEFAULT", text: "bla bla" }

Expected behavior
The docs state: "Automatic page segmentation, but no OSD, or OCR."
So I expect { ..., text: null }, but that is not the case.

Desktop (please complete the following information):

  • OS *
  • Browser *
@Balearica
Collaborator

This mode is marked as "not implemented" in Tesseract, and this behavior is consistent with the desktop version. Therefore, apart from clarifying the documentation (or removing this option from the API to avoid confusion), adding this feature is probably outside the scope of this project.
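
Since the mode is not implemented upstream, a caller who only wants the behavior described in the report (no text in the result) could discard the text on their side. The following is a minimal sketch, not part of tesseract.js: `recognizeWithoutText` is a hypothetical helper name, and it assumes the worker exposes async `setParameters()` and `recognize()` as in the reproduction steps, with a result that is either wrapped in `data` or flat like the object shown in the report.

```javascript
// Hypothetical helper (not part of tesseract.js): run recognition in the
// given page segmentation mode, then drop the recognized text.
// Assumption: `worker` exposes async setParameters() and recognize().
// Assumption: the result is either { data: {...} } or a flat object.
async function recognizeWithoutText(worker, image, pageSegMode) {
  await worker.setParameters({ tessedit_pageseg_mode: pageSegMode });
  const result = await worker.recognize(image);
  const data = result && result.data ? result.data : result;
  // Return a copy so the object returned by the worker is left untouched.
  return { ...data, text: null };
}
```

It would be called as `await recognizeWithoutText(worker, image, PSM.AUTO_ONLY)`. This only hides the text from the caller; Tesseract still runs OCR internally, so it does not save any processing time.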

@Balearica Balearica added the Dependency Bug Valid bug where fixing is outside the scope of this repo label Sep 17, 2022
@naptha naptha deleted a comment from Kumar6174 Aug 1, 2023