extract_text produces hexadecimal output #2413
Labels
Has MCVE
A minimal, complete and verifiable example helps a lot to debug / understand feature requests
workflow-text-extraction
From a users perspective, text extraction is the affected feature/workflow
The below code results in what looks like a bunch of hexadecimal. The first page of the pdf is displayed below, I note that I can copy/paste text normally from it (via Google Chrome).
Environment
Which environment were you using when you encountered the problem?
Code + PDF
This is a minimal, complete example that shows the issue:
Share here the PDF file(s) that cause the issue:
kia-stonic-owners-manual-my23.pdf
First page of pdf
top of
text.txt
The text was updated successfully, but these errors were encountered: