Explanation
To reconstruct a document (for example, as an HTML page), the extracted text should contain a tag such as [tagimage]1[/tagimage] at the position where each image was found.
In this example, 1 is the index of the image in page.images.
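To make the request concrete, here is a minimal sketch of the interleaving step, assuming the text fragments and images of a page are already available with a vertical position. `PageItem` and `text_with_image_tags` are hypothetical names used only for illustration and are not part of pypdf; how the positions would be collected (for example through pypdf's text-extraction visitor hooks) is left open.

```python
from dataclasses import dataclass


@dataclass
class PageItem:
    """One positioned piece of page content (hypothetical helper type)."""
    y: float               # vertical position; PDF y grows upward
    text: str = ""         # text content, empty for images
    image_index: int = -1  # index into page.images, -1 for text items


def text_with_image_tags(items):
    """Join text and image placeholders in top-to-bottom order."""
    lines = []
    # Larger y is higher on the page, so sort by descending y.
    for item in sorted(items, key=lambda it: -it.y):
        if item.image_index >= 0:
            lines.append(f"[tagimage]{item.image_index}[/tagimage]")
        else:
            lines.append(item.text)
    return "\n".join(lines)


# Made-up positions, deliberately out of order.
items = [
    PageItem(y=400, image_index=1),
    PageItem(y=700, text="some text"),
    PageItem(y=500, text="other text"),
    PageItem(y=600, image_index=0),
]
tagged = text_with_image_tags(items)
print(tagged)
```

Sorting by a single coordinate only works for simple single-column pages; a real implementation would probably need to handle columns and images placed beside text.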
Code Example
How would your feature be used? (Remove this if it is not applicable.)
from pypdf import PdfReader, PdfWriter
... # your new feature in action!
print(page.extract_text(withTags=1))
Results:
some text
[tagimage]0[/tagimage]
other text
[tagimage]1[/tagimage]
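As an illustration of how such tagged output could be consumed, the sketch below turns it into a simple HTML fragment. It assumes the images from page.images have already been saved and that their URLs are passed in the same order; `tagged_text_to_html` and the file names are hypothetical.

```python
import html
import re

# Matches a line consisting solely of one image tag, e.g. [tagimage]1[/tagimage]
TAG_PATTERN = re.compile(r"\[tagimage\](\d+)\[/tagimage\]")


def tagged_text_to_html(tagged_text, image_urls):
    """Convert tagged text into <p> and <img> elements, line by line."""
    parts = []
    for line in tagged_text.splitlines():
        stripped = line.strip()
        match = TAG_PATTERN.fullmatch(stripped)
        if match:
            # The tag number is used as an index into the saved images.
            index = int(match.group(1))
            url = html.escape(image_urls[index], quote=True)
            parts.append(f'<img src="{url}">')
        elif stripped:
            parts.append(f"<p>{html.escape(stripped)}</p>")
    return "\n".join(parts)


tagged = "some text\n[tagimage]0[/tagimage]\nother text\n[tagimage]1[/tagimage]"
page_html = tagged_text_to_html(tagged, ["img0.png", "img1.png"])
print(page_html)
```

Because the tag carries the index into page.images, the consumer needs no position information of its own; the order of the lines in the extracted text is enough.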