Issues: Unstructured-IO/unstructured
Issues list
feat/custom-metadata
awaiting-response
enhancement
New feature or request
#3079
opened May 22, 2024 by
streamnsight
Set `resolve_entities=False` by default in lxml parser for partition_xml
#3078
opened May 22, 2024 by
MthwRobinson
bug/windows reopen temp file (pdf hi_res)
bug
Something isn't working
#3076
opened May 22, 2024 by
KristianMischke
Unstructured library is unable to extract CDATA from the XML data
bug
Something isn't working
#3075
opened May 22, 2024 by
PhaneendraGunda
Add manual coordinate constraints to `partition_pdf()`.
enhancement
New feature or request
#3072
opened May 22, 2024 by
ChiNoel-osu
Switch `skip_infer_table_types` default to None instead of list
#3063
opened May 21, 2024 by
MthwRobinson
partition_pdf is loading the model at every call
enhancement
New feature or request
#3058
opened May 20, 2024 by
SkanderHellal
feat/Move the category field to Element
enhancement
New feature or request
#3055
opened May 20, 2024 by
hubert-rutkowski85
Deprecate `CheckBox` so that all Element objects are a subclass of Text
#3053
opened May 20, 2024 by
MthwRobinson
ModuleNotFoundError: No module named 'torch._C'
awaiting-response
bug
Something isn't working
#3052
opened May 20, 2024 by
kshirsagaraj
Update Docker images to use Python 3.12
packaging
Issues with building and installing `unstructured`
#3051
opened May 20, 2024 by
MthwRobinson
feat/Extract images in partition_html
enhancement
New feature or request
needs follow up
#3050
opened May 19, 2024 by
jiarongkoh
fix `bson` so MongoDB and AstraDB dependencies are compatible
packaging
Issues with building and installing `unstructured`
#3049
opened May 17, 2024 by
MthwRobinson
bug/element type for non-English languages
bug
Something isn't working
#3044
opened May 17, 2024 by
cm-halfspace
Reenable build for ARM64 images and refactor smoke test as needed
packaging
Issues with building and installing `unstructured`
#3041
opened May 16, 2024 by
MthwRobinson
Redirect sphinx docs pages to docs.unstructured.io
documentation
Improvements or additions to documentation
#3038
opened May 16, 2024 by
MthwRobinson
bug/poor partition output from `ocr_only` strategies with TIFF image file
bug
Something isn't working
image
Issues related to partitioning image formats like PNG, TIFF, etc.
ocr
Related to optical character recognition (OCR).
#3027
opened May 15, 2024 by
yuming-long
Table Title and Table content separate chunks: Merge contents of parent_id and element.id
#3012
opened May 14, 2024 by
weissenbacherpwc
`contains_english_word` is no longer used and is safe to remove
good first issue
#3007
opened May 13, 2024 by
MthwRobinson
`partition_msg` is unable to process attachments
needs follow up
pptx
#3006
opened May 13, 2024 by
MthwRobinson
Problems when parsing Chinese PDF documents
bug
Something isn't working
needs follow up
#2999
opened May 10, 2024 by
WangJiaxin-x
bug/some tables in PDF not getting recognized
awaiting-response
bug
Something isn't working
pdf
#2997
opened May 9, 2024 by
Ritesh1137
feat/ocr_layer_to_pdf
enhancement
New feature or request
ocr
Related to optical character recognition (OCR).
#2991
opened May 8, 2024 by
punjabdhaputar