I will attempt to document all notable changes to this project in this file. I did not keep a changelog for pre-1.0 releases. Apologies.
The format is based on Keep a Changelog.
- Method
splitPages()
- Updated from PDFBox 2.0.25 to 2.0.30
- Fix
sanitize()
so it no longer fails on PDFs without a documentTree, metadata, etc.
- Methods
getEmbeddedFiles()
,hasEmbeddedSearchIndex()
,getDocumentOutlineTitles()
, andremoveBookmarks()
- More tests
- Updated from PDFBox 2.0.19 to 2.0.25
sanitize()
now also removes the document outline (bookmarks)
- Methods
getVersion()
andgetAcroForm()
- TestBox as dev dependency
- Basic testing via TestBox in
/tests
server.json
for testing.gitignore
- A changelog
- Method
listXFAElements()
- When
getText()
encounters a PDF with issues that prevent text extraction, and error is logged but not thrown, and an empty string is returned. Resolves issue #2.