Skip to content

Navigation Menu

Explore
For
- Enterprise
- Teams
- Startups
- Education
By Solution
Resources
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

This repository has been archived by the owner on Mar 30, 2022. It is now read-only.

elifesciences / sciencebeam-parser Public archive

Notifications You must be signed in to change notification settings
Fork 33
Star 293

Code
Issues
Pull requests
Actions
Projects 1
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Wiki
Security
Insights

Releases: elifesciences/sciencebeam-parser

Releases · elifesciences/sciencebeam-parser

v0.1.8

03 Mar 10:21

de-code

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

v0.1.8 Latest

Latest

extract tables as images (#517, #519)
various dependency updates

Assets 2

All reactions

v0.1.7

10 Feb 19:56

de-code

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

v0.1.7

added support for remote training data generation (#514)
generate delft data (#510)
various dependency updates

Assets 2

All reactions

v0.1.6

14 Jan 16:45

de-code

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

v0.1.6

output training data files to a directory structure (#502)
generate training data references for the name model, for the reference author names (#501)
generate training data for the name model, for the author names in the header section (#493)
generate training data for the table model (#492)
refactored generate training data (#491)
generate training data for the figure model (#489)
generate training data for the citation model (#488)
added missing fulltext training data ref section (#486)
generate reference-segmenter training data (#485)
generate training data for the fulltext model (#483)
improve generate data tests (#482)
generate affiliation-address training data (#481)
generate header training data (#479)
fixed segmentation acknowledgement training data gen (#477)
generate segmentation training data using model (#476)
added segmentation model training data generator (#472)
added link to usage examples (#471)
various dependency updates

Assets 2

All reactions

v0.1.5

23 Nov 12:48

de-code

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

v0.1.5

fixed asset zip failing with assert on relative path (#469)

Assets 2

All reactions

v0.1.4

19 Nov 16:47

de-code

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

v0.1.4

added Jupyter notebook support (#467)

Assets 2

All reactions

v0.1.3

17 Nov 20:54

de-code

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

v0.1.3

fixed missing xslt and default config in pypi package (#464)
fixed missing setup resources in python package (#463)

Assets 2

All reactions

v0.1.2

16 Nov 19:35

de-code

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

v0.1.2

refactored python api (#459)
improved python library documentation (#458)

Assets 2

All reactions

v0.1.1

15 Nov 14:41

de-code

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

v0.1.1

push pypi package (#457)
download to the user's home directory by default (#455)
moved config to sciencebeam_parser/resources/default_config (#454)
switched to sciencebeam-trainer-delft pypi release (#453)
optionally replace text by cv graphic (#452)
fix multiple affiliation markers (#451)
various dependency updates

Assets 2

All reactions

v0.1.0

08 Nov 17:14

de-code

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

v0.1.0

only preload cv and ocr models if enabled
guess media type if empty
(renamed sciencebeam to sciencebeam-pipelines; redirect to sciencebeam-parser)

Assets 2

All reactions

v0.0.2

08 Nov 11:34

de-code

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

v0.0.2

preload models
add word* support
added /processReferences endpoint
added /processHeaderDocument endpoint
added /convert API endpoint
added jats support
fix graphic coords assert and division by zero
match svg pages with uncommen page dimensions
added ocr config params; configured sparse text ocr psm
fixed ocr no image error with large image
prevent concurrent tesser_api calls
fixed local file path assert
figure label ocr
added libgl1 to docker image
added poppler to docker image
integrate layout parser for cv figure detection
added download_if_url_from_alternatives
fixed unnessary download of gzipped wapiti model
avoid matching graphic svg
use send file for zip with assets to avoid memory issues
implement asset document
implemented figure bounding box baseline
refactored tei document
split aff on other label
added support for config env bool value
made use_first_token_of_block configurable
move formula out of paragraph
improve segmentation fulltext consistency
fixed app features context not passed to some models
use underscore for model config keys
implemented first and last name lookup feature
add author aff markers as delimters
switched default name header model to 0.6.0
improve name header
various dependency updates

Assets 2

All reactions

Previous 1 2 Next

Previous Next

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.