Investigate alternatives to 'extruct' dependency #150

Open
jayaddison opened this issue May 7, 2020 · 3 comments
Comments

@jayaddison
Collaborator

The extruct dependency pulls in lxml, which requires native C compilation at dependency-installation time.

This currently prevents the construction of pure-Python recipe-scrapers container images.
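
For anyone wanting to check which installed packages ship compiled code, here is a rough sketch using the standard library's `importlib.metadata`. The helper names (`native_files`, `distributions_with_native_code`) and the suffix list are assumptions made for illustration, not part of recipe-scrapers or extruct, and matching on file suffix is only a heuristic.

```python
from importlib import metadata

# Suffixes that usually indicate compiled (non-pure-Python) files.
# This list is an assumption; it is a heuristic, not an exhaustive check.
NATIVE_SUFFIXES = (".so", ".pyd", ".dylib", ".dll")


def native_files(paths):
    """Return the paths whose names end with a native-library suffix."""
    return [p for p in paths if str(p).lower().endswith(NATIVE_SUFFIXES)]


def distributions_with_native_code():
    """Map each installed distribution name to its native-looking files."""
    found = {}
    for dist in metadata.distributions():
        files = dist.files or []  # files is None when no file list is recorded
        hits = native_files(files)
        if hits:
            name = dist.metadata.get("Name") or "<unknown>"
            found[name] = [str(h) for h in hits]
    return found


if __name__ == "__main__":
    # Print one line per installed distribution that appears to contain native code.
    for name, hits in sorted(distributions_with_native_code().items()):
        print(f"{name}: {len(hits)} native file(s)")
```

Run in an environment where the project's dependencies are installed, this should list lxml if a compiled build of it is present. An empty result would suggest the environment is pure Python, at least by this heuristic.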

@jayaddison
Collaborator Author

After a bit of investigation, this isn't currently straightforward to do. Perhaps we'll revisit it in the future.

@jayaddison
Collaborator Author

Reopening this after spending a while on #629 debugging issues related to lxml.

@jayaddison
Collaborator Author

Note: https://github.com/gawel/pyquery/ is an alternative HTML-querying engine written in pure Python. It also depends on lxml, but might (?) be lighter-weight.
