New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unbound prefixes not handled #1341
Comments
Is this something that will be addressed anytime soon? |
Hi, we are a student group and we would like to fix this bug. Can't guarantee that we are able to fix it but we would like to have a try. |
Hi @SimonSchmid. I am an undergraduate student. One of my courses this semester related to Software Engineering requires us to fix issues on Github. I can understand the first case,
I now understand what you want for the second case from the link you provided https://html.spec.whatwg.org/#coercing-an-html-dom-into-an-infoset. You may want to search the attribute by the key "xlinkU00003Ahref" rather than "xlink:href". Please take a look at PR #1682. |
Hello,
I want to report an issue I am having with jsoup. I have not found a similar issue, so I am creating a new one.
I created a toy example that illustrates the issue:
This webpage contains two unbound prefixes, one in within a tag and one within an attribute. Jsoup does not handle these according to https://html.spec.whatwg.org/#creating-and-inserting-nodes and https://html.spec.whatwg.org/#coercing-an-html-dom-into-an-infoset. There it says, the first case (tag) should be handled as follows:
<test:h1>
becomes<testU00003Ah1>
. The second case is handled by adding thexlink
namespace to the html tag.Without the unbound prefixes being fixed, I have issues using XPath. It would be nice if jsoup handles such cases.
Regards,
Simon
The text was updated successfully, but these errors were encountered: