<!DOCTYPE html SYSTEM "about:legacy-compat"><html><head><META http-equiv="Content-Type" content="text/html; charset=UTF-8"></head><body name="" style="color: red"><p hnh="2">unicode attr names</p></body></html>
This is caused by W3CDOM.java#L346, which hard-codes the syntax to xml. It can be easily fixed by checking the doctype of the output document and using that as the syntax.
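A minimal sketch of the suggested approach, using only the JDK's DOM classes. This is not the actual jsoup patch: `SyntaxFromDoctype`, `syntaxFor`, `validKey`, and `newDocument` are hypothetical names, and the two character rules are assumptions chosen for illustration, not copied from jsoup.

```java
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.DOMImplementation;
import org.w3c.dom.Document;
import org.w3c.dom.DocumentType;

class SyntaxFromDoctype {
    enum Syntax { HTML, XML }

    // Hypothetical helper: pick the syntax from the output document's doctype
    // instead of hard-coding XML. A doctype named "html" is treated as HTML.
    static Syntax syntaxFor(Document doc) {
        DocumentType doctype = doc.getDoctype();
        boolean isHtml = doctype != null && "html".equalsIgnoreCase(doctype.getName());
        return isHtml ? Syntax.HTML : Syntax.XML;
    }

    // Illustrative (assumed) rules: XML keeps only ASCII name characters,
    // HTML only strips control characters, space, quotes, '/' and '='.
    static String validKey(String key, Syntax syntax) {
        String invalid = syntax == Syntax.XML
            ? "[^-a-zA-Z0-9_:.]"
            : "[\\x00-\\x1f\\x7f-\\x9f \"'/=]";
        return key.replaceAll(invalid, "");
    }

    // Builds an empty W3C document, optionally with an HTML doctype.
    static Document newDocument(boolean withHtmlDoctype) throws Exception {
        DOMImplementation impl = DocumentBuilderFactory.newInstance()
            .newDocumentBuilder().getDOMImplementation();
        DocumentType doctype = withHtmlDoctype
            ? impl.createDocumentType("html", null, "about:legacy-compat")
            : null;
        return impl.createDocument(null, "html", doctype);
    }

    public static void main(String[] args) throws Exception {
        String key = "h\u00ecnh"; // a non-ASCII attribute name
        Syntax html = syntaxFor(newDocument(true));
        Syntax xml = syntaxFor(newDocument(false));
        System.out.println(html + " -> " + validKey(key, html));
        System.out.println(xml + " -> " + validKey(key, xml));
    }
}
```

Under these assumed rules, the XML branch drops the non-ASCII character from the name, which matches the `hnh` attribute in the output above, while the HTML branch leaves the name unchanged.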
Hi @jhy. My sincere apologies. I have seen your response. I'm at a company conference this week and have been struggling to find time to work on your suggestions. I'll try to push an update this week; if not, next week for sure.
When parsing and converting an html document, the syntax was hard-coded to xml. This PR checks the document type of the output document and uses that to determine which attributes are valid.
Co-authored-by: jhy <jonathan@hedley.net>
Fixes #1647
Consider the following html document:
Using v1.14.2 and running the following code:
Results in: