You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Right now we are extracting just the first time because major of the data is living on other HTML tags that are simple to extract (open graph, twitter, itemprops,...)
I suppose a better effort could be done there to try to extract as much information possible, PRs are very open for this 🙂
Prerequisites
package.json
.Subject of the issue
Hello! Some pages (https://bykvu.com/ru/bukvy/uchenye-nazvali-depressiju-prichinoj-22-opasnyh-zaboleva~/w/readstreams) has multiple ld+json blocks so why metascrapper use only first one?
https://github.com/microlinkhq/metascraper/blob/master/packages/metascraper-helpers/index.js#L230-L231
It's obviously by design, but real life differs from our expectations...
Steps to reproduce
Parse this:
Expected behaviour
Result should contain 'datePublished'
Actual behaviour
Result contains data only from first <script> tag
The text was updated successfully, but these errors were encountered: