Add Project Gezond scraper (Dutch Website) #691

NijeboerFrank · 2022-11-30T19:53:00Z

Thanks for this project! I use it a lot on my Mealie instance.

This PR contains a Dutch recipe site that I use sometimes (Link). Sadly the site did not have a nice way of using the schema nor did the HTML have useful class names, so my implementation might seem a bit hacky.

Feedback is appreciated to improve the implementation!

jayaddison · 2022-12-01T12:39:09Z

Thanks @NijeboerFrank for your contribution! This implementation looks really good to me.

One nitpick I had was about the category field (the newline), and it seems like that's also something the unit tests are complaining about at the moment. Hopefully a quick fixup.

Sadly the site did not have a nice way of using the schema nor did the HTML have useful class names

That's ok - this library is here to help with situations like that :)

Feedback is appreciated to improve the implementation!

One more item I noticed: we could replace .get_text(...) calls with .text attribute accesses. Let me scour for any other improvements as well.

recipe_scrapers/projectgezond.py

tests/test_projectgezond.py

jayaddison · 2022-12-01T17:57:50Z

recipe_scrapers/projectgezond.py

+    def yields(self):
+        # Match everything in the h2 with 'Dit heb je nodig'
+        # The text inside the parentheses contains the yield for the ingredients that are listed
+        return re.search(


One more thing to keep in mind - hopefully not important in this case, but in general - and sorry if I'm explaining things that you understand already, but it's worth being careful to limit what regular expressions can match on, and/or how much input text they are provided as input.

Just something I repeat (no pun intended) at nearly every available opportunity 😄

Thanks for your advice! I'm not really that familiar with regular expressions, so all tips are appreciated 😄

jayaddison · 2022-12-01T17:59:03Z

This looks good to me - thanks again @NijeboerFrank. I'll plan to merge and release this in the nearish future (possibly not today, but should be within the next few days otherwise).

Add Project Gezond

3b8531c

jayaddison reviewed Dec 1, 2022

View reviewed changes

recipe_scrapers/projectgezond.py Outdated Show resolved Hide resolved

Fix test faillures and implement PR feedback

58a2bda

jayaddison reviewed Dec 1, 2022

View reviewed changes

tests/test_projectgezond.py Outdated Show resolved Hide resolved

Change category to string

dca35fe

jayaddison reviewed Dec 1, 2022

View reviewed changes

jayaddison merged commit 607ab04 into hhursev:main Dec 6, 2022

jayaddison pushed a commit that referenced this pull request Dec 16, 2022

Add Project Gezond scraper (Dutch Website) (#691)

7410df3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Project Gezond scraper (Dutch Website) #691

Add Project Gezond scraper (Dutch Website) #691

NijeboerFrank commented Nov 30, 2022

jayaddison commented Dec 1, 2022

jayaddison Dec 1, 2022

NijeboerFrank Dec 1, 2022

jayaddison commented Dec 1, 2022

Add Project Gezond scraper (Dutch Website) #691

Add Project Gezond scraper (Dutch Website) #691

Conversation

NijeboerFrank commented Nov 30, 2022

jayaddison commented Dec 1, 2022

jayaddison Dec 1, 2022

Choose a reason for hiding this comment

NijeboerFrank Dec 1, 2022

Choose a reason for hiding this comment

jayaddison commented Dec 1, 2022