spacephotos

Astronomy Picture of the Day (APOD) Web Scraper

Requires BeautifulSoup (http://www.crummy.com/software/BeautifulSoup/)

Downloads every full-resolution photo listed at http://apod.nasa.gov/apod/archivepix.html (more than 5 GB of data as of 8-31-15) and generates a CSV file with metadata.

Photos up to 8-31-15 available here: https://umich.box.com/s/px126wfyx1xtiyis172v85qdi4f92290

The scraper relies on two assumptions:

  • The astronomy photo is always the first image on the page
  • If there's an `<a>` tag surrounding the image, its `href` is the link to the full-resolution image
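
The two assumptions above can be sketched roughly as follows. This is an illustration only, not the scraper's actual code: it swaps in Python's standard-library `html.parser` instead of BeautifulSoup so it runs without extra dependencies, and the names `FirstImageFinder` and `full_res_url` are hypothetical. It takes the first `<img>` on a page, prefers the `href` of an enclosing `<a>` if there is one, and returns `None` when the page has no image at all.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class FirstImageFinder(HTMLParser):
    """Records the first <img> on a page and the <a> wrapping it, if any."""

    def __init__(self):
        super().__init__()
        self._open_links = []   # href of each <a> that is currently open
        self.found = False      # True once the first <img> has been seen
        self.img_src = None     # src of that first <img>
        self.link_href = None   # href of the <a> enclosing it, if any

    def handle_starttag(self, tag, attrs):
        if self.found:
            return  # assumption 1: only the first image matters
        attrs = dict(attrs)
        if tag == "a":
            self._open_links.append(attrs.get("href"))
        elif tag == "img":
            self.found = True
            self.img_src = attrs.get("src")
            if self._open_links:
                # assumption 2: the surrounding link is the full-res image
                self.link_href = self._open_links[-1]

    def handle_endtag(self, tag):
        if tag == "a" and self._open_links and not self.found:
            self._open_links.pop()


def full_res_url(page_html, page_url):
    """Return the absolute URL of the full-resolution image, or None."""
    finder = FirstImageFinder()
    finder.feed(page_html)
    if not finder.found:
        return None  # page has no <img> (e.g. a video day)
    target = finder.link_href or finder.img_src
    return urljoin(page_url, target) if target else None
```

For example, given a page at the illustrative URL `http://apod.nasa.gov/apod/ap150831.html` whose first image is wrapped in `<a href="image/1508/big.jpg">`, the function resolves the relative link against the page URL.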

Roughly 1 out of 7 days has no image and instead features a video, SVG, Java applet, etc. These are not downloaded.