Seattle temperature data incorrect? #42

Closed
jakevdp opened this issue Aug 25, 2018 · 7 comments

@jakevdp
Contributor

jakevdp commented Aug 25, 2018

The seattle-temps dataset claims that the temperature in Seattle never rose above 76 degrees F in 2010:

[Figure: visualization of the seattle-temps dataset, with a Vega Editor link]

As I remember it, the temperature got much hotter than that. More reliable sources agree; for example, Weather Underground reports that Seattle hit 96 degrees F in August 2010: https://www.wunderground.com/history/monthly/us/wa/seattle/KSEA/date/2010-8

Perhaps the dataset is mislabeled?
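
For anyone who wants to reproduce the check, here is a minimal sketch of finding the highest reading in a temperature CSV. It assumes the file has `date` and `temp` columns; the sample rows and the `max_temperature` helper are made up for illustration and are not taken from the actual dataset.

```python
import csv
import io

# Hypothetical sample in the assumed "date,temp" layout; not real data.
SAMPLE_CSV = """date,temp
2010/01/01 00:00,39.4
2010/08/14 16:00,75.9
2010/12/31 23:00,36.1
"""


def max_temperature(csv_text, temp_field="temp"):
    """Return (date, temp) for the row with the highest temperature."""
    rows = csv.DictReader(io.StringIO(csv_text))
    best = max(rows, key=lambda row: float(row[temp_field]))
    return best["date"], float(best[temp_field])


date, temp = max_temperature(SAMPLE_CSV)
print(f"Highest reading: {temp} F on {date}")
```

Running the same function on the real file contents would show whether the maximum really sits in the mid-70s.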

@domoritz
Member

That's a very old dataset that came from Vega, if I remember correctly. @arvind will know.

@domoritz
Member

Ping @arvind. Let's make sure to credit the source of this dataset or update it with data from NOAA.

@RandomFractals

yeah, we might need to list || ref all your raw data sources ... #15

I've looked at all of them over the span of a year while playing with Vega specs, and Seattle temps is not the only questionable dataset ...

@eitanlees
Collaborator

I think I figured this issue out!

The numbers aren't direct temperature measurements but rather 30-year averages.

From the NOAA Hourly Normals documentation:

The 1981-2010 Normals comprise all climate normals using the thirty year period of temperature, degree days, precipitation, snowfall, snow depth, wind, etc. Data is organized into hourly, daily, monthly, seasonal and annual normals. This document describes the elements and layout of the Hourly Normals which are derived from a composite of climate records from numerous sources that were merged and then subjected to a suite of quality assurance reviews.

The hourly normals provide a suite of descriptive statistics based on hourly observations at a few hundred stations from across the United States and its Pacific territories. Statistics are provided as 30-year averages, frequencies of occurrence, and percentiles for each hour and day of the year. These products are useful in examination of the diurnal change of a particular variable.
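
To illustrate why averaged values never reach the observed extremes, here is a rough sketch that averages made-up readings across years for the same hour and day of the year. The `hourly_normals` function is a simplification for illustration only; NOAA's actual normals go through more processing than a plain mean, and the three readings below are invented.

```python
from collections import defaultdict
from statistics import mean


def hourly_normals(observations):
    """Average readings across years for each (month, day, hour) slot.

    observations: iterable of (year, month, day, hour, temp_f) tuples.
    """
    by_slot = defaultdict(list)
    for year, month, day, hour, temp in observations:
        by_slot[(month, day, hour)].append(temp)
    return {slot: mean(temps) for slot, temps in by_slot.items()}


# Made-up readings for 4 PM on August 14 in three different years.
observations = [
    (2008, 8, 14, 16, 70.0),
    (2009, 8, 14, 16, 72.0),
    (2010, 8, 14, 16, 95.0),
]

normals = hourly_normals(observations)
hottest_observed = max(temp for *_, temp in observations)
print(normals[(8, 14, 16)], hottest_observed)
```

A single hot afternoon gets pulled toward the long-run mean, which is consistent with a dataset of normals topping out well below the hottest day on record.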

I downloaded the data for the Seattle Tacoma International Airport station.

[Screenshot: the downloaded data for the Seattle Tacoma International Airport station]

The numbers are a little different from the dataset originally uploaded here. They are similar though!

I could clean up this new data and replace the current seattle-temps if you think that would be useful.
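
One way to put a number on "a little different" would be to compare the two series on their shared timestamps. This is only a sketch: the `compare_series` helper, the timestamp keys, and the values are all hypothetical and do not come from either dataset.

```python
def compare_series(old, new):
    """Summarize absolute differences between two {timestamp: temp} mappings."""
    shared = sorted(set(old) & set(new))
    if not shared:
        raise ValueError("no overlapping timestamps")
    diffs = [abs(old[key] - new[key]) for key in shared]
    return {
        "n": len(diffs),
        "mean_abs_diff": sum(diffs) / len(diffs),
        "max_abs_diff": max(diffs),
    }


# Invented values keyed by "MM-DD HH"; only the first two keys overlap.
old_temps = {"01-01 00": 39.0, "01-01 01": 38.5, "01-01 02": 38.0}
new_temps = {"01-01 00": 39.5, "01-01 01": 38.5, "01-01 03": 37.0}

summary = compare_series(old_temps, new_temps)
print(summary)
```

A small mean difference with no large outliers would support the idea that both files are hourly normals from a similar source.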

@domoritz
Member

Thank you. I think that having data with clear provenance would be good. @arvind are you okay with updating the dataset?

@eitanlees Thank you for your efforts to update the datasets in this repo. Let me add you as a collaborator.

@arvind
Member

arvind commented Dec 17, 2019

Sounds good to me as I don’t remember the original provenance! Thanks @eitanlees!

@domoritz
Member

Excellent. @eitanlees can you send a pull request?
