Name		Name	Last commit message	Last commit date
parent directory ..
data		data
extract		extract
transform		transform
README.md		README.md
package.json		package.json

README.md

October 2019 - Visualize the Jump-Scares for over 500 horror, thriller, and sci-fi Movies

Reddit contest page

The code in this directory scrapes the data source website and stores the extracted data in a suitable format for analysis.

How to use

Setup

npm install

Run the whole ETL

The following command extracts data from the online database and run the data transformation scripts:

npm start

The following is executed:

download the list of movies
download extra metadata from each movie page
download all jump-scare timing subtitle files
process the subtitle files in a single JSON file containing timestamps of all jump-scares (both minor and major ones)

Results are saved in the directory data.

Run individual steps

The following data can be extracted from wheresthejump.com

Download all subtitle files

Download the .srt files announcing jump scares for the all the movies referenced in the website.

npm run extract-subtitles

The subtitles are placed in the directory data/subtitles.

Download the list of movies

Download the list of movies listed in https://wheresthejump.com/full-movie-list/, together with associated metadata:

Director
Year
Jump count
Jump Scare rating
Netflix (US)
Imdb rating

npm run extract-subtitles

The results are saved in a JSON file: data/moviesList.json.

Build a timeline of all jump scare in all movies

This transformation script processes all the downloaded subtitles files and outputs a single file containing the jump-scare timestamps of all movies.

npm run build-jumpscare-timeline

(!) node.js v12+ is required for this transformation to work, since String.prototype.matchAll() is used. Can be replaced by Regexp.exec in order to run on older versions of node.js (c.f. MDN Article.

The results are saved in a JSON file: data/jumpScareTimeline.json.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

10-the-jump-scares

10-the-jump-scares

data

data

extract

extract

transform

transform

README.md

README.md

package.json

package.json

README.md

October 2019 - Visualize the Jump-Scares for over 500 horror, thriller, and sci-fi Movies

How to use

Setup

Run the whole ETL

Run individual steps

Download all subtitle files

Download the list of movies

Build a timeline of all jump scare in all movies

Files

10-the-jump-scares

Directory actions

More options

Directory actions

More options

Latest commit

History

10-the-jump-scares

Folders and files

parent directory

October 2019 - Visualize the Jump-Scares for over 500 horror, thriller, and sci-fi Movies

How to use

Setup

Run the whole ETL

Run individual steps

Download all subtitle files

Download the list of movies

Build a timeline of all jump scare in all movies