Skip to content
This repository has been archived by the owner on Sep 22, 2021. It is now read-only.

🔎 Scraper for gathering all available nodes for goobox

License

Notifications You must be signed in to change notification settings

GooBox/goobox-nodes-scraper

Repository files navigation

goobox-nodes-scraper

Build Status codecov GPLv3

Scraper for gathering all available nodes for goobox.

Getting started

To run Goobox Nodes Scraper you need previously to install the requirements and you can either use public docker image or build it from sources.

Requirements

  1. Docker: Install it following official docs.

Use public image

You can use public docker image to run the service. E.g. run Storj nodes scraper, collect them and put together into a csv format file:

docker run -v /your/output/dir:/srv/apps/goobox-nodes-service/output goobox/goobox-nodes-scraper:latest scrapy storj_nodes -o output/out.csv -t csv

Build from sources

To build Goobox Nodes Scraper from sources you need to clone this project and build the image.

git clone https://github.com/goobox/goobox-nodes-scraper.git & cd goobox-nodes-scraper
python3.6 make build

Once build is completed you can run the scraper using scrapy command from the entry point.

python3.6 make run scrapy

Help

The entry point has a self-describing help that can be queried.

python3.6 make run -h

Also, each command has its own help.

python3.6 make run scrapy -h

Usage example

To run the scraper for collecting Storj nodes first create a directory to keep the output.

mkdir output

The scraper is going to gather Storj node information, generate a csv file and put it into the previous directory. If you prefer to generate a different kind of export you can use a different format as specified by Scrapy's Feed exports.

python3.6 make run scrapy crawl storj_nodes -o output/out.csv -t csv

Once the scraper has finished you can get the output csv file.

License

GNU GPL v3

Credits

This product includes GeoLite2 data created by MaxMind.

About

🔎 Scraper for gathering all available nodes for goobox

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages