Skip to content

A Nodejs crawler that crawls a government website and takes out data of latest tenders passed along with their dates and contractor (if any)

Notifications You must be signed in to change notification settings

thisisyashgarg/nodejs-crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NodeJS Crawler (using Puppeteer)

This code imports puppeteer for web scraping. It creates a schema with date, subject and contractor and pushes it into mainData array. It also writes data from mainData array to a json file.

Prerequisites

  • Node
  • Puppeteer

Getting Started

  1. Clone this repo
git clone https://github.com/thisisyashgarg/nodejs-crawler.git 
  1. Install npm
npm init -y
  1. Install dependencies
npm i puppeteer 
npm i dotenv 
  1. Finally, run the code
node index.js 

Output

The output of this program will be a JSON file containing arrays of objects, each with the data of date, subject and contractors.

About

A Nodejs crawler that crawls a government website and takes out data of latest tenders passed along with their dates and contractor (if any)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published