Bezoscrape: An Amazon Search Page Scraper Script

Overview

Bezoscrape is a simple script that uses Puppeteer to scrape data from any Amazon search results page. When run, the script examines the page at the provided URL to identify products. It then outputs the URL of the product with the lowest price, the product with the highest rating, and the product with the quickest delivery time.
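
The selection step described above can be sketched as follows. This is a minimal illustration, not the script's actual code: the product shape (`url`, `price`, `rating`, `deliveryDate`) and the function names `pickBest` and `summarize` are assumptions, and the sketch starts from products that have already been extracted from the page.

```javascript
// Hypothetical product shape: { url, price, rating, deliveryDate }.
// A field is null when the listing did not show that value.

// Returns the product whose `key` value wins under `isBetter`,
// ignoring products that have no value for that key.
function pickBest(products, key, isBetter) {
  let best = null;
  for (const product of products) {
    if (product[key] == null) continue; // skip listings missing this field
    if (best === null || isBetter(product[key], best[key])) {
      best = product;
    }
  }
  return best;
}

// Builds the three URLs the overview describes; a URL is null
// when no product carried the relevant field.
function summarize(products) {
  const cheapest = pickBest(products, 'price', (a, b) => a < b);
  const topRated = pickBest(products, 'rating', (a, b) => a > b);
  const fastest = pickBest(
    products,
    'deliveryDate',
    (a, b) => a.getTime() < b.getTime()
  );
  return {
    lowestPriceUrl: cheapest ? cheapest.url : null,
    highestRatingUrl: topRated ? topRated.url : null,
    quickestDeliveryUrl: fastest ? fastest.url : null,
  };
}
```

Skipping null fields keeps a listing without a visible price or rating from winning a category by accident. Ties go to the product that appears first on the page.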

Getting Started

Follow the steps below to get up and running with Bezoscrape:

  1. Clone the Repository
git clone https://github.com/thomasbruketta/bezoscrape.git
  2. Navigate to the Repository Directory
cd bezoscrape
  3. Install the Required NPM Packages
npm install
  4. Run the Application

    To scrape data from a specific Amazon search results page, run the following command. Quote the URL so that the shell does not interpret its & characters.

node bezoscrape.js "<amazon_product_search_url>"

    Example search URL: https://www.amazon.com/s?k=roller+skates&crid=L0RRO035G49K&sprefix=roller+skate%2Caps%2C174&ref=nb_sb_noss_1
  5. Verify the Output

    The script outputs the URL of the product with the lowest price, the product with the highest rating, and the product with the quickest delivery time. Important: the script runs in a logged-out Amazon session. When validating results, compare against Amazon while logged out, because Prime members get expedited shipping on many products and would see different delivery times.
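
Because the script takes its input as a single command-line argument, it can help to check that argument before launching a browser. The sketch below is one way to do that, not the script's actual behavior: `parseSearchUrl` is a hypothetical helper, and the rule it applies (an amazon.com host, the `/s` path, and a `k` query parameter, as in the example URL above) is an assumption about what counts as a search URL.

```javascript
// Hypothetical helper: checks that a CLI argument looks like an
// Amazon search results URL and returns it as a URL object.
function parseSearchUrl(arg) {
  let url;
  try {
    url = new URL(arg); // throws on missing or malformed input
  } catch {
    throw new Error(`Not a valid URL: ${arg}`);
  }

  const isAmazonHost =
    url.hostname === 'amazon.com' || url.hostname.endsWith('.amazon.com');

  // Assumption: search pages live at /s and carry the query in `k`.
  if (!isAmazonHost || url.pathname !== '/s' || !url.searchParams.has('k')) {
    throw new Error('Expected an Amazon search results URL');
  }
  return url;
}
```

In a script, this would typically be called as `parseSearchUrl(process.argv[2])`. If the URL was not quoted in the shell, everything after the first `&` is lost before Node sees it, so the search may run with fewer parameters than intended.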

TODO:

  • Add unit tests.
  • Refactor and split large methods into smaller, testable ones.
  • Address the edge case bug concerning shipping dates set in a future year.
  • Introduce more informative error handling.
  • Add support for the signed-in Amazon experience.
  • Migrate to TypeScript.
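
One plausible reading of the future-year shipping date item is that a delivery label without a year, such as "Jan 3" seen in late December, gets resolved to the current year and so appears to be in the past. The sketch below shows one way to handle that under this assumption. The helper name `resolveDeliveryDate`, the label format, and the English month abbreviations are all hypothetical; the script's actual parsing is not shown in this README.

```javascript
const MONTHS = [
  'jan', 'feb', 'mar', 'apr', 'may', 'jun',
  'jul', 'aug', 'sep', 'oct', 'nov', 'dec',
];

// Hypothetical helper: resolves a year-less delivery label such as
// "Wed, Jan 3" against a reference date (normally the current date).
// Returns a Date, or null when no month and day can be found.
function resolveDeliveryDate(label, reference) {
  const match = /([A-Za-z]{3})[a-z]*\.?\s+(\d{1,2})/.exec(label);
  if (!match) return null;

  const month = MONTHS.indexOf(match[1].toLowerCase());
  if (month === -1) return null;
  const day = Number(match[2]);

  const year = reference.getFullYear();
  const startOfToday = new Date(year, reference.getMonth(), reference.getDate());

  let date = new Date(year, month, day);
  // A delivery estimate cannot be before today, so a date that
  // lands in the past must belong to the following year.
  if (date < startOfToday) {
    date = new Date(year + 1, month, day);
  }
  return date;
}
```

Passing the reference date as a parameter, instead of calling `new Date()` inside the function, makes the year rollover straightforward to cover in the unit tests that the list above also calls for.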

New Features Section

Test Push for Release Note AI

---
Test commit 1.
---
Test commit 2.
