Bezoscrape is a simple script that uses Puppeteer to scrape data from an Amazon search results page. When run, the script examines the page at the provided URL to identify products. It then outputs the URLs of the product with the lowest price, the product with the highest rating, and the product with the quickest delivery time.
Follow the steps below to get up and running with Bezoscrape:
- Clone the Repository
  git clone https://github.com/thomasbruketta/bezoscrape.git
- Navigate to the Repository Directory
  cd <repository_directory>
- Install the Required NPM Packages
  npm install
- Run the Application
  To scrape data from a specific Amazon search results page, run the following command:
  node bezoscrape.js <amazon_product_search_url>
  Example search URL: https://www.amazon.com/s?k=roller+skates&crid=L0RRO035G49K&sprefix=roller+skate%2Caps%2C174&ref=nb_sb_noss_1
- Verify the Output
  The script outputs the URLs of the product with the lowest price, the product with the highest rating, and the product with the quickest delivery time. Important: the script runs in a logged-out Amazon session. When validating results, use a logged-out Amazon session as well, because Prime members get expedited shipping on many products and would see different delivery dates.
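
To make the three outputs concrete, here is a minimal sketch of how the selection step could work once products have been scraped. It is not taken from bezoscrape.js: the product shape (`url`, `price`, `rating`, `deliveryDate`), the helper names `pickBest` and `summarize`, and the sample data are all assumptions made for illustration.

```javascript
// Hypothetical product records; the real script's field names may differ.
const sampleProducts = [
  { url: "https://example.com/product-a", price: 49.99, rating: 4.2, deliveryDate: new Date(2024, 0, 12) },
  { url: "https://example.com/product-b", price: 35.5, rating: 4.7, deliveryDate: new Date(2024, 0, 15) },
  // A listing with no visible price is skipped for the price comparison only.
  { url: "https://example.com/product-c", price: null, rating: 3.9, deliveryDate: new Date(2024, 0, 10) },
];

// Return the item whose `key` value wins under `isBetter`,
// ignoring items where that value is missing.
function pickBest(items, key, isBetter) {
  let best = null;
  for (const item of items) {
    const value = item[key];
    if (value === null || value === undefined) continue;
    if (best === null || isBetter(value, best[key])) best = item;
  }
  return best;
}

// Build the three URLs the README describes; null means no candidate was found.
function summarize(items) {
  const lowestPrice = pickBest(items, "price", (a, b) => a < b);
  const highestRating = pickBest(items, "rating", (a, b) => a > b);
  const quickestDelivery = pickBest(
    items,
    "deliveryDate",
    (a, b) => a.getTime() < b.getTime()
  );
  return {
    lowestPrice: lowestPrice ? lowestPrice.url : null,
    highestRating: highestRating ? highestRating.url : null,
    quickestDelivery: quickestDelivery ? quickestDelivery.url : null,
  };
}

console.log(summarize(sampleProducts));
```

Skipping missing values per field, rather than dropping the whole product, lets a listing without a price still compete on rating and delivery time.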
- Add unit tests.
- Refactor and split large methods into smaller, testable ones.
- Address the edge case bug concerning shipping dates set in a future year.
- Introduce more informative error handling.
- Add support for the signed-in Amazon experience.
- Migrate to TypeScript.
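
One item in the list above concerns shipping dates that fall in a future year. The exact bug is not described here, so the following is only a sketch of one plausible fix, assuming the delivery text carries a month and day but no year (for example "Wed, Jan 3"). The function name `parseDeliveryDate` and the text format are assumptions, not the script's actual code.

```javascript
const MONTHS = ["jan", "feb", "mar", "apr", "may", "jun",
                "jul", "aug", "sep", "oct", "nov", "dec"];

// Parse text such as "Wed, Jan 3" into a Date, choosing the year so that
// the result is never earlier than today. `now` is injectable for testing.
function parseDeliveryDate(text, now = new Date()) {
  const match = /\b([A-Za-z]{3})[a-z]*\.?\s+(\d{1,2})\b/.exec(text);
  if (!match) return null;

  const month = MONTHS.indexOf(match[1].toLowerCase());
  if (month === -1) return null;
  const day = Number(match[2]);

  // Compare against midnight today so same-day delivery stays in this year.
  const today = new Date(now.getFullYear(), now.getMonth(), now.getDate());
  let candidate = new Date(today.getFullYear(), month, day);

  // A date that already passed this year must refer to next year.
  if (candidate < today) {
    candidate = new Date(today.getFullYear() + 1, month, day);
  }
  return candidate;
}

// Late December: "Jan 3" should resolve to the following year.
const lateDecember = new Date(2023, 11, 29);
console.log(parseDeliveryDate("Wed, Jan 3", lateDecember).getFullYear());
console.log(parseDeliveryDate("Sat, Dec 30", lateDecember).getFullYear());
```

Without the rollover, a January delivery scraped in late December would be treated as a date almost a year in the past and could wrongly win the quickest-delivery comparison.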