Skip to content

SharadhNaidu/JCrawl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

JCrawl

This repository contains both interfaces for JCrawl:

  • GUI/ -> JavaFX desktop app
  • TUI/ -> Terminal interface

Prerequisites

  • Java 17+
  • Maven 3.9+

Run GUI

From project root:

mvn -f GUI/pom.xml javafx:run

Run TUI

From project root (uses packaged source in TUI/src/jcrawl):

javac -d OUTPUT/TUI/out TUI/src/jcrawl/*.java
java -cp OUTPUT/TUI/out jcrawl.JCrawl --help

Example crawl:

java -cp OUTPUT/TUI/out jcrawl.JCrawl https://example.com --max-depth 0 --threads 2 --delay 50

Output location

By default, both interfaces write crawl output under OUTPUT/:

  • GUI default: OUTPUT/GUI/jcrawl-output
  • TUI default: OUTPUT/TUI/jcrawl-output

More demo commands

TUI quick demo:

java -cp OUTPUT/TUI/out jcrawl.JCrawl https://example.com --max-depth 0 --threads 2 --delay 80

TUI richer demo (more visible activity):

java -cp OUTPUT/TUI/out jcrawl.JCrawl https://books.toscrape.com --max-depth 1 --threads 4 --delay 120

GUI run command:

mvn -f GUI/pom.xml javafx:run

About

webscrapper using JAVA from scratch.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors