๐ If you like this project, give it a โญ on GitHub โ it helps more developers discover VisionFusion!
By Yabix โ AI-Powered Data Intelligence Platform
dynamic_ocr_qr/
โ
โโโ app/
โ โโโ __init__.py
โ โโโ ocr_dyn.py # OCR + Gemini JSON extractor (AI-based text & info extraction)
โ โโโ qr_dyn.py # Advanced QR detection & cleaning (multi-engine)
โ โโโ mix_ocr_qr.py # Merge OCR and QR data dynamically into unified JSON
โ โโโ scrap.py # Intelligent web scraper + Gemini web analyzer + translation
โ โโโ excel_mode.py # Excel-based enrichment, merging & deduplication
โ โโโ final_mix.py # Final integration: JSON + Excel into clean master dataset
โ
โโโ streamlit_app/
โ โโโ app2.py # Streamlit-based web interface for interactive use
โ
โ
โโโ requirements.txt # All Python dependencies
โโโ .gitignore # Ignore caches, outputs, and private keys
โโโ LICENSE # Proprietary License
โโโ README.md # (This file)
Dynamic OCR + QR Intelligence Suite by Yabix is an AI-powered document automation system that processes images, PDFs, and web data โ combining Optical Character Recognition (OCR), QR code extraction, and AI-driven website analysis into one intelligent workflow.
It automatically extracts structured data from scanned documents, decodes embedded QR links, enriches the data by crawling company websites, and delivers a clean, bilingual Excel output.
AI-based OCR using Google Gemini SDK to extract structured JSON data fields from scanned documents.
- Converts PDFs to images using
pdf2image - Sends image data to Gemini (
gemini-2.5-flash) - Extracts addresses, phones, company names, persons, and text
- Ensures schema-valid JSON output
๐ฆ Output: gemini_output.json
Multi-engine QR code decoder with deep enhancement and smart URL validation.
- Uses OpenCV, pyzbar, and ZXing for detection
- Enhances images using CLAHE, sharpening, adaptive thresholding
- Supports rotated / low-quality / PDF-embedded QR codes
- Extracts and cleans URLs or vCard fields
๐ฆ Outputs:
- Raw:
final_superqr_v6_raw.json - Cleaned:
final_superqr_v6_clean.json
Merges OCR and QR results into a single structured dataset.
- Aligns per-file/page results intelligently
- Merges OCR JSONs with corresponding QR URLs
- Handles PDFs with multiple pages
๐ฆ Output: mix_ocr_qr.json
Web crawler + Gemini-based AI analyzer + translation engine.
- Extracts URLs from
mix_ocr_qr.json - Crawls websites (multi-threaded) up to configurable depth
- Cleans HTML, extracts business info using Gemini
- Translates English fields to Persian (ENโFA)
- Smart SSL handling for domains
- Generates bilingual Excel report
๐ฆ Outputs:
urls_clean.jsongemini_scrap_output.jsonweb_analysis.xlsx
AI-driven Excel enrichment mode for bulk processing company URLs.
- Reads input Excel file containing URLs
- Crawls and analyzes content with Gemini
- Translates and merges into enriched dataset
- Handles duplicate merging and normalization
๐ฆ Output: output_enriched_<timestamp>.xlsx
Final-stage integration for unified data output.
- Merges JSON & Excel datasets
- Deduplicates by domain, phone, email, or company name
- Normalizes Persian and English content
- Generates clean BI-ready Excel export
๐ฆ Output: merged_final_<timestamp>.xlsx
Modern Streamlit Web Interface for interactive document automation.
- Upload images, PDFs, or Excel files
- Run full OCR/QR/Web pipeline directly in browser
- Monitor logs and processing in real-time
- View and download structured results
git clone https://github.com/<your-username>/dynamic_ocr_qr.git
cd dynamic_ocr_qrpython -m venv venv
source venv/bin/activate # macOS/Linux
venv\Scripts\activate # Windowspip install -r requirements.txt-
Install Poppler for Windows โ Poppler Releases
-
Set environment variable:
setx POPPLER_PATH "C:\poppler\Library\bin"
Set your Google Gemini API key:
setx API_KEY "AIzaSyXXXXXX"python app/ocr_dyn.py
python app/qr_dyn.py
python app/mix_ocr_qr.py
python app/scrap.py
python app/final_mix.pystreamlit run streamlit_app/app2.py| Stage | File | Description |
|---|---|---|
| OCR | gemini_output.json |
AI-extracted structured text |
| QR | final_superqr_v6_clean.json |
Clean URLs from QR |
| Merge | mix_ocr_qr.json |
Unified JSON (OCR + QR) |
| Web | web_analysis.xlsx |
Web-enriched bilingual company data |
| Final | merged_final_*.xlsx |
Master deduplicated dataset |
- ๐ง Google Gemini SDK (
google-genai) - ๐ผ OpenCV, Pillow, Pyzbar, Pyzxing, Pdf2Image
- ๐ Requests, BeautifulSoup4
- ๐งพ Pandas, OpenPyXL
- ๐ Streamlit
- ๐งฎ Multithreading & Smart SSL Handler
- Integrated Web Dashboard (with job tracking)
- Real-time Streamlit pipeline visualization
- Supabase / Firestore integration
- Auto-export to Google Sheets
This project is licensed under a Proprietary License. All rights reserved ยฉ 2025 Yabix. Unauthorized copying, modification, or redistribution of any part of this software is strictly prohibited. For commercial or enterprise licensing, please contact: ๐ง [email protected]
Intelligent Document Automation | AI-Powered Data Extraction | Web Intelligence

