Unstructured Cat


A Cheshire Cat AI plugin for document ingestion using the Unstructured library.

Required Linux packages

  • libreoffice
  • python3-opencv
  • libmagic-dev
  • pandoc
  • poppler-utils
  • tesseract-ocr
  • tesseract-ocr-LANG (replace LANG with a language code, e.g. ita, eng)
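
Before enabling the plugin, it can help to confirm that the command-line tools from the packages above are actually on the PATH. The following is a minimal sketch, not part of the plugin: the mapping from package to command name (`soffice`, `pdftotext`, `tesseract`) is an assumption based on what these packages typically install on Debian/Ubuntu, and the helper `missing_packages` is a hypothetical name. It does not check `python3-opencv`, `libmagic-dev`, or the Tesseract language data, since those do not ship a command of their own.

```python
import shutil

# Assumed mapping: package name -> a command it normally installs.
# These command names are assumptions, not taken from the plugin itself.
PACKAGE_COMMANDS = {
    "libreoffice": "soffice",
    "pandoc": "pandoc",
    "poppler-utils": "pdftotext",
    "tesseract-ocr": "tesseract",
}


def missing_packages(which=shutil.which):
    """Return the packages whose command cannot be found on PATH."""
    return [pkg for pkg, cmd in PACKAGE_COMMANDS.items() if which(cmd) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All checked commands are available.")
```

The `which` parameter exists only so the lookup can be swapped out, for example in a test; in normal use, call `missing_packages()` with no arguments.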