anything-llm/collector/scripts/watch
Timothy Carambat 3e78476739
Franzbischoff document improvements (#241)
* cosmetic changes to be compatible to hadolint

* common configuration for most editors until better plugins comes up

* Changes on PDF metadata, using PyMuPDF (faster and more compatible)

* small changes on other file ingestions in order to try to keep the fields equal

* Lint, review, and review

* fixed unknown chars

* Use PyMuPDF for pdf loading for 200% speed increase
linting

---------

Co-authored-by: Francisco Bischoff <franzbischoff@gmail.com>
Co-authored-by: Francisco Bischoff <984592+franzbischoff@users.noreply.github.com>
2023-09-18 16:21:37 -07:00
convert Franzbischoff document improvements (#241) 2023-09-18 16:21:37 -07:00
__init__.py inital commit 2023-06-03 19:28:07 -07:00
filetypes.py Added mbox support (#106) 2023-06-25 18:11:05 -07:00
main.py Upload and process documents via UI + document processor in docker image (#65) 2023-06-16 16:01:27 -07:00
process_single.py Upload and process documents via UI + document processor in docker image (#65) 2023-06-16 16:01:27 -07:00
utils.py Split large PDFS into subfolder in documents (#176) 2023-08-03 18:57:50 -07:00