anything-llm/collector/scripts
Timothy Carambat 3e78476739
Franzbischoff document improvements (#241)
* cosmetic changes to be compatible to hadolint

* common configuration for most editors until better plugins comes up

* Changes on PDF metadata, using PyMuPDF (faster and more compatible)

* small changes on other file ingestions in order to try to keep the fields equal

* Lint, review, and review

* fixed unknown chars

* Use PyMuPDF for pdf loading for 200% speed increase
linting

---------

Co-authored-by: Francisco Bischoff <franzbischoff@gmail.com>
Co-authored-by: Francisco Bischoff <984592+franzbischoff@users.noreply.github.com>
2023-09-18 16:21:37 -07:00
..
watch Franzbischoff document improvements (#241) 2023-09-18 16:21:37 -07:00
__init__.py inital commit 2023-06-03 19:28:07 -07:00
gitbook.py Franzbischoff document improvements (#241) 2023-09-18 16:21:37 -07:00
link_utils.py inital commit 2023-06-03 19:28:07 -07:00
link.py be able to parse relative and FQDN links from root reliabily (#138) 2023-07-05 14:40:54 -07:00
medium_utils.py inital commit 2023-06-03 19:28:07 -07:00
medium.py Docker support (#34) 2023-06-13 11:26:11 -07:00
sitemap.py dockerfile cleanup; enforce text LF line endings (#81) 2023-06-17 20:18:01 -07:00
substack_utils.py inital commit 2023-06-03 19:28:07 -07:00
substack.py Docker support (#34) 2023-06-13 11:26:11 -07:00
twitter.py Twitter Feature (#134) 2023-07-06 14:05:50 -07:00
utils.py inital commit 2023-06-03 19:28:07 -07:00
youtube.py Docker support (#34) 2023-06-13 11:26:11 -07:00
yt_utils.py inital commit 2023-06-03 19:28:07 -07:00