Commit Graph

5 Commits

Author SHA1 Message Date
Timothy Carambat
719521c307
Document Processor v2 (#442)
* wip: init refactor of document processor to JS

* add NodeJs PDF support

* wip: parity with python processor
feat: add pptx support

* fix: forgot files

* Remove python scripts totally

* wip: update docker to boot new collector

* add package.json support

* update dockerfile for new build

* update gitignore and linting

* add more protections on file lookup

* update package.json

* test build

* update docker commands to use cap-add=SYS_ADMIN so web scraper can run
update all scripts to reflect this
remove docker build for branch
2023-12-14 15:14:56 -08:00
Timothy Carambat
9242578263
Aws docker fixes (#309)
* WIP aws fixes to latest changes;

* change default in docker env

* bump storage default for docker

* Updates to docker deployment
2023-10-29 11:03:41 -07:00
timothycarambat
1b4e29a3b9
bump up gunicorn timeout for large files
2023-08-16 17:54:00 -07:00
frasergr
4079020de0
dockerfile cleanup; enforce text LF line endings (#81)
2023-06-17 20:18:01 -07:00
frasergr
9f33b3dfcb
Docker support (#34)
* Updates for Linux for frontend/server

* frontend/server docker

* updated Dockerfile for deps related to node vectordb

* updates for collector in docker

* docker deps for ODT processing

* ignore another collector dir

* storage mount improvements; run as UID

* fix pypandoc version typo

* permissions fixes
2023-06-13 11:26:11 -07:00