<p align="center">
  <a href="https://useanything.com"><img src="https://github.com/Mintplex-Labs/anything-llm/blob/master/images/wordmark.png?raw=true" alt="AnythingLLM logo"></a>
</p>

<p align="center">
    <b>AnythingLLM: A business-compliant document chatbot</b>. <br />
    A hyper-efficient and open-source enterprise-ready document chatbot solution for all.
</p>

<p align="center">
  <a href="https://discord.gg/6UyHPeGZAC" target="_blank">
      <img src="https://dcbadge.vercel.app/api/server/6UyHPeGZAC?compact=true&style=flat" alt="Discord">
  </a> |
  <a href="https://github.com/Mintplex-Labs/anything-llm/blob/master/LICENSE" target="_blank">
      <img src="https://img.shields.io/static/v1?label=license&message=MIT&color=white" alt="License">
  </a> |
  <a href="https://docs.useanything.com" target="_blank">
    Docs
  </a> |
   <a href="https://my.mintplexlabs.com/aio-checkout?product=anythingllm" target="_blank">
    Hosted Instance
  </a>
</p>

A full-stack application that enables you to turn any document, resource, or piece of content into context that any LLM can use as a reference while chatting. This application allows you to pick and choose which LLM or vector database you want to use. Currently, this project supports [Pinecone](https://pinecone.io), [ChromaDB](https://trychroma.com), and more for vector storage, and [OpenAI](https://openai.com) for LLM/chatting.


![Chatting](/images/screenshots/chat.png)
[view more screenshots](/images/screenshots/SCREENSHOTS.md)

### Watch the demo!

[![Watch the video](/images/youtube.png)](https://youtu.be/0vZ69AIP_hM)


### Product Overview
AnythingLLM aims to be a full-stack application where you can use commercial off-the-shelf LLMs or popular open-source LLMs and vector database solutions.

AnythingLLM is a full-stack product that you can run locally or host remotely to chat intelligently with any documents you provide it.

AnythingLLM divides your documents into objects called `workspaces`. A workspace functions a lot like a thread, with the addition of containerization of your documents. Workspaces can share documents, but they do not talk to each other, so you can keep the context for each workspace clean.
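
To make the workspace idea concrete, here is a minimal sketch in Python. It is not AnythingLLM's actual data model: the `Workspace` class, its fields, and the file names are hypothetical and exist only to illustrate that documents can be shared while chat context stays separate.

```python
from dataclasses import dataclass, field


@dataclass
class Workspace:
    """Hypothetical model: a named container of document IDs with its own chat history."""
    name: str
    document_ids: set[str] = field(default_factory=set)
    chat_history: list[str] = field(default_factory=list)

    def add_document(self, doc_id: str) -> None:
        self.document_ids.add(doc_id)

    def chat(self, message: str) -> None:
        # History is stored per workspace, so context never leaks between workspaces.
        self.chat_history.append(message)


legal = Workspace("legal")
sales = Workspace("sales")

# The same document can be attached to more than one workspace.
legal.add_document("contract.pdf")
sales.add_document("contract.pdf")
sales.add_document("pricing.docx")

# Chatting in one workspace leaves the other workspace's history untouched.
legal.chat("Summarize the termination clause.")
```

Both workspaces reference `contract.pdf`, but only `legal` holds the chat message.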

Some cool features of AnythingLLM:
- Multi-user instance support and oversight
- Atomically manage documents in your vector database from a simple UI
- Two chat modes: `conversation` and `query`. Conversation retains previous questions and amendments. Query is simple QA against your documents.
- Each chat response contains a citation that is linked to the original content
- Simple technology stack for fast iteration
- 100% cloud deployment ready.
- "Bring your own LLM" model. _Still in progress; only OpenAI is supported currently._
- Extremely efficient cost-saving measures for managing very large documents. You'll never pay to embed a massive document or transcript more than once. 90% more cost-effective than other document chatbot solutions.
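
The difference between the two chat modes in the list above can be sketched as follows. This is an illustration under assumptions, not the server's implementation: `build_messages` and its plain-string message format are hypothetical.

```python
def build_messages(mode: str, question: str, history: list[str], context: list[str]) -> list[str]:
    """Assemble the text sent to the LLM for one chat turn (illustrative only)."""
    if mode not in ("conversation", "query"):
        raise ValueError(f"unknown chat mode: {mode}")
    # Both modes include the snippets retrieved from your documents.
    messages = [f"context: {snippet}" for snippet in context]
    if mode == "conversation":
        # Conversation mode carries earlier turns forward.
        messages.extend(f"previous: {turn}" for turn in history)
    # Query mode skips history: each question stands alone against the documents.
    messages.append(f"question: {question}")
    return messages


history = ["What is the notice period?"]
context = ["Either party may terminate with 30 days notice."]

query_turn = build_messages("query", "Who can terminate?", history, context)
conversation_turn = build_messages("conversation", "Who can terminate?", history, context)
```

Here `conversation_turn` contains one more entry than `query_turn`: the earlier question.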

### Technical Overview
This monorepo consists of three main sections:
- `collector`: Python tools that enable you to quickly convert online resources or local documents into an LLM-usable format.
- `frontend`: A ViteJS + React frontend that you can run to easily create and manage all the content the LLM can use.
- `server`: A NodeJS + Express server that handles all the interactions, including vector database management and LLM calls.

### Requirements
- `yarn` and `node` on your machine
- `python` 3.9+ for running scripts in `collector/`.
- access to an LLM such as `GPT-3.5` or `GPT-4`.
- a free [Pinecone.io](https://pinecone.io) account*.
*You can use drop-in replacements for these. This is just the easiest way to get up and running fast. We support multiple vector database providers.

### How to get started (Docker - simple setup)
[Get up and running in minutes with Docker](./docker/HOW_TO_USE_DOCKER.md)


### How to get started (Development environment)
- `yarn setup` from the project root directory.
  - This will fill in the required `.env` files you'll need in each of the application sections. Go fill those out before proceeding or else things won't work right.
- `cd frontend && yarn install && cd ../server && yarn install` from the project root directory.
 

Next, you will need some content to embed. This could be a YouTube channel, Medium articles, local text files, Word documents, and the list goes on. This is where you will use the `collector/` part of the repo.

[Go set up and run collector scripts](./collector/README.md)

[Learn about documents](./server/storage/documents/DOCUMENTS.md)

[Learn about vector caching](./server/storage/vector-cache/VECTOR_CACHE.md)
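
The "never pay to embed a document more than once" idea behind vector caching can be sketched like this. It is a simplified illustration, not the project's cache format: `EmbeddingCache` is hypothetical, keying on a SHA-256 hash of the text is an assumption, and `fake_embed` is a stand-in for a real (paid) embedding API call.

```python
import hashlib
from typing import Callable

Vector = list[float]


class EmbeddingCache:
    """Hypothetical in-memory cache keyed by a hash of the document text."""

    def __init__(self, embed: Callable[[str], Vector]) -> None:
        self._embed = embed  # the paid embedding call (a stand-in in this sketch)
        self._store: dict[str, Vector] = {}
        self.paid_calls = 0

    def get(self, text: str) -> Vector:
        key = hashlib.sha256(text.encode("utf-8")).hexdigest()
        if key not in self._store:
            # Cache miss: pay for the embedding once and remember the result.
            self._store[key] = self._embed(text)
            self.paid_calls += 1
        return self._store[key]


def fake_embed(text: str) -> Vector:
    # Stand-in for a real embedding API; returns a toy one-dimensional vector.
    return [float(len(text))]


cache = EmbeddingCache(fake_embed)
first = cache.get("a very long transcript")
second = cache.get("a very long transcript")  # served from the cache, no second paid call
```

A real cache would persist vectors to disk rather than hold them in memory; the linked document describes how the project actually stores them.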

### Contributing
- create issue
- create PR with branch name format of `<issue number>-<short name>`
- yee haw let's merge