anything-llm

mirror of https://github.com/Mintplex-Labs/anything-llm.git synced 2024-11-15 02:50:10 +01:00

Open-source multi-user ChatGPT for all LLMs, embedders, and vector databases. Unlimited documents, messages, and users in one privacy-focused app.


Timothy Carambat 5df6b5f7d9 Bump perplexity models (#1905 ) * Added Supported Models Free Tier - chat_models.txt Need to fill in correct Parameter Count. * Bump perplexity model closes #1901 closes #1900 --------- Co-authored-by: Tim-Hoekstra <135951177+Tim-Hoekstra@users.noreply.github.com>		2024-07-19 15:11:10 -07:00
.devcontainer	Feature/devcontv2 (#1622 )	2024-06-06 12:50:42 -07:00
.github	remove temp workflow file	2024-07-19 08:10:04 -07:00
.vscode	[FEAT] Create custom pdfloader (#1852 )	2024-07-11 12:26:11 -07:00
cloud-deployments	Digital Ocean Terraform Patches (#1866 )	2024-07-14 20:07:17 -07:00
collector	[FIX] PDFLoader module bug fix (#1879 )	2024-07-16 13:09:43 -07:00
docker	Docker build frontend layer improvements (#1904 )	2024-07-19 15:01:16 -07:00
embed	[FEAT] Generic error messages for embed chat widget (#1861 )	2024-07-15 12:40:29 -07:00
frontend	linting hotfix	2024-07-19 15:10:32 -07:00
images	replace stored GIF with Github CDN hosted image	2024-01-04 10:59:24 -08:00
locales	Update README.zh-CN.md (#1827 )	2024-07-08 09:36:28 -07:00
server	Bump perplexity models (#1905 )	2024-07-19 15:11:10 -07:00
.dockerignore	Docker build frontend layer improvements (#1904 )	2024-07-19 15:01:16 -07:00
.editorconfig	devcontainer v1 (#297 )	2024-01-08 15:31:06 -08:00
.gitattributes	dockerfile cleanup; enforce text LF line endings (#81 )	2023-06-17 20:18:01 -07:00
.gitignore	devcontainer v1 (#297 )	2024-01-08 15:31:06 -08:00
.hadolint.yaml	Update Ubuntu base image and improve Dockerfile (#609 )	2024-03-06 16:34:45 -08:00
.nvmrc	add .nvmrc in root	2023-06-08 10:35:36 -07:00
.prettierignore	Feature/devcontv2 (#1622 )	2024-06-06 12:50:42 -07:00
.prettierrc	Feature/devcontv2 (#1622 )	2024-06-06 12:50:42 -07:00
BARE_METAL.md	update STORAGE_DIR for baremetal.md	2024-05-10 09:48:03 -07:00
eslint.config.js	devcontainer v1 (#297 )	2024-01-08 15:31:06 -08:00
LICENSE	inital commit ⚡	2023-06-03 19:28:07 -07:00
package.json	Init support of i18n and English, Mandarin, Spanish, French (#1317 )	2024-06-19 14:48:19 -07:00
pull_request_template.md	README updates	2024-04-19 11:46:49 -07:00
README.md	fix readme typo	2024-06-12 09:36:35 -07:00
SECURITY.md	Create SECURITY.md	2023-09-08 16:31:30 -07:00

README.md

AnythingLLM: The all-in-one AI app you were looking for.
Chat with your docs, use AI Agents, hyper-configurable, multi-user, & no frustrating set up required.

Docs | Hosted Instance

English · 简体中文 · 日本語

👉 AnythingLLM for desktop (Mac, Windows, & Linux)! Download Now

A full-stack application that enables you to turn any document, resource, or piece of content into context that any LLM can use as references during chatting. This application allows you to pick and choose which LLM or Vector Database you want to use as well as supporting multi-user management and permissions.

Watch the demo!

Product Overview

AnythingLLM is a full-stack application that lets you use commercial off-the-shelf LLMs or popular open-source LLMs and vectorDB solutions to build a private ChatGPT with no compromises. You can run it locally or host it remotely, and chat intelligently with any documents you provide it.

AnythingLLM divides your documents into objects called workspaces. A Workspace functions a lot like a thread, but with the addition of containerization of your documents. Workspaces can share documents, but they do not talk to each other so you can keep your context for each workspace clean.
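As a rough illustration of that separation, here is a minimal sketch of two workspaces that share a document but keep their chat history apart. The Workspace class, its fields, and the file name are hypothetical and only model the idea; they are not AnythingLLM's actual data layer.

```javascript
// Hypothetical model of workspaces; not AnythingLLM's actual implementation.
class Workspace {
  constructor(name) {
    this.name = name;
    this.documents = new Set(); // documents visible to this workspace
    this.messages = [];         // chat history private to this workspace
  }

  addDocument(docId) {
    this.documents.add(docId);
  }

  chat(message) {
    this.messages.push(message);
  }
}

const legal = new Workspace("legal");
const research = new Workspace("research");

// The same document can be attached to more than one workspace...
legal.addDocument("contract.pdf");
research.addDocument("contract.pdf");

// ...but a chat only lands in the workspace it was sent to.
legal.chat("Summarize the termination clause.");

console.log(research.documents.has("contract.pdf")); // true
console.log(research.messages.length);               // 0
```

Keeping the history on each workspace object, rather than in one shared list, is what keeps the context of one workspace from leaking into another in this sketch.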

Some cool features of AnythingLLM

Multi-user instance support and permissioning
Agents inside your workspace (browse the web, run code, etc)
Custom Embeddable Chat widget for your website
Multiple document type support (PDF, TXT, DOCX, etc)
Manage documents in your vector database from a simple UI
Two chat modes: conversation and query. Conversation retains previous questions and amendments. Query is simple QA against your documents.
In-chat citations
100% Cloud deployment ready.
"Bring your own LLM" model.
Extremely efficient cost-saving measures for managing very large documents. You'll never pay to embed a massive document or transcript more than once. 90% more cost effective than other document chatbot solutions.
Full Developer API for custom integrations!
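
The difference between the two chat modes in the list above can be sketched as follows. This is only an illustration of the idea that conversation mode carries earlier turns forward while query mode does not; the buildContext helper and the message shape are assumptions, not the project's API.

```javascript
// Hypothetical helper: decides which prior messages accompany a new question.
function buildContext(mode, history, question) {
  if (mode !== "conversation" && mode !== "query") {
    throw new Error(`Unknown chat mode: ${mode}`);
  }
  // Conversation mode keeps earlier turns; query mode starts fresh each time.
  const priorTurns = mode === "conversation" ? history : [];
  return [...priorTurns, { role: "user", content: question }];
}

const history = [
  { role: "user", content: "What is the refund policy?" },
  { role: "assistant", content: "Refunds are available within 30 days." },
];

console.log(buildContext("conversation", history, "Does that cover sale items?").length); // 3
console.log(buildContext("query", history, "Does that cover sale items?").length);        // 1
```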

Supported LLMs, Embedder Models, Speech models, and Vector Databases

Large Language Models:

Embedder models:

Audio Transcription models:

AnythingLLM Built-in (default)
OpenAI

TTS (text-to-speech) support:

Native Browser Built-in (default)
OpenAI TTS
ElevenLabs

STT (speech-to-text) support:

Native Browser Built-in (default)

Vector Databases:

Technical Overview

This monorepo consists of five main sections:

frontend: A viteJS + React frontend that you can run to easily create and manage all your content the LLM can use.
server: A NodeJS express server to handle all the interactions and do all the vectorDB management and LLM interactions.
collector: A NodeJS express server that processes and parses documents from the UI.
docker: Docker instructions and build process + information for building from source.
embed: Code specifically for generation of the embed widget.

🛳 Self Hosting

Mintplex Labs & the community maintain a number of deployment methods, scripts, and templates that you can use to run AnythingLLM locally. Refer to the table below to read how to deploy on your preferred environment or to automatically deploy.

Docker	AWS	GCP	Digital Ocean	Render.com

Railway	RepoCloud

or set up a production AnythingLLM instance without Docker →

How to set up for development

yarn setup To fill in the required .env files you'll need in each of the application sections (from root of repo).
- Go fill those out before proceeding. Ensure server/.env.development is filled or else things won't work right.
yarn dev:server To boot the server locally (from root of repo).
yarn dev:frontend To boot the frontend locally (from root of repo).
yarn dev:collector To then run the document collector (from root of repo).

Learn about documents

Learn about vector caching
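
The features list says you never pay to embed the same document more than once. One way to picture that idea is an embed-once cache, sketched below. Everything here is an assumption for illustration: createEmbeddingCache, the toy embedder, and keying by document id are hypothetical and may not match how AnythingLLM actually caches vectors.

```javascript
// Hypothetical embed-once cache; `embedFn` stands in for any paid embedding call.
function createEmbeddingCache(embedFn) {
  const store = new Map(); // document id -> previously computed vectors
  let paidCalls = 0;

  return {
    embed(docId, text) {
      if (!store.has(docId)) {
        paidCalls += 1; // only a cache miss reaches the embedder
        store.set(docId, embedFn(text));
      }
      return store.get(docId);
    },
    get paidCalls() {
      return paidCalls;
    },
  };
}

// Toy embedder for illustration only: one number per word (its length).
const toyEmbed = (text) => text.split(/\s+/).map((word) => word.length);

const cache = createEmbeddingCache(toyEmbed);
cache.embed("transcript-1", "a very long transcript");
cache.embed("transcript-1", "a very long transcript"); // served from the cache

console.log(cache.paidCalls); // 1
```

Keying by document id keeps the sketch short, but it would return stale vectors if a document's content changed under the same id.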

Contributing

create issue
create PR with branch name format of <issue number>-<short name>
yee haw let's merge
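
For the branch name format above, a small helper might look like the sketch below. The branchName function is hypothetical, and lowercasing and hyphenating the short name is an assumption; the only rule stated here is <issue number>-<short name>.

```javascript
// Hypothetical helper for the "<issue number>-<short name>" branch format.
function branchName(issueNumber, shortName) {
  if (!Number.isInteger(issueNumber) || issueNumber <= 0) {
    throw new Error("issueNumber must be a positive integer");
  }
  const slug = shortName
    .trim()
    .toLowerCase()
    .replace(/[^a-z0-9]+/g, "-") // assumption: collapse other characters into one hyphen
    .replace(/^-+|-+$/g, "");    // drop leading and trailing hyphens
  if (slug.length === 0) {
    throw new Error("shortName must contain letters or digits");
  }
  return `${issueNumber}-${slug}`;
}

// 1234 is a placeholder issue number, not a real issue.
console.log(branchName(1234, "Fix PDF loader")); // "1234-fix-pdf-loader"
```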

Telemetry & Privacy

AnythingLLM by Mintplex Labs Inc contains a telemetry feature that collects anonymous usage information.

More about Telemetry & Privacy for AnythingLLM

Why?

We use this information to help us understand how AnythingLLM is used, to help us prioritize work on new features and bug fixes, and to help us improve AnythingLLM's performance and stability.

Opting out

Set DISABLE_TELEMETRY in your server or docker .env settings to "true" to opt out of telemetry. You can also do this in-app by going to the sidebar > Privacy and disabling telemetry.
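
As a sketch of how such a flag could be read, the check below treats telemetry as disabled only when DISABLE_TELEMETRY is exactly the string "true". The telemetryDisabled function is hypothetical and the strict string comparison is an assumption; the server's real check may differ.

```javascript
// Hypothetical check; the real server's logic may differ.
function telemetryDisabled(env = process.env) {
  // Environment variables are strings, so compare against "true", not a boolean.
  return env.DISABLE_TELEMETRY === "true";
}

console.log(telemetryDisabled({ DISABLE_TELEMETRY: "true" }));  // true
console.log(telemetryDisabled({ DISABLE_TELEMETRY: "false" })); // false
console.log(telemetryDisabled({}));                              // false
```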

What do you explicitly track?

We will only track usage details that help us make product and roadmap decisions, specifically:

Type of your installation (Docker or Desktop)
When a document is added or removed. No information about the document. Just that the event occurred. This gives us an idea of use.
Type of vector database in use. Lets us know which vector database provider is the most used to prioritize changes when updates arrive for that provider.
Type of LLM in use. Lets us know the most popular choice and prioritize changes when updates arrive for that provider.
Chat is sent. This is the most regular "event" and gives us an idea of the daily-activity of this project across all installations. Again, only the event is sent - we have no information on the nature or content of the chat itself.

You can verify these claims by finding all locations where Telemetry.sendTelemetry is called. Additionally, these events are written to the output log, so you can also see the specific data that was sent, if enabled. No IP or other identifying information is collected. The Telemetry provider is PostHog, an open-source telemetry collection service.

View all telemetry events in source code

🔗 More Products

VectorAdmin: An all-in-one GUI & tool-suite for managing vector databases.
OpenAI Assistant Swarm: Turn your entire library of OpenAI assistants into one single army commanded from a single agent.