anything-llm

mirror of https://github.com/Mintplex-Labs/anything-llm.git synced 2024-08-19 11:50:10 +02:00

Author	SHA1	Message	Date
Timothy Carambat	c59ab9da0a	Refactor LLM chat backend (#717) * refactor stream/chat/embed-stream to be a single execution logic path so that it is easier to maintain and build upon * no thread in sync chat since only api uses it; adjust import locations	2024-02-14 12:32:07 -08:00
Timothy Carambat	aca5940650	Refactor handleStream to LLM Classes (#685)	2024-02-07 08:15:14 -08:00
Sean Hatfield	3fe7a25759	add token context limit for native llm settings (#614) Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-17 16:25:30 -08:00
Sean Hatfield	c2c8fe9756	add support for mistral api (#610) * add support for mistral api * update docs to show support for Mistral * add default temp to all providers, suggest different results per provider --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-17 14:42:05 -08:00
Sean Hatfield	90df37582b	Per workspace model selection (#582) * WIP model selection per workspace (migrations and openai saves properly) * revert OpenAiOption * add support for models per workspace for anthropic, localAi, ollama, openAi, and togetherAi * remove unneeded comments * update logic for when LLMProvider is reset, reset Ai provider files with master * remove frontend/api reset of workspace chat and move logic to updateENV; add postUpdate callbacks to envs * set preferred model for chat on class instantiation * remove extra param * linting * remove unused var * refactor chat model selection on workspace * linting * add fallback for base path to localai models --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-17 12:59:25 -08:00
Timothy Carambat	6d5968bf7e	Llm chore cleanup (#501) * move internal functions to private in class; simplify lc message converter * Fix hanging Context text when none is present	2023-12-28 14:42:34 -08:00
Timothy Carambat	655ebd9479	[Feature] AnythingLLM use locally hosted Llama.cpp and GGUF files for inferencing (#413) * Implement use of native embedder (all-Mini-L6-v2); stop showing prisma queries during dev * Add native embedder as an available embedder selection * wrap model loader in try/catch * print progress on download * add built-in LLM support (experimental) * Update to progress output for embedder * move embedder selection options to component * safety checks for modelfile * update ref * Hide selection when on hosted subdomain * update documentation; hide localLlama when on hosted * safety checks for storage of models * update dockerfile to pre-build Llama.cpp bindings * update lockfile * add langchain doc comment * remove extraneous --no-metal option * Show data handling for private LLM * persist model in memory for N+1 chats * update import; update dev comment on token model size * update primary README * chore: more readme updates and remove screenshots - too much to maintain, just use the app! * remove screenshot link	2023-12-07 14:48:27 -08:00

7 Commits