anything-llm

mirror of https://github.com/Mintplex-Labs/anything-llm.git synced 2024-11-14 10:30:10 +01:00

Author	SHA1	Message	Date
Timothy Carambat	c59ab9da0a	Refactor LLM chat backend (#717 ) * refactor stream/chat/embed-stram to be a single execution logic path so that it is easier to maintain and build upon * no thread in sync chat since only api uses it adjust import locations	2024-02-14 12:32:07 -08:00
Timothy Carambat	f490c35456	Recover from fatal Ollama crash from LangChain library (#693 ) Resolve fatal crash from Ollama failure	2024-02-07 16:23:17 -08:00
Timothy Carambat	aca5940650	Refactor handleStream to LLM Classes (#685 )	2024-02-07 08:15:14 -08:00
Timothy Carambat	2bc11d3f1a	Implement support for HuggingFace Inference Endpoints (#680 )	2024-02-06 09:17:51 -08:00
Sean Hatfield	21653b09fc	[FEAT] add gpt-4-turbo-preview (#651 ) * add gpt-4-turbo-preview * add gpt-4-turbo-preview to valid models	2024-01-26 13:03:50 -08:00
Sean Hatfield	62cea07599	add gpt-3.5-turbo-1106 model for openai LLM (#636 ) * add gpt-3.5-turbo-1106 model for openai LLM * add gpt-3.5-turbo-1106 as valid model for backend and per workspace model selection	2024-01-22 13:19:47 -08:00
Sean Hatfield	3fe7a25759	add token context limit for native llm settings (#614 ) Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-17 16:25:30 -08:00
Sean Hatfield	c2c8fe9756	add support for mistral api (#610 ) * add support for mistral api * update docs to show support for Mistral * add default temp to all providers, suggest different results per provider --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-17 14:42:05 -08:00
Sean Hatfield	90df37582b	Per workspace model selection (#582 ) * WIP model selection per workspace (migrations and openai saves properly * revert OpenAiOption * add support for models per workspace for anthropic, localAi, ollama, openAi, and togetherAi * remove unneeded comments * update logic for when LLMProvider is reset, reset Ai provider files with master * remove frontend/api reset of workspace chat and move logic to updateENV add postUpdate callbacks to envs * set preferred model for chat on class instantiation * remove extra param * linting * remove unused var * refactor chat model selection on workspace * linting * add fallback for base path to localai models --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-17 12:59:25 -08:00
Sean Hatfield	1d39b8a2ce	add Together AI LLM support (#560 ) * add Together AI LLM support * update readme to support together ai * Patch togetherAI implementation * add model sorting/option labels by organization for model selection * linting + add data handling for TogetherAI * change truthy statement patch validLLMSelection method --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-10 12:35:30 -08:00
Timothy Carambat	75dd86967c	Implement AzureOpenAI model chat streaming (#518 ) resolves #492	2024-01-03 16:25:39 -08:00
Timothy Carambat	6d5968bf7e	Llm chore cleanup (#501 ) * move internal functions to private in class simplify lc message convertor * Fix hanging Context text when none is present	2023-12-28 14:42:34 -08:00
Timothy Carambat	2a1202de54	Patch Ollama Streaming chunk issues (#500 ) Replace stream/sync chats with Langchain interface for now connect #499 ref: https://github.com/Mintplex-Labs/anything-llm/issues/495#issuecomment-1871476091	2023-12-28 13:59:47 -08:00
Timothy Carambat	e0a0a8976d	Add Ollama as LLM provider option (#494 ) * Add support for Ollama as LLM provider resolves #493	2023-12-27 17:21:47 -08:00
Timothy Carambat	24227e48a7	Add LLM support for Google Gemini-Pro (#492 ) resolves #489	2023-12-27 17:08:03 -08:00
Timothy Carambat	655ebd9479	[Feature] AnythingLLM use locally hosted Llama.cpp and GGUF files for inferencing (#413 ) * Implement use of native embedder (all-Mini-L6-v2) stop showing prisma queries during dev * Add native embedder as an available embedder selection * wrap model loader in try/catch * print progress on download * add built-in LLM support (expiermental) * Update to progress output for embedder * move embedder selection options to component * saftey checks for modelfile * update ref * Hide selection when on hosted subdomain * update documentation hide localLlama when on hosted * saftey checks for storage of models * update dockerfile to pre-build Llama.cpp bindings * update lockfile * add langchain doc comment * remove extraneous --no-metal option * Show data handling for private LLM * persist model in memory for N+1 chats * update import update dev comment on token model size * update primary README * chore: more readme updates and remove screenshots - too much to maintain, just use the app! * remove screeshot link	2023-12-07 14:48:27 -08:00
Timothy Carambat	6fa8b0ce93	Add API key option to LocalAI (#407 ) * Add API key option to LocalAI * add api key for model dropdown selector	2023-12-04 08:38:15 -08:00
Sean Hatfield	5ad8a5f2d0	Allow use of any embedder for any llm/update data handling modal (#386 ) * allow use of any embedder for any llm/update data handling modal * Apply embedder override and fallback to OpenAI and Azure models --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2023-11-16 15:19:49 -08:00
Timothy Carambat	4bb99ab4bf	Support LocalAi as LLM provider by @tlandenberger (#373 ) * feature: add LocalAI as llm provider * update Onboarding/mgmt settings Grab models from models endpoint for localai merge with master * update streaming for complete chunk streaming update localAI LLM to be able to stream * force schema on URL --------- Co-authored-by: timothycarambat <rambat1010@gmail.com> Co-authored-by: tlandenberger <tobiaslandenberger@gmail.com>	2023-11-14 12:31:44 -08:00
Timothy Carambat	8743be679b	assume default model where appropriate (#366 ) * assume default model where appropriate * merge with master and fix other model refs	2023-11-13 15:17:22 -08:00
Timothy Carambat	c22c50cca8	Enable chat streaming for LLMs (#354 ) * [Draft] Enable chat streaming for LLMs * stream only, move sendChat to deprecated * Update TODO deprecation comments update console output color for streaming disabled	2023-11-13 15:07:30 -08:00
Francisco Bischoff	f499f1ba59	Using OpenAI API locally (#335 ) * Using OpenAI API locally * Infinite prompt input and compression implementation (#332) * WIP on continuous prompt window summary * wip * Move chat out of VDB simplify chat interface normalize LLM model interface have compression abstraction Cleanup compressor TODO: Anthropic stuff * Implement compression for Anythropic Fix lancedb sources * cleanup vectorDBs and check that lance, chroma, and pinecone are returning valid metadata sources * Resolve Weaviate citation sources not working with schema * comment cleanup * disable import on hosted instances (#339) * disable import on hosted instances * Update UI on disabled import/export --------- Co-authored-by: timothycarambat <rambat1010@gmail.com> * Add support for gpt-4-turbo 128K model (#340) resolves #336 Add support for gpt-4-turbo 128K model * 315 show citations based on relevancy score (#316) * settings for similarity score threshold and prisma schema updated * prisma schema migration for adding similarityScore setting * WIP * Min score default change * added similarityThreshold checking for all vectordb providers * linting --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com> * rename localai to lmstudio * forgot files that were renamed * normalize model interface * add model and context window limits * update LMStudio tagline * Fully working LMStudio integration --------- Co-authored-by: Francisco Bischoff <984592+franzbischoff@users.noreply.github.com> Co-authored-by: Timothy Carambat <rambat1010@gmail.com> Co-authored-by: Sean Hatfield <seanhatfield5@gmail.com>	2023-11-09 12:33:21 -08:00
Timothy Carambat	d34ec68702	Add support for gpt-4-turbo 128K model (#340 ) resolves #336 Add support for gpt-4-turbo 128K model	2023-11-06 14:22:19 -08:00
Timothy Carambat	be9d8b0397	Infinite prompt input and compression implementation (#332 ) * WIP on continuous prompt window summary * wip * Move chat out of VDB simplify chat interface normalize LLM model interface have compression abstraction Cleanup compressor TODO: Anthropic stuff * Implement compression for Anythropic Fix lancedb sources * cleanup vectorDBs and check that lance, chroma, and pinecone are returning valid metadata sources * Resolve Weaviate citation sources not working with schema * comment cleanup	2023-11-06 13:13:53 -08:00
Timothy Carambat	67c85f1550	Implement retrieval and use of fine-tune models (#314 ) * Implement retrieval and use of fine-tune models Cleanup LLM selection code resolves #311 * Cleanup from PR bot	2023-10-31 11:38:28 -07:00
Timothy Carambat	5d56ab623b	Anthropic claude 2 support (#305 ) * WIP Anythropic support for chat, chat and query w/context * Add onboarding support for Anthropic * cleanup * fix Anthropic answer parsing move embedding selector to general util	2023-10-30 15:44:03 -07:00
Timothy Carambat	a8ec0d9584	Compensate for upper OpenAI emedding limit chunk size (#292 ) Limit is due to POST body max size. Sufficiently large requests will abort automatically We should report that error back on the frontend during embedding Update vectordb providers to return on failed	2023-10-26 10:57:37 -07:00
Timothy Carambat	2a28415de4	Make openAI Azure embedding requests run concurrently to avoid input limits per call (#211 ) resolves #184	2023-08-22 10:23:29 -07:00
Timothy Carambat	1f29cec918	Multiple LLM Support framework + AzureOpenAI Support (#180 ) * Remove LangchainJS for chat support chaining Implement runtime LLM selection Implement AzureOpenAI Support for LLM + Emebedding WIP on frontend Update env to reflect the new fields * Remove LangchainJS for chat support chaining Implement runtime LLM selection Implement AzureOpenAI Support for LLM + Emebedding WIP on frontend Update env to reflect the new fields * Replace keys with LLM Selection in settings modal Enforce checks for new ENVs depending on LLM selection	2023-08-04 14:56:27 -07:00
timothycarambat	9bea7739ed	move OpenAI to AiProvider folder in preparation for new AI provider support	2023-07-28 12:09:49 -07:00

30 Commits