anything-llm

mirror of https://github.com/Mintplex-Labs/anything-llm.git synced 2024-11-14 18:40:11 +01:00

Author	SHA1	Message	Date
Timothy Carambat	a385ea3d82	CHORE: bump pplx model support (#791) bump pplx model support	2024-02-23 17:33:16 -08:00
Sean Hatfield	633f425206	[FEAT] OpenRouter integration (#784) * WIP openrouter integration * add OpenRouter options to onboarding flow and data handling * add todo to fix headers for rankings * OpenRouter LLM support complete * Fix hanging response stream with OpenRouter update tagline update comment * update timeout comment * wait for first chunk to start timer * sort OpenRouter models by organization * uppercase first letter of organization * sort grouped models by org --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-02-23 17:18:58 -08:00
Sean Hatfield	80ced5eba4	[FEAT] PerplexityAI Support (#778) * add LLM support for perplexity * update README & example env * fix ENV keys in example env files * slight changes for QA of perplexity support * Update Perplexity AI name --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-02-22 12:48:57 -08:00
Sean Hatfield	e99c74aec1	[DOCS] Update Docker documentation to show how to setup Ollama with Dockerized version of AnythingLLM (#774) * update HOW_TO_USE_DOCKER to help with Ollama setup using docker * update HOW_TO_USE_DOCKER * styles update * create separate README for ollama and link to it in HOW_TO_USE_DOCKER * styling update	2024-02-21 18:42:32 -08:00
Timothy Carambat	791c0ee9dc	Enable ability to do full-text query on documents (#758) * Enable ability to do full-text query on documents Show alert modal on first pin for client Add ability to use pins in stream/chat/embed * typo and copy update * simplify spread of context and sources	2024-02-21 13:15:45 -08:00
Timothy Carambat	c59ab9da0a	Refactor LLM chat backend (#717 ) * refactor stream/chat/embed-stram to be a single execution logic path so that it is easier to maintain and build upon * no thread in sync chat since only api uses it adjust import locations	2024-02-14 12:32:07 -08:00
Timothy Carambat	f490c35456	Recover from fatal Ollama crash from LangChain library (#693) Resolve fatal crash from Ollama failure	2024-02-07 16:23:17 -08:00
Timothy Carambat	aca5940650	Refactor handleStream to LLM Classes (#685)	2024-02-07 08:15:14 -08:00
Timothy Carambat	2bc11d3f1a	Implement support for HuggingFace Inference Endpoints (#680)	2024-02-06 09:17:51 -08:00
Sean Hatfield	21653b09fc	[FEAT] add gpt-4-turbo-preview (#651) * add gpt-4-turbo-preview * add gpt-4-turbo-preview to valid models	2024-01-26 13:03:50 -08:00
Sean Hatfield	62cea07599	add gpt-3.5-turbo-1106 model for openai LLM (#636) * add gpt-3.5-turbo-1106 model for openai LLM * add gpt-3.5-turbo-1106 as valid model for backend and per workspace model selection	2024-01-22 13:19:47 -08:00
Sean Hatfield	3fe7a25759	add token context limit for native llm settings (#614) Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-17 16:25:30 -08:00
Sean Hatfield	c2c8fe9756	add support for mistral api (#610) * add support for mistral api * update docs to show support for Mistral * add default temp to all providers, suggest different results per provider --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-17 14:42:05 -08:00
Sean Hatfield	90df37582b	Per workspace model selection (#582 ) * WIP model selection per workspace (migrations and openai saves properly * revert OpenAiOption * add support for models per workspace for anthropic, localAi, ollama, openAi, and togetherAi * remove unneeded comments * update logic for when LLMProvider is reset, reset Ai provider files with master * remove frontend/api reset of workspace chat and move logic to updateENV add postUpdate callbacks to envs * set preferred model for chat on class instantiation * remove extra param * linting * remove unused var * refactor chat model selection on workspace * linting * add fallback for base path to localai models --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-17 12:59:25 -08:00
Sean Hatfield	1d39b8a2ce	add Together AI LLM support (#560 ) * add Together AI LLM support * update readme to support together ai * Patch togetherAI implementation * add model sorting/option labels by organization for model selection * linting + add data handling for TogetherAI * change truthy statement patch validLLMSelection method --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-10 12:35:30 -08:00
Timothy Carambat	75dd86967c	Implement AzureOpenAI model chat streaming (#518) resolves #492	2024-01-03 16:25:39 -08:00
Timothy Carambat	6d5968bf7e	Llm chore cleanup (#501) * move internal functions to private in class simplify lc message convertor * Fix hanging Context text when none is present	2023-12-28 14:42:34 -08:00
Timothy Carambat	2a1202de54	Patch Ollama Streaming chunk issues (#500) Replace stream/sync chats with Langchain interface for now connect #499 ref: https://github.com/Mintplex-Labs/anything-llm/issues/495#issuecomment-1871476091	2023-12-28 13:59:47 -08:00
Timothy Carambat	e0a0a8976d	Add Ollama as LLM provider option (#494) * Add support for Ollama as LLM provider resolves #493	2023-12-27 17:21:47 -08:00
Timothy Carambat	24227e48a7	Add LLM support for Google Gemini-Pro (#492) resolves #489	2023-12-27 17:08:03 -08:00
Timothy Carambat	655ebd9479	[Feature] AnythingLLM use locally hosted Llama.cpp and GGUF files for inferencing (#413 ) * Implement use of native embedder (all-Mini-L6-v2) stop showing prisma queries during dev * Add native embedder as an available embedder selection * wrap model loader in try/catch * print progress on download * add built-in LLM support (expiermental) * Update to progress output for embedder * move embedder selection options to component * saftey checks for modelfile * update ref * Hide selection when on hosted subdomain * update documentation hide localLlama when on hosted * saftey checks for storage of models * update dockerfile to pre-build Llama.cpp bindings * update lockfile * add langchain doc comment * remove extraneous --no-metal option * Show data handling for private LLM * persist model in memory for N+1 chats * update import update dev comment on token model size * update primary README * chore: more readme updates and remove screenshots - too much to maintain, just use the app! * remove screeshot link	2023-12-07 14:48:27 -08:00
Timothy Carambat	6fa8b0ce93	Add API key option to LocalAI (#407) * Add API key option to LocalAI * add api key for model dropdown selector	2023-12-04 08:38:15 -08:00
Sean Hatfield	5ad8a5f2d0	Allow use of any embedder for any llm/update data handling modal (#386) * allow use of any embedder for any llm/update data handling modal * Apply embedder override and fallback to OpenAI and Azure models --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2023-11-16 15:19:49 -08:00
Timothy Carambat	4bb99ab4bf	Support LocalAi as LLM provider by @tlandenberger (#373 ) * feature: add LocalAI as llm provider * update Onboarding/mgmt settings Grab models from models endpoint for localai merge with master * update streaming for complete chunk streaming update localAI LLM to be able to stream * force schema on URL --------- Co-authored-by: timothycarambat <rambat1010@gmail.com> Co-authored-by: tlandenberger <tobiaslandenberger@gmail.com>	2023-11-14 12:31:44 -08:00
Timothy Carambat	8743be679b	assume default model where appropriate (#366) * assume default model where appropriate * merge with master and fix other model refs	2023-11-13 15:17:22 -08:00
Timothy Carambat	c22c50cca8	Enable chat streaming for LLMs (#354) * [Draft] Enable chat streaming for LLMs * stream only, move sendChat to deprecated * Update TODO deprecation comments update console output color for streaming disabled	2023-11-13 15:07:30 -08:00
Francisco Bischoff	f499f1ba59	Using OpenAI API locally (#335 ) * Using OpenAI API locally * Infinite prompt input and compression implementation (#332) * WIP on continuous prompt window summary * wip * Move chat out of VDB simplify chat interface normalize LLM model interface have compression abstraction Cleanup compressor TODO: Anthropic stuff * Implement compression for Anythropic Fix lancedb sources * cleanup vectorDBs and check that lance, chroma, and pinecone are returning valid metadata sources * Resolve Weaviate citation sources not working with schema * comment cleanup * disable import on hosted instances (#339) * disable import on hosted instances * Update UI on disabled import/export --------- Co-authored-by: timothycarambat <rambat1010@gmail.com> * Add support for gpt-4-turbo 128K model (#340) resolves #336 Add support for gpt-4-turbo 128K model * 315 show citations based on relevancy score (#316) * settings for similarity score threshold and prisma schema updated * prisma schema migration for adding similarityScore setting * WIP * Min score default change * added similarityThreshold checking for all vectordb providers * linting --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com> * rename localai to lmstudio * forgot files that were renamed * normalize model interface * add model and context window limits * update LMStudio tagline * Fully working LMStudio integration --------- Co-authored-by: Francisco Bischoff <984592+franzbischoff@users.noreply.github.com> Co-authored-by: Timothy Carambat <rambat1010@gmail.com> Co-authored-by: Sean Hatfield <seanhatfield5@gmail.com>	2023-11-09 12:33:21 -08:00
Timothy Carambat	d34ec68702	Add support for gpt-4-turbo 128K model (#340) resolves #336 Add support for gpt-4-turbo 128K model	2023-11-06 14:22:19 -08:00
Timothy Carambat	be9d8b0397	Infinite prompt input and compression implementation (#332 ) * WIP on continuous prompt window summary * wip * Move chat out of VDB simplify chat interface normalize LLM model interface have compression abstraction Cleanup compressor TODO: Anthropic stuff * Implement compression for Anythropic Fix lancedb sources * cleanup vectorDBs and check that lance, chroma, and pinecone are returning valid metadata sources * Resolve Weaviate citation sources not working with schema * comment cleanup	2023-11-06 13:13:53 -08:00
Timothy Carambat	67c85f1550	Implement retrieval and use of fine-tune models (#314) * Implement retrieval and use of fine-tune models Cleanup LLM selection code resolves #311 * Cleanup from PR bot	2023-10-31 11:38:28 -07:00
Timothy Carambat	5d56ab623b	Anthropic claude 2 support (#305 ) * WIP Anythropic support for chat, chat and query w/context * Add onboarding support for Anthropic * cleanup * fix Anthropic answer parsing move embedding selector to general util	2023-10-30 15:44:03 -07:00
Timothy Carambat	a8ec0d9584	Compensate for upper OpenAI emedding limit chunk size (#292 ) Limit is due to POST body max size. Sufficiently large requests will abort automatically We should report that error back on the frontend during embedding Update vectordb providers to return on failed	2023-10-26 10:57:37 -07:00
Timothy Carambat	2a28415de4	Make openAI Azure embedding requests run concurrently to avoid input limits per call (#211) resolves #184	2023-08-22 10:23:29 -07:00
Timothy Carambat	1f29cec918	Multiple LLM Support framework + AzureOpenAI Support (#180 ) * Remove LangchainJS for chat support chaining Implement runtime LLM selection Implement AzureOpenAI Support for LLM + Emebedding WIP on frontend Update env to reflect the new fields * Remove LangchainJS for chat support chaining Implement runtime LLM selection Implement AzureOpenAI Support for LLM + Emebedding WIP on frontend Update env to reflect the new fields * Replace keys with LLM Selection in settings modal Enforce checks for new ENVs depending on LLM selection	2023-08-04 14:56:27 -07:00
Timothy Carambat	9bea7739ed	move OpenAI to AiProvider folder in preparation for new AI provider support	2023-07-28 12:09:49 -07:00

35 Commits