timothycarambat
|
a6a5084565
|
merge master
|
2024-10-22 14:08:46 -07:00 |
|
timothycarambat
|
ab6f03ce1c
|
linting
|
2024-10-18 11:44:14 -07:00 |
|
Sean Hatfield
|
41522cdfb4
|
Handle non-ascii characters in single and bulk link scraper URLs (#2495)
handle non-ascii characters in urls
|
2024-10-17 17:04:00 -07:00 |
|
timothycarambat
|
8fc547e78a
|
Merge branch 'master' of github.com:Mintplex-Labs/anything-llm into render
|
2024-08-12 11:59:46 -07:00 |
|
Sean Hatfield
|
2797298507
|
Fix depth handling in bulk link scraper (#2096)
fix depth handling in bulk link scraper
|
2024-08-12 11:44:35 -07:00 |
|
timothycarambat
|
766537180a
|
linting
|
2024-07-19 15:25:09 -07:00 |
|
timothycarambat
|
86a31d7551
|
Merge branch 'master' of github.com:Mintplex-Labs/anything-llm into render
|
2024-07-01 17:08:59 -07:00 |
|
Sean Hatfield
|
fc375f4036
|
[FIX] Bulk link scraper bug fix (#1800)
patch website depth data connector to work for other links that are not root url
|
2024-07-01 16:59:28 -07:00 |
|
timothycarambat
|
d603d0fd51
|
patch: update storage for bulk-website scraper for render
|
2024-05-14 12:59:14 -07:00 |
|
timothycarambat
|
b5ac944475
|
patch: bulk-scraper, update when folder is made and path creation params
|
2024-05-14 12:57:23 -07:00 |
|
Sean Hatfield
|
612a7e1662
|
[FEAT] Website depth scraping data connector (#1191)
* WIP website depth scraping, (sort of works)
* website depth data connector stable + add maxLinks option
* linting + loading small ui tweak
* refactor website depth data connector for stability, speed, & readability
* patch: remove console log
Guard clause on URL validity check
reasonable overrides
---------
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
|
2024-05-14 12:49:14 -07:00 |
|