searxng/searx/engines/digbt.py

# SPDX-License-Identifier: AGPL-3.0-or-later
"""
 DigBT (Videos, Music, Files)
"""

from urllib.parse import urljoin
from lxml import html
from searx.utils import extract_text

# about
about = {
    "website": 'https://digbt.org',
    "wikidata_id": None,
    "official_api_documentation": None,
    "use_official_api": False,
    "require_api_key": False,
    "results": 'HTML',
}

categories = ['videos', 'music', 'files']
paging = True

URL = 'https://digbt.org'
SEARCH_URL = URL + '/search/{query}-time-{pageno}'
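# Indices into the whitespace-split text of a result's "tail" div:
# the size value and its multiplier (unit).
FILESIZE = 3
FILESIZE_MULTIPLIER = 4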


def request(query, params):
    """Build the search URL from the query and the requested page number."""
    params['url'] = SEARCH_URL.format(query=query, pageno=params['pageno'])

    return params


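def response(resp):
    """Parse the HTML result page into a list of torrent results."""
    dom = html.fromstring(resp.text)
    # each search hit is rendered as a table cell with class "x-item"
    search_res = dom.xpath('.//td[@class="x-item"]')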

    if not search_res:
        return []

    results = []
    for result in search_res:
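        # the anchor with a title attribute provides both the link and the title
        url = urljoin(URL, result.xpath('.//a[@title]/@href')[0])
        title = extract_text(result.xpath('.//a[@title]'))
        content = extract_text(result.xpath('.//div[@class="files"]'))
        # the "tail" div holds the metadata; split it into whitespace-separated tokens
        files_data = extract_text(result.xpath('.//div[@class="tail"]')).split()
        # pass the size on as "<value> <unit>" without parsing it
        filesize = f"{files_data[FILESIZE]} {files_data[FILESIZE_MULTIPLIER]}"
        magnetlink = result.xpath('.//div[@class="tail"]//a[@class="title"]/@href')[0]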

        results.append(
            {
                'url': url,
                'title': title,
                'content': content,
                'filesize': filesize,
                'magnetlink': magnetlink,
                'seed': 'N/A',
                'leech': 'N/A',
                'template': 'torrent.html',
            }
        )

    return results