searxng/searx/engines/www1x.py

# SPDX-License-Identifier: AGPL-3.0-or-later
"""
 1x (Images)
"""

from lxml import html, etree
from urllib.parse import urlencode, urljoin
from searx.utils import extract_text, eval_xpath_list, eval_xpath_getindex

# about
about = {
    "website": 'https://1x.com/',
    "wikidata_id": None,
    "official_api_documentation": None,
    "use_official_api": False,
    "require_api_key": False,
    "results": 'HTML',
}

# engine dependent config
categories = ['images']
paging = False

# search-url
base_url = 'https://1x.com'
search_url = base_url + '/backend/search.php?{query}'
gallery_url = 'https://gallery.1x.com/'


# do search-request
def request(query, params):
    params['url'] = search_url.format(query=urlencode({'q': query}))

    return params


# get response from search-request
def response(resp):
    results = []
    xmldom = etree.fromstring(resp.content)
    xmlsearchresult = eval_xpath_getindex(xmldom, '//searchresult', 0)
    dom = html.fragment_fromstring(xmlsearchresult.text, create_parent='div')
    for link in eval_xpath_list(dom, '/div/table/tr/td/div[2]//a'):
        url = urljoin(base_url, link.attrib.get('href'))
        title = extract_text(link)
        thumbnail_src = urljoin(gallery_url, eval_xpath_getindex(link, './/img', 0).attrib['src'])

        # append result
        results.append({'url': url,
                        'title': title,
                        'img_src': thumbnail_src,
                        'content': '',
                        'thumbnail_src': thumbnail_src,
                        'template': 'images.html'})

    # return results
    return results
[enh] engines: add about variable move meta information from comment to the about variable so the preferences, the documentation can show these information 2021-01-13 11:31:25 +01:00			`# SPDX-License-Identifier: AGPL-3.0-or-later`
update versions.cfg to use the current up-to-date packages 2015-05-02 15:45:17 +02:00			`"""`
			`1x (Images)`
			`"""`
[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 11:27:28 +01:00
[fix] 1x engine 2020-12-07 15:46:00 +01:00			`from lxml import html, etree`
Drop Python 2 (1/n): remove unicode string and url_utils 2020-08-06 17:42:46 +02:00			`from urllib.parse import urlencode, urljoin`
[fix] 1x engine 2020-12-07 15:46:00 +01:00			`from searx.utils import extract_text, eval_xpath_list, eval_xpath_getindex`
[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 11:27:28 +01:00
[enh] engines: add about variable move meta information from comment to the about variable so the preferences, the documentation can show these information 2021-01-13 11:31:25 +01:00			`# about`
			`about = {`
			`"website": 'https://1x.com/',`
			`"wikidata_id": None,`
			`"official_api_documentation": None,`
			`"use_official_api": False,`
			`"require_api_key": False,`
			`"results": 'HTML',`
			`}`

[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 11:27:28 +01:00			`# engine dependent config`
			`categories = ['images']`
			`paging = False`

www1x engine: remove comment about unavailable https (https is working now) 2015-06-06 19:44:41 +02:00			`# search-url`
bing_images & www1x engines use https connections 2015-06-06 19:23:07 +02:00			`base_url = 'https://1x.com'`
[fix] pep8 compatibilty 2016-01-18 12:47:31 +01:00			`search_url = base_url + '/backend/search.php?{query}'`
[fix] 1x engine 2020-12-07 15:46:00 +01:00			`gallery_url = 'https://gallery.1x.com/'`
[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 11:27:28 +01:00

			`# do search-request`
			`def request(query, params):`
			`params['url'] = search_url.format(query=urlencode({'q': query}))`

			`return params`


			`# get response from search-request`
			`def response(resp):`
			`results = []`
[fix] 1x engine 2020-12-07 15:46:00 +01:00			`xmldom = etree.fromstring(resp.content)`
			`xmlsearchresult = eval_xpath_getindex(xmldom, '//searchresult', 0)`
			`dom = html.fragment_fromstring(xmlsearchresult.text, create_parent='div')`
			`for link in eval_xpath_list(dom, '/div/table/tr/td/div[2]//a'):`
[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 11:27:28 +01:00			`url = urljoin(base_url, link.attrib.get('href'))`
[fix] update 1x engine 2019-10-16 13:27:05 +02:00			`title = extract_text(link)`
[fix] 1x engine 2020-12-07 15:46:00 +01:00			`thumbnail_src = urljoin(gallery_url, eval_xpath_getindex(link, './/img', 0).attrib['src'])`
[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 11:27:28 +01:00
			`# append result`
			`results.append({'url': url,`
			`'title': title,`
[fix] 1x engine 2020-12-07 15:46:00 +01:00			`'img_src': thumbnail_src,`
[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 11:27:28 +01:00			`'content': '',`
			`'thumbnail_src': thumbnail_src,`
			`'template': 'images.html'})`

			`# return results`
			`return results`