Functions
None	request (str query, "OnlineParams" params)
EngineResults	response ("SXNG_Response" resp)

Variables
dict	about
list	categories = ["science", "scientific publications"]
bool	paging = True
int	arxiv_max_results = 10
str	arxiv_search_prefix = "all"
str	base_url = "https://export.arxiv.org/api/query"
dict	arxiv_namespaces
	xpath_entry = XPath("//atom:entry", namespaces=arxiv_namespaces)
	xpath_title = XPath(".//atom:title", namespaces=arxiv_namespaces)
	xpath_id = XPath(".//atom:id", namespaces=arxiv_namespaces)
	xpath_summary = XPath(".//atom:summary", namespaces=arxiv_namespaces)
	xpath_author_name = XPath(".//atom:author/atom:name", namespaces=arxiv_namespaces)
	xpath_doi = XPath(".//arxiv:doi", namespaces=arxiv_namespaces)
	xpath_pdf = XPath(".//atom:link[@title='pdf']", namespaces=arxiv_namespaces)
	xpath_published = XPath(".//atom:published", namespaces=arxiv_namespaces)
	xpath_journal = XPath(".//arxiv:journal_ref", namespaces=arxiv_namespaces)
	xpath_category = XPath(".//atom:category/@term", namespaces=arxiv_namespaces)
	xpath_comment = XPath("./arxiv:comment", namespaces=arxiv_namespaces)

Detailed Description

arXiv is a free distribution service and an open-access archive for nearly
2.4 million scholarly articles in the fields of physics, mathematics, computer
science, quantitative biology, quantitative finance, statistics, electrical
engineering and systems science, and economics.

The engine uses the arXiv API (https://info.arxiv.org/help/api/user-manual.html).

Function Documentation

◆ request()

None searx.engines.arxiv.request ( str query, "OnlineParams" params )

Definition at line 68 of file arxiv.py.

def request(query: str, params: "OnlineParams") -> None:
 
    args = {
        "search_query": f"{arxiv_search_prefix}:{query}",
        "start": (params["pageno"] - 1) * arxiv_max_results,
        "max_results": arxiv_max_results,
    }
    params["url"] = f"{base_url}?{urlencode(args)}"
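
The URL assembled by request() can be reproduced outside SearXNG with a small standalone sketch. The helper name build_arxiv_url and its keyword arguments are hypothetical and exist only for illustration; the three query arguments and the use of urlencode follow the listing above.

```python
from urllib.parse import urlencode

BASE_URL = "https://export.arxiv.org/api/query"


def build_arxiv_url(query: str, pageno: int, max_results: int = 10, prefix: str = "all") -> str:
    """Hypothetical helper mirroring the arguments built in request()."""
    args = {
        "search_query": f"{prefix}:{query}",
        # Page numbers are 1-based, while the API offset is 0-based.
        "start": (pageno - 1) * max_results,
        "max_results": max_results,
    }
    return f"{BASE_URL}?{urlencode(args)}"


print(build_arxiv_url("quantum computing", pageno=2))
# → https://export.arxiv.org/api/query?search_query=all%3Aquantum+computing&start=10&max_results=10
```

Note that urlencode percent-encodes the colon between prefix and query and turns spaces into "+", so page 2 with the default page size requests results starting at offset 10.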
 
 

◆ response()

EngineResults searx.engines.arxiv.response ( "SXNG_Response" resp )

Definition at line 78 of file arxiv.py.

def response(resp: "SXNG_Response") -> EngineResults:
 
    res = EngineResults()
 
    dom = etree.fromstring(resp.content)
    for entry in eval_xpath_list(dom, xpath_entry):
 
        title: str = eval_xpath_getindex(entry, xpath_title, 0).text
 
        url: str = eval_xpath_getindex(entry, xpath_id, 0).text
        abstract: str = eval_xpath_getindex(entry, xpath_summary, 0).text
 
        authors: list[str] = [author.text for author in eval_xpath_list(entry, xpath_author_name)]
 
        #  doi
        doi_element = eval_xpath_getindex(entry, xpath_doi, 0, default=None)
        doi: str = "" if doi_element is None else doi_element.text
 
        # pdf
        pdf_element = eval_xpath_getindex(entry, xpath_pdf, 0, default=None)
        pdf_url: str = "" if pdf_element is None else pdf_element.attrib.get("href")
 
        # journal
        journal_element = eval_xpath_getindex(entry, xpath_journal, 0, default=None)
        journal: str = "" if journal_element is None else journal_element.text
 
        # tags
        tag_elements = eval_xpath(entry, xpath_category)
        tags: list[str] = [str(tag) for tag in tag_elements]
 
        # comments
        comments_elements = eval_xpath_getindex(entry, xpath_comment, 0, default=None)
        comments: str = "" if comments_elements is None else comments_elements.text
 
        publishedDate = datetime.strptime(eval_xpath_getindex(entry, xpath_published, 0).text, "%Y-%m-%dT%H:%M:%SZ")
 
        res.add(
            res.types.Paper(
                url=url,
                title=title,
                publishedDate=publishedDate,
                content=abstract,
                doi=doi,
                authors=authors,
                journal=journal,
                tags=tags,
                comments=comments,
                pdf_url=pdf_url,
            )
        )
 
    return res
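
To make the field extraction in response() easier to follow, here is a minimal sketch that applies the same idea to a made-up Atom feed. It is not the engine's code: it assumes the standard library's xml.etree.ElementTree as a stand-in for lxml and the eval_xpath helpers, returns plain dictionaries instead of Paper results, and the names parse_feed and SAMPLE_FEED are hypothetical.

```python
import xml.etree.ElementTree as ET
from datetime import datetime

NS = {
    "atom": "http://www.w3.org/2005/Atom",
    "arxiv": "http://arxiv.org/schemas/atom",
}

# Made-up feed for illustration; not a real arXiv record.
SAMPLE_FEED = """\
<feed xmlns="http://www.w3.org/2005/Atom" xmlns:arxiv="http://arxiv.org/schemas/atom">
  <entry>
    <id>http://arxiv.org/abs/0000.00000v1</id>
    <title>Example title</title>
    <summary>Example abstract.</summary>
    <published>2020-01-02T03:04:05Z</published>
    <author><name>A. Author</name></author>
    <author><name>B. Author</name></author>
    <link title="pdf" href="http://arxiv.org/pdf/0000.00000v1"/>
    <category term="cs.IR"/>
  </entry>
</feed>
"""


def parse_feed(xml_text: str) -> list[dict]:
    """Hypothetical helper: one dict per atom:entry, optional fields default to ""."""
    papers = []
    root = ET.fromstring(xml_text)
    for entry in root.findall("atom:entry", NS):
        # Optional elements may be missing, so fall back to an empty string.
        pdf = entry.find("atom:link[@title='pdf']", NS)
        published = entry.findtext("atom:published", default="", namespaces=NS)
        papers.append({
            "url": entry.findtext("atom:id", default="", namespaces=NS),
            "title": entry.findtext("atom:title", default="", namespaces=NS),
            "content": entry.findtext("atom:summary", default="", namespaces=NS),
            "authors": [a.text for a in entry.findall("atom:author/atom:name", NS)],
            "doi": entry.findtext("arxiv:doi", default="", namespaces=NS),
            "journal": entry.findtext("arxiv:journal_ref", default="", namespaces=NS),
            "comments": entry.findtext("arxiv:comment", default="", namespaces=NS),
            "tags": [c.get("term") for c in entry.findall("atom:category", NS)],
            "pdf_url": "" if pdf is None else pdf.get("href", ""),
            "publishedDate": datetime.strptime(published, "%Y-%m-%dT%H:%M:%SZ"),
        })
    return papers


for paper in parse_feed(SAMPLE_FEED):
    print(paper["title"], paper["authors"], paper["publishedDate"])
```

The sample entry has no arxiv:doi, arxiv:journal_ref, or arxiv:comment element, so those keys come back as empty strings, which matches the defaults used in the listing above. The timestamp format ends in a literal "Z", so strptime yields a naive datetime.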

Variable Documentation

◆ about

dict searx.engines.arxiv.about

Initial value:

=  {
    "website": "https://arxiv.org",
    "wikidata_id": "Q118398",
    "official_api_documentation": "https://info.arxiv.org/help/api/user-manual.html",
    "use_official_api": True,
    "require_api_key": False,
    "results": "XML-RSS",
}

Definition at line 26 of file arxiv.py.

◆ arxiv_max_results

int searx.engines.arxiv.arxiv_max_results = 10

Definition at line 37 of file arxiv.py.

◆ arxiv_namespaces

dict searx.engines.arxiv.arxiv_namespaces

Initial value:

=  {
    "atom": "http://www.w3.org/2005/Atom",
    "arxiv": "http://arxiv.org/schemas/atom",
}

Definition at line 51 of file arxiv.py.
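
The prefix mapping matters because every element in an Atom feed lives in a namespace. The following sketch, which assumes the standard library's xml.etree.ElementTree rather than the lxml XPath objects used by the engine, shows that a lookup without the mapping finds nothing. The tiny feed is made up for illustration.

```python
import xml.etree.ElementTree as ET

NS = {
    "atom": "http://www.w3.org/2005/Atom",
    "arxiv": "http://arxiv.org/schemas/atom",
}

# Made-up document; the default namespace applies to every child element.
doc = ET.fromstring(
    '<feed xmlns="http://www.w3.org/2005/Atom">'
    "<entry><title>Example</title></entry>"
    "</feed>"
)

without_ns = doc.findall("entry")        # looks for an un-namespaced <entry>
with_ns = doc.findall("atom:entry", NS)  # resolves the "atom" prefix via NS

print(len(without_ns), len(with_ns))  # → 0 1
print(doc[0].tag)                     # → {http://www.w3.org/2005/Atom}entry
```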

◆ arxiv_search_prefix

str searx.engines.arxiv.arxiv_search_prefix = "all"

Definition at line 38 of file arxiv.py.

◆ base_url

str searx.engines.arxiv.base_url = "https://export.arxiv.org/api/query"

Definition at line 45 of file arxiv.py.

◆ categories

list searx.engines.arxiv.categories = ["science", "scientific publications"]

Definition at line 35 of file arxiv.py.

◆ paging

bool searx.engines.arxiv.paging = True

Definition at line 36 of file arxiv.py.

◆ xpath_author_name

searx.engines.arxiv.xpath_author_name = XPath(".//atom:author/atom:name", namespaces=arxiv_namespaces)

Definition at line 59 of file arxiv.py.

◆ xpath_category

searx.engines.arxiv.xpath_category = XPath(".//atom:category/@term", namespaces=arxiv_namespaces)

Definition at line 64 of file arxiv.py.

◆ xpath_comment

searx.engines.arxiv.xpath_comment = XPath("./arxiv:comment", namespaces=arxiv_namespaces)

Definition at line 65 of file arxiv.py.
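
Unlike the other expressions, xpath_comment starts with "./" rather than ".//", so it matches only direct children of the entry. The sketch below illustrates the difference using the standard library's xml.etree.ElementTree as a stand-in for lxml; the wrapper element is hypothetical and is not part of the arXiv schema.

```python
import xml.etree.ElementTree as ET

NS = {"arxiv": "http://arxiv.org/schemas/atom"}

# Made-up entry with one direct and one nested comment element.
entry = ET.fromstring(
    '<entry xmlns:arxiv="http://arxiv.org/schemas/atom">'
    "<arxiv:comment>top level</arxiv:comment>"
    "<wrapper><arxiv:comment>nested</arxiv:comment></wrapper>"
    "</entry>"
)

direct = [e.text for e in entry.findall("./arxiv:comment", NS)]
anywhere = [e.text for e in entry.findall(".//arxiv:comment", NS)]

print(direct)    # → ['top level']
print(anywhere)  # → ['top level', 'nested']
```

In a feed where the comment sits directly under the entry, both forms select the same element.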

◆ xpath_doi

searx.engines.arxiv.xpath_doi = XPath(".//arxiv:doi", namespaces=arxiv_namespaces)

Definition at line 60 of file arxiv.py.

◆ xpath_entry

searx.engines.arxiv.xpath_entry = XPath("//atom:entry", namespaces=arxiv_namespaces)

Definition at line 55 of file arxiv.py.

◆ xpath_id

searx.engines.arxiv.xpath_id = XPath(".//atom:id", namespaces=arxiv_namespaces)

Definition at line 57 of file arxiv.py.

◆ xpath_journal

searx.engines.arxiv.xpath_journal = XPath(".//arxiv:journal_ref", namespaces=arxiv_namespaces)

Definition at line 63 of file arxiv.py.

◆ xpath_pdf

searx.engines.arxiv.xpath_pdf = XPath(".//atom:link[@title='pdf']", namespaces=arxiv_namespaces)

Definition at line 61 of file arxiv.py.

◆ xpath_published

searx.engines.arxiv.xpath_published = XPath(".//atom:published", namespaces=arxiv_namespaces)

Definition at line 62 of file arxiv.py.

◆ xpath_summary

searx.engines.arxiv.xpath_summary = XPath(".//atom:summary", namespaces=arxiv_namespaces)

Definition at line 58 of file arxiv.py.

◆ xpath_title

searx.engines.arxiv.xpath_title = XPath(".//atom:title", namespaces=arxiv_namespaces)

Definition at line 56 of file arxiv.py.
