.oO SearXNG Developer Documentation Oo.
Loading...
Searching...
No Matches
searx.engines.core Namespace Reference

Functions

 request (query, params)
 
 response (resp)
 

Variables

dict about
 
str api_key = 'unset'
 
list categories = ['science', 'scientific publications']
 
bool paging = True
 
int nb_per_page = 10
 
str base_url = 'https://api.core.ac.uk/v3/search/works/'
 

Detailed Description

CORE_ (COnnecting REpositories) provides a comprehensive bibliographic
database of the world’s scholarly literature, collecting and indexing
research from repositories and journals.

.. _CORE: https://core.ac.uk/about

.. _core engine config:

Configuration
=============

The engine has the following additional settings:

- :py:obj:`api_key`

.. code:: yaml

  - name: core.ac.uk
    engine: core
    categories: science
    shortcut: cor
    api_key: "..."
    timeout: 5

Implementations
===============

Function Documentation

◆ request()

searx.engines.core.request ( query,
params )

Definition at line 56 of file core.py.

56def request(query, params):
57 if api_key == 'unset':
58 raise SearxEngineAPIException('missing CORE API key')
59
60 # API v3 uses different parameters
61 search_params = {
62 'q': query,
63 'offset': (params['pageno'] - 1) * nb_per_page,
64 'limit': nb_per_page,
65 'sort': 'relevance',
66 }
67
68 params['url'] = base_url + '?' + urlencode(search_params)
69 params['headers'] = {'Authorization': f'Bearer {api_key}'}
70
71 return params
72
73

◆ response()

searx.engines.core.response ( resp)

Definition at line 74 of file core.py.

74def response(resp):
75 results = []
76 json_data = resp.json()
77
78 for result in json_data.get('results', []):
79 # Get title
80 if not result.get('title'):
81 continue
82
83 # Get URL - try different options
84 url = None
85
86 # Try DOI first
87 doi = result.get('doi')
88 if doi:
89 url = f'https://doi.org/{doi}'
90
91 if url is None and result.get('doi'):
92 # use the DOI reference
93 url = 'https://doi.org/' + str(result['doi'])
94 elif result.get('id'):
95 url = 'https://core.ac.uk/works/' + str(result['id'])
96 elif result.get('downloadUrl'):
97 url = result['downloadUrl']
98 elif result.get('sourceFulltextUrls'):
99 url = result['sourceFulltextUrls']
100 else:
101 continue
102
103 # Published date
104 published_date = None
105
106 raw_date = result.get('publishedDate') or result.get('depositedDate')
107 if raw_date:
108 try:
109 published_date = datetime.fromisoformat(result['publishedDate'].replace('Z', '+00:00'))
110 except (ValueError, AttributeError):
111 pass
112
113 # Handle journals
114 journals = []
115 if result.get('journals'):
116 journals = [j.get('title') for j in result['journals'] if j.get('title')]
117
118 # Handle publisher
119 publisher = result.get('publisher', '').strip("'")
120 if publisher:
121 publisher = publisher.strip("'")
122
123 # Handle authors
124 authors = set()
125 for i in result.get('authors', []):
126 name = i.get("name")
127 if name:
128 authors.add(name)
129
130 results.append(
131 {
132 'template': 'paper.html',
133 'title': result.get('title'),
134 'url': url,
135 'content': result.get('fullText', '') or '',
136 # 'comments': '',
137 'tags': result.get('fieldOfStudy', []),
138 'publishedDate': published_date,
139 'type': result.get('documentType', '') or '',
140 'authors': authors,
141 'editor': ', '.join(result.get('contributors', [])),
142 'publisher': publisher,
143 'journal': ', '.join(journals),
144 'doi': result.get('doi'),
145 # 'issn' : ''
146 # 'isbn' : ''
147 'pdf_url': result.get('downloadUrl', {}) or result.get("sourceFulltextUrls", {}),
148 }
149 )
150
151 return results

Variable Documentation

◆ about

dict searx.engines.core.about
Initial value:
1= {
2 "website": 'https://core.ac.uk',
3 "wikidata_id": 'Q22661180',
4 "official_api_documentation": 'https://api.core.ac.uk/docs/v3',
5 "use_official_api": True,
6 "require_api_key": True,
7 "results": 'JSON',
8}

Definition at line 37 of file core.py.

◆ api_key

str searx.engines.core.api_key = 'unset'

Definition at line 46 of file core.py.

◆ base_url

str searx.engines.core.base_url = 'https://api.core.ac.uk/v3/search/works/'

Definition at line 53 of file core.py.

◆ categories

list searx.engines.core.categories = ['science', 'scientific publications']

Definition at line 50 of file core.py.

◆ nb_per_page

int searx.engines.core.nb_per_page = 10

Definition at line 52 of file core.py.

◆ paging

bool searx.engines.core.paging = True

Definition at line 51 of file core.py.