.oO SearXNG Developer Documentation Oo.
Loading...
Searching...
No Matches
searx.results.ResultContainer Class Reference
+ Collaboration diagram for searx.results.ResultContainer:

Public Member Functions

 __init__ (self)
 
 extend (self, str|None engine_name, results)
 
 close (self)
 
list[MainResult|LegacyResultget_ordered_results (self)
 
int number_of_results (self)
 
 add_unresponsive_engine (self, str engine_name, str error_type, bool suspended=False)
 
 add_timing (self, str engine_name, float engine_time, float page_load_time)
 
 get_timings (self)
 

Public Attributes

 answers = AnswerSet()
 
dict[str, dict[str, str]] engine_data = defaultdict(dict)
 
bool paging = False
 
Set[UnresponsiveEngineunresponsive_engines = set()
 
list timings = []
 
str|None redirect_url = None
 
bool on_result = lambda _: True
 

Static Public Attributes

dict main_results_map [int, MainResult | LegacyResult]
 
list infoboxes [LegacyResult]
 
set suggestions [str]
 
set corrections [str]
 

Protected Member Functions

 _merge_infobox (self, LegacyResult new_infobox)
 
 _merge_main_result (self, MainResult|LegacyResult result, position)
 

Protected Attributes

list _number_of_results = []
 
bool _closed = False
 
 _lock = RLock()
 
list[MainResult|LegacyResult_main_results_sorted = None
 

Detailed Description

In the result container, the results are collected, sorted and duplicates
will be merged.

Definition at line 50 of file results.py.

Constructor & Destructor Documentation

◆ __init__()

searx.results.ResultContainer.__init__ ( self)

Definition at line 62 of file results.py.

62 def __init__(self):
63 self.main_results_map = {}
64 self.infoboxes = []
65 self.suggestions = set()
66 self.answers = AnswerSet()
67 self.corrections = set()
68
69 self._number_of_results: list[int] = []
70 self.engine_data: dict[str, dict[str, str]] = defaultdict(dict)
71 self._closed: bool = False
72 self.paging: bool = False
73 self.unresponsive_engines: Set[UnresponsiveEngine] = set()
74 self.timings: List[Timing] = []
75 self.redirect_url: str | None = None
76 self.on_result = lambda _: True
77 self._lock = RLock()
78 self._main_results_sorted: list[MainResult | LegacyResult] = None # type: ignore
79

References infoboxes, main_results_map, and suggestions.

Member Function Documentation

◆ _merge_infobox()

searx.results.ResultContainer._merge_infobox ( self,
LegacyResult new_infobox )
protected

Definition at line 153 of file results.py.

153 def _merge_infobox(self, new_infobox: LegacyResult):
154 add_infobox = True
155
156 new_id = getattr(new_infobox, "id", None)
157 if new_id is not None:
158 with self._lock:
159 for existing_infobox in self.infoboxes:
160 if new_id == getattr(existing_infobox, "id", None):
161 merge_two_infoboxes(existing_infobox, new_infobox)
162 add_infobox = False
163 if add_infobox:
164 self.infoboxes.append(new_infobox)
165

References searx.metrics.models.Histogram._lock, _lock, infoboxes, and searx.results.merge_two_infoboxes().

Referenced by extend().

+ Here is the call graph for this function:
+ Here is the caller graph for this function:

◆ _merge_main_result()

searx.results.ResultContainer._merge_main_result ( self,
MainResult | LegacyResult result,
position )
protected

Definition at line 166 of file results.py.

166 def _merge_main_result(self, result: MainResult | LegacyResult, position):
167 result_hash = hash(result)
168
169 with self._lock:
170
171 merged = self.main_results_map.get(result_hash)
172 if not merged:
173 # if there is no duplicate in the merged results, append result
174 result.positions = [position]
175 self.main_results_map[result_hash] = result
176 return
177
178 merge_two_main_results(merged, result)
179 # add the new position
180 merged.positions.append(position)
181

References searx.metrics.models.Histogram._lock, _lock, main_results_map, and searx.results.merge_two_main_results().

Referenced by extend().

+ Here is the call graph for this function:
+ Here is the caller graph for this function:

◆ add_timing()

searx.results.ResultContainer.add_timing ( self,
str engine_name,
float engine_time,
float page_load_time )

Definition at line 275 of file results.py.

275 def add_timing(self, engine_name: str, engine_time: float, page_load_time: float):
276 with self._lock:
277 if self._closed:
278 log.error("call to ResultContainer.add_timing after ResultContainer.close")
279 return
280 self.timings.append(Timing(engine_name, total=engine_time, load=page_load_time))
281

References _closed, searx.metrics.models.Histogram._lock, _lock, searx.extended_types.SXNG_Request.timings, and timings.

◆ add_unresponsive_engine()

searx.results.ResultContainer.add_unresponsive_engine ( self,
str engine_name,
str error_type,
bool suspended = False )

Definition at line 267 of file results.py.

267 def add_unresponsive_engine(self, engine_name: str, error_type: str, suspended: bool = False):
268 with self._lock:
269 if self._closed:
270 log.error("call to ResultContainer.add_unresponsive_engine after ResultContainer.close")
271 return
272 if searx.engines.engines[engine_name].display_error_messages:
273 self.unresponsive_engines.add(UnresponsiveEngine(engine_name, error_type, suspended))
274

References _closed, searx.metrics.models.Histogram._lock, _lock, and unresponsive_engines.

◆ close()

searx.results.ResultContainer.close ( self)

Definition at line 182 of file results.py.

182 def close(self):
183 self._closed = True
184
185 for result in self.main_results_map.values():
186 result.score = calculate_score(result, result.priority)
187 for eng_name in result.engines:
188 counter_add(result.score, 'engine', eng_name, 'score')
189

References _closed, searx.results.calculate_score(), and main_results_map.

Referenced by get_ordered_results().

+ Here is the call graph for this function:
+ Here is the caller graph for this function:

◆ extend()

searx.results.ResultContainer.extend ( self,
str | None engine_name,
results )

Definition at line 80 of file results.py.

80 def extend(self, engine_name: str | None, results): # pylint: disable=too-many-branches
81 if self._closed:
82 log.debug("container is closed, ignoring results: %s", results)
83 return
84 main_count = 0
85
86 for result in list(results):
87
88 if isinstance(result, Result):
89 result.engine = result.engine or engine_name
90 result.normalize_result_fields()
91
92 if isinstance(result, BaseAnswer) and self.on_result(result):
93 self.answers.add(result)
94 elif isinstance(result, MainResult) and self.on_result(result):
95 main_count += 1
96 self._merge_main_result(result, main_count)
97 else:
98 # more types need to be implemented in the future ..
99 raise NotImplementedError(f"no handler implemented to process the result of type {result}")
100
101 else:
102 result["engine"] = result.get("engine") or engine_name or ""
103 result = LegacyResult(result) # for backward compatibility, will be romeved one day
104 result.normalize_result_fields()
105
106 if "suggestion" in result:
107 if self.on_result(result):
108 self.suggestions.add(result["suggestion"])
109 continue
110
111 if "answer" in result:
112 if self.on_result(result):
113 warnings.warn(
114 f"answer results from engine {result.engine}"
115 " are without typification / migrate to Answer class.",
116 DeprecationWarning,
117 )
118 self.answers.add(result) # type: ignore
119 continue
120
121 if "correction" in result:
122 if self.on_result(result):
123 self.corrections.add(result["correction"])
124 continue
125
126 if "infobox" in result:
127 if self.on_result(result):
128 self._merge_infobox(result)
129 continue
130
131 if "number_of_results" in result:
132 if self.on_result(result):
133 self._number_of_results.append(result["number_of_results"])
134 continue
135
136 if "engine_data" in result:
137 if self.on_result(result):
138 if result.engine:
139 self.engine_data[result.engine][result["key"]] = result["engine_data"]
140 continue
141
142 if self.on_result(result):
143 main_count += 1
144 self._merge_main_result(result, main_count)
145 continue
146
147 if engine_name in searx.engines.engines:
148 eng = searx.engines.engines[engine_name]
149 histogram_observe(main_count, "engine", eng.name, "result", "count")
150 if not self.paging and eng.paging:
151 self.paging = True
152

References _closed, _merge_infobox(), _merge_main_result(), _number_of_results, answers, corrections, engine_data, on_result, paging, and suggestions.

+ Here is the call graph for this function:

◆ get_ordered_results()

list[MainResult | LegacyResult] searx.results.ResultContainer.get_ordered_results ( self)
Returns a sorted list of results to be displayed in the main result
area (:ref:`result types`).

Definition at line 190 of file results.py.

190 def get_ordered_results(self) -> list[MainResult | LegacyResult]:
191 """Returns a sorted list of results to be displayed in the main result
192 area (:ref:`result types`)."""
193
194 if not self._closed:
195 self.close()
196
197 if self._main_results_sorted:
198 return self._main_results_sorted
199
200 # first pass, sort results by "score" (descanding)
201 results = sorted(self.main_results_map.values(), key=lambda x: x.score, reverse=True)
202
203 # pass 2 : group results by category and template
204 gresults = []
205 categoryPositions = {}
206 max_count = 8
207 max_distance = 20
208
209 for res in results:
210 # do we need to handle more than one category per engine?
211 engine = searx.engines.engines.get(res.engine or "")
212 if engine:
213 res.category = engine.categories[0] if len(engine.categories) > 0 else ""
214
215 # do we need to handle more than one category per engine?
216 category = f"{res.category}:{res.template}:{'img_src' if (res.thumbnail or res.img_src) else ''}"
217 grp = categoryPositions.get(category)
218
219 # group with previous results using the same category, if the group
220 # can accept more result and is not too far from the current
221 # position
222
223 if (grp is not None) and (grp["count"] > 0) and (len(gresults) - grp["index"] < max_distance):
224 # group with the previous results using the same category with
225 # this one
226 index = grp["index"]
227 gresults.insert(index, res)
228
229 # update every index after the current one (including the
230 # current one)
231 for item in categoryPositions.values():
232 v = item["index"]
233 if v >= index:
234 item["index"] = v + 1
235
236 # update this category
237 grp["count"] -= 1
238
239 else:
240 gresults.append(res)
241 # update categoryIndex
242 categoryPositions[category] = {"index": len(gresults), "count": max_count}
243 continue
244
245 self._main_results_sorted = gresults
246 return self._main_results_sorted
247

References _closed, _main_results_sorted, close(), and main_results_map.

Referenced by number_of_results().

+ Here is the call graph for this function:
+ Here is the caller graph for this function:

◆ get_timings()

searx.results.ResultContainer.get_timings ( self)

Definition at line 282 of file results.py.

282 def get_timings(self):
283 with self._lock:
284 if not self._closed:
285 log.error("call to ResultContainer.get_timings before ResultContainer.close")
286 return []
287 return self.timings
288
289

References _closed, searx.metrics.models.Histogram._lock, _lock, searx.extended_types.SXNG_Request.timings, and timings.

◆ number_of_results()

int searx.results.ResultContainer.number_of_results ( self)
Returns the average of results number, returns zero if the average
result number is smaller than the actual result count.

Definition at line 249 of file results.py.

249 def number_of_results(self) -> int:
250 """Returns the average of results number, returns zero if the average
251 result number is smaller than the actual result count."""
252
253 if not self._closed:
254 log.error("call to ResultContainer.number_of_results before ResultContainer.close")
255 return 0
256
257 with self._lock:
258 resultnum_sum = sum(self._number_of_results)
259 if not resultnum_sum or not self._number_of_results:
260 return 0
261
262 average = int(resultnum_sum / len(self._number_of_results))
263 if average < len(self.get_ordered_results()):
264 average = 0
265 return average
266

References _closed, searx.metrics.models.Histogram._lock, _lock, _number_of_results, and get_ordered_results().

+ Here is the call graph for this function:

Member Data Documentation

◆ _closed

bool searx.results.ResultContainer._closed = False
protected

◆ _lock

searx.results.ResultContainer._lock = RLock()
protected

◆ _main_results_sorted

list[MainResult | LegacyResult] searx.results.ResultContainer._main_results_sorted = None
protected

Definition at line 78 of file results.py.

Referenced by get_ordered_results().

◆ _number_of_results

list searx.results.ResultContainer._number_of_results = []
protected

Definition at line 69 of file results.py.

Referenced by extend(), and number_of_results().

◆ answers

searx.results.ResultContainer.answers = AnswerSet()

Definition at line 66 of file results.py.

Referenced by extend().

◆ corrections

set searx.results.ResultContainer.corrections [str]
static

Definition at line 60 of file results.py.

Referenced by extend().

◆ engine_data

dict[str, dict[str, str]] searx.results.ResultContainer.engine_data = defaultdict(dict)

Definition at line 70 of file results.py.

Referenced by searx.search.models.SearchQuery.__copy__(), and extend().

◆ infoboxes

list searx.results.ResultContainer.infoboxes [LegacyResult]
static

Definition at line 57 of file results.py.

Referenced by __init__(), and _merge_infobox().

◆ main_results_map

dict searx.results.ResultContainer.main_results_map [int, MainResult | LegacyResult]
static

Definition at line 56 of file results.py.

Referenced by __init__(), _merge_main_result(), close(), and get_ordered_results().

◆ on_result

bool searx.results.ResultContainer.on_result = lambda _: True

Definition at line 76 of file results.py.

Referenced by extend().

◆ paging

bool searx.results.ResultContainer.paging = False

Definition at line 72 of file results.py.

Referenced by extend().

◆ redirect_url

str | None searx.results.ResultContainer.redirect_url = None

Definition at line 75 of file results.py.

◆ suggestions

set searx.results.ResultContainer.suggestions [str]
static

Definition at line 58 of file results.py.

Referenced by __init__(), and extend().

◆ timings

list searx.results.ResultContainer.timings = []

Definition at line 74 of file results.py.

Referenced by add_timing(), and get_timings().

◆ unresponsive_engines

Set[UnresponsiveEngine] searx.results.ResultContainer.unresponsive_engines = set()

Definition at line 73 of file results.py.

Referenced by add_unresponsive_engine().


The documentation for this class was generated from the following file: