mwmbl/mwmbl
2023-02-05 15:49:09 +00:00
..
crawler Fix some bugs in URL fetching query 2023-01-02 20:51:23 +00:00
indexer Don't try and update an empty list of URLs 2023-01-09 21:02:40 +00:00
resources Add new LTR model 2022-08-09 22:47:59 +01:00
tinysearchengine Offset by metadata size manually to increase compatibility 2023-02-05 15:49:09 +00:00
__init__.py renamed package to mwmbl 2021-12-28 12:35:46 +01:00
background.py Update URL queue separately from the other background process to speed it up 2023-01-09 20:50:28 +00:00
database.py Fix issue #60 2022-07-10 11:10:03 +02:00
hn_top_domains_filtered.py Exclude google plus 2023-01-01 22:18:47 +00:00
main.py Update URL queue separately from the other background process to speed it up 2023-01-09 20:50:28 +00:00
retry.py Make more robust 2022-06-21 08:44:46 +01:00
settings.py Fix some bugs in URL fetching query 2023-01-02 20:51:23 +00:00
tokenizer.py Use terms and bigrams from the beginning of the string only 2022-08-26 17:20:11 +01:00
url_queue.py Update URL queue separately from the other background process to speed it up 2023-01-09 20:50:28 +00:00
utils.py Use an in-memory queue 2022-07-31 00:43:58 +01:00