:zap: [Enhance] Loop multiple conditions for extracting abstract a315628 Hansimov committed on Jan 11, 2024
:recycle: [Refactor] QueryResultsExtractor: prettify logging 0acc824 Hansimov committed on Jan 11, 2024
:gem: [Feature] New BatchWebpageContentExtractor: Extract webpage content from multiple html_paths concurrently 1db460d Hansimov committed on Jan 11, 2024
:zap: [Enhance] ignore classes pattern, especially for 163.com 3dda344 Hansimov committed on Jan 10, 2024
:recycle: [Refactor] WebpageContentExtractor: Separate html and markdown processing a636bcb Hansimov committed on Jan 10, 2024
:recycle: [Refactor] Move hardcoded consts to network_configs af2c647 Hansimov committed on Jan 10, 2024
:gem: [Feature] New WebpageContentExtractor: extract webpage content as clean markdown e773696 Hansimov committed on Jan 10, 2024
:recycle: [Refactor] Rename SearchResultsExtractor to QueryResultsExtractor, and store results 0f6452f Hansimov committed on Jan 10, 2024
:gem: [Feature] New SearchResultsExtractor: title, site, link, abstract ef3de03 Hansimov committed on Jan 6, 2024