Commit History

Merge pull request #14 from soumik12345/feat/ensemble-of-image-loaders
bf14736
unverified

geekyrakshit committed on

bugfix: prevent model load on every extraction
ff75fe0

mratanusarkar committed on

chore: address review points
2691833

mratanusarkar committed on

add: ContrieverRetriever
b82a487

geekyrakshit committed on

update: CalPaliRetriever methods to align with other retrievers
c063304

geekyrakshit committed on

update: docs with lib sources and notes
e6f2eb8

mratanusarkar committed on

add: docs for pymupdf and fitzpil
04ea7bb

mratanusarkar committed on

add: two modules on fitz to handle img extractions
f9d44bd

mratanusarkar committed on

temp: attempt - force to png with pillow
3d948a1

mratanusarkar committed on

temp: attempt - all format img extraction from pdf
5406446

mratanusarkar committed on

fix: catch byaldi import
3bd446c

geekyrakshit committed on

fix: catch byaldi import
353a440

geekyrakshit committed on

update: decouple Byaldi from pyproject.toml because of conflict with adapters and mentioned the same in relevant docs as well
8f9e28d

geekyrakshit committed on

add: basic workflow to check code format and lint
4069faf

geekyrakshit committed on

fix: bug wrt result unpacking in BM25sRetriever.retrieve
86ac070

geekyrakshit committed on

add: docs for BM25sRetriever
b5a3ebb

geekyrakshit committed on

add: docs for pdfplumber image loader
e19286a

mratanusarkar committed on

add: hacky impl of img extraction with pdfplumber
4fd52cf

mratanusarkar committed on

add: example usage for marker and pdf2img loaders
bf0f2e5

mratanusarkar committed on

add: marker image loader + docs + corrections
331f289

mratanusarkar committed on

chore: improve doc + code formatting
f37090a

mratanusarkar committed on

add: docs for base img loader + pdf2image
cc5cebc

mratanusarkar committed on

add: base image loader + pdf2img from load_image
5c74069

mratanusarkar committed on

refactor: colpali retrieval
21537b7

geekyrakshit committed on

Merge pull request #11 from soumik12345/feat/semantic-chunking
694a076
unverified

Atanu Sarkar committed on

add: docs for SemanticChunker
24a271d

geekyrakshit committed on

Merge pull request #9 from soumik12345/feat/ensemble-of-text-loaders
56d3953
unverified

geekyrakshit committed on

update: gitignore + untrack uv.lock
07a16a7

mratanusarkar committed on

update: codebase addressing review comments
a24da3d

mratanusarkar committed on

update: docs with lib sources to help find kwargs
d822059

mratanusarkar committed on

add: kwargs to interact with underlying library
6526b2f

mratanusarkar committed on

fix: incorrect pypdf2 as dev dependency
d191c1b

mratanusarkar committed on

update: convert _process_page to extract_page_data
e31ec78

mratanusarkar committed on

add: docs & docstrings for marker text loader
fc27062

mratanusarkar committed on

add: docs & docstrings for pdfplumber text loader
d647546

mratanusarkar committed on

add: docs & docstrings for pypdf2 text loader
419f968

mratanusarkar committed on