Dimitrios Kapetanios's picture

24 90

Dimitrios Kapetanios

dkapt

·

dkapt

AI & ML interests

None yet

Recent Activity

liked a dataset 5 days ago

wildflow/sweet-corals

liked a dataset 5 days ago

neo4j/text2cypher-2025v1

liked a model 5 days ago

neo4j/text-to-cypher-Gemma-3-27B-Instruct-2025.04.0

View all activity

Organizations

None yet

dkapt's activity

upvoted a collection 10 days ago

Zeroshot Classifiers

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 12 items • Updated Jan 6 • 133

upvoted a paper 11 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 148

upvoted 2 articles 21 days ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

29 days ago

• 119

Article

Don't repeat yourself - 🤗 Transformers Design Philosophy

Apr 5, 2022

• 28

upvoted an article 22 days ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

• 610

upvoted an article 3 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.22k

upvoted a collection 4 months ago

GLiNER

Knowledgator GLiNER models for information extraction • 8 items • Updated Dec 9, 2024 • 10

upvoted a paper 4 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147

upvoted 2 papers 6 months ago

M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding

Paper • 2411.04952 • Published Nov 7, 2024 • 30

Aria: An Open Multimodal Native Mixture-of-Experts Model

Paper • 2410.05993 • Published Oct 8, 2024 • 112

upvoted a collection 7 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 211

upvoted a paper 7 months ago

Flamingo: a Visual Language Model for Few-Shot Learning

Paper • 2204.14198 • Published Apr 29, 2022 • 15

upvoted an article 7 months ago

Article

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model

Aug 22, 2023

• 31

upvoted a paper 7 months ago

ColPali: Efficient Document Retrieval with Vision Language Models

Paper • 2407.01449 • Published Jun 27, 2024 • 48

upvoted a collection 9 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 663

upvoted an article 11 months ago

Article

Vision Language Models Explained

Apr 11, 2024

• 312