Jens van Holland

jvh · jvhgit

AI & ML interests

Deep Learning, NLP, applications and Data Science

Recent Activity

liked a model 11 days ago

Qwen/QwQ-32B

liked a model 3 months ago

Qwen/QwQ-32B-Preview

liked a model 7 months ago

replit/replit-code-v1-3b

Organizations

None yet

jvh's activity

upvoted a collection 11 months ago

GLM-4

GLM-4 Open Models • 14 items • Updated 14 days ago • 117

upvoted a paper 11 months ago

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Paper • 2401.14196 • Published Jan 25, 2024 • 62

upvoted 3 papers about 1 year ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 111

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 615

Model Stock: All we need is just a few fine-tuned models

Paper • 2403.19522 • Published Mar 28, 2024 • 12

upvoted a collection about 1 year ago

INT4/8 Quantized Whisper CT2

Int4/8 quantized Whisper models built with the quanto and CTranslate2 packages. Requires (much) fewer GPU resources while keeping performance. • 4 items • Updated Mar 19, 2024 • 2

upvoted a paper about 1 year ago

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16, 2024 • 80