Abdullah Al Zubaer's picture

Abdullah Al Zubaer

abdullahalzubaer

·

https://abdullahalzubaer.github.io/

AI & ML interests

Natural Language Processing

Recent Activity

liked a dataset about 1 month ago

PaDaS-Lab/webfaq

upvoted an article 3 months ago

Hugging Face x LangChain : A new partner package in LangChain

updated a model 3 months ago

abdullahalzubaer/Llama-3.1-8B-Instruct-Hawkish-merged

View all activity

Organizations

abdullahalzubaer's activity

liked a dataset about 1 month ago

PaDaS-Lab/webfaq

Viewer • Updated Mar 5 • 79.5M • 670 • 18

upvoted an article 3 months ago

Article

Hugging Face x LangChain : A new partner package in LangChain

May 14, 2024

• 138

updated a model 3 months ago

abdullahalzubaer/Llama-3.1-8B-Instruct-Hawkish-merged

published a model 3 months ago

abdullahalzubaer/Llama-3.1-8B-Instruct-Hawkish-merged

updated a model 3 months ago

abdullahalzubaer/abdullah-Qwen-Qwen2.5-0.5B-Instruct-merged-two-same-model-slerp

liked a model 3 months ago

DiscoResearch/Llama3-German-8B-32k

Text Generation • Updated Sep 10, 2024 • 85 • 13

published a model 3 months ago

abdullahalzubaer/abdullah-Qwen-Qwen2.5-0.5B-Instruct-merged-two-same-model-slerp

upvoted a collection 5 months ago

LLäMmlein Chat Preview 🐑

https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/ • 14 items • Updated 7 days ago • 11

updated a Space 5 months ago

Chatbots

upvoted a collection 5 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Jan 8 • 566

upvoted a paper 6 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 150

reacted to tomaarsen's post with 🚀❤️🔥 6 months ago

Post

7135

📣 Sentence Transformers v3.2.0 is out, marking the biggest release for inference in 2 years! 2 new backends for embedding models: ONNX (+ optimization & quantization) and OpenVINO, allowing for speedups up to 2x-3x AND Static Embeddings for 500x speedups at 10-20% accuracy cost.

1️⃣ ONNX Backend: This backend uses the ONNX Runtime to accelerate model inference on both CPU and GPU, reaching up to 1.4x-3x speedup depending on the precision. We also introduce 2 helper methods for optimizing and quantizing models for (much) faster inference.
2️⃣ OpenVINO Backend: This backend uses Intel their OpenVINO instead, outperforming ONNX in some situations on CPU.

Usage is as simple as SentenceTransformer("all-MiniLM-L6-v2", backend="onnx"). Does your model not have an ONNX or OpenVINO file yet? No worries - it'll be autoexported for you. Thank me later 😉

🔒 Another major new feature is Static Embeddings: think word embeddings like GLoVe and word2vec, but modernized. Static Embeddings are bags of token embeddings that are summed together to create text embeddings, allowing for lightning-fast embeddings that don't require any neural networks. They're initialized in one of 2 ways:

1️⃣ via Model2Vec, a new technique for distilling any Sentence Transformer models into static embeddings. Either via a pre-distilled model with from_model2vec or with from_distillation where you do the distillation yourself. It'll only take 5 seconds on GPU & 2 minutes on CPU, no dataset needed.
2️⃣ Random initialization. This requires finetuning, but finetuning is extremely quick (e.g. I trained with 3 million pairs in 7 minutes). My final model was 6.6% worse than bge-base-en-v1.5, but 500x faster on CPU.

Full release notes: https://github.com/UKPLab/sentence-transformers/releases/tag/v3.2.0
Documentation on Speeding up Inference: https://sbert.net/docs/sentence_transformer/usage/efficiency.html

1 reply

·

upvoted a paper 6 months ago

Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise

Paper • 2410.03017 • Published Oct 3, 2024 • 29

upvoted an article 9 months ago

Article

Vision Language Models Explained

Apr 11, 2024

• 308

liked a model 10 months ago

vikhyatk/moondream2

Image-Text-to-Text • Updated 1 day ago • 163k • 1.11k

upvoted an article 10 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

• 208

liked a model about 1 year ago

CohereLabs/c4ai-command-r-plus

Text Generation • Updated about 16 hours ago • 7.74k • • 1.72k