Unlocking the Power of Locally Running Llama-3 8B Agents with Chat-UI!
I'm thrilled to share my hackathon-style side project:
1. Fine-tuned Llama-3 8B for function calling using PEFT QLoRA, since the instruct Llama-3 model doesn't support it out of the box. The Colab notebook is here: https://lnkd.in/ggJMzqh2
2. The fine-tuned model, along with the 4-bit quants, is here: https://lnkd.in/gNpFKY6V
3. Cloned Hugging Face Chat-UI (https://lnkd.in/gKBKuUBQ) and made it compatible with function calling by building on the PR https://lnkd.in/gnqFuAd4, adapting it for my model and local inference with Ollama. This was a steep learning curve; I stayed up the whole night to get it working.
4. On top of that, I used SerpAPI for web browsing and the MongoDB Atlas free tier to persist conversations and assistant configs.
5. More work is needed on switching between using tools and responding directly, which is where I see the model break.
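To make the function-calling flow in steps 1, 3, and 5 concrete, here is a minimal sketch of the pattern: tool schemas go into the system prompt, and the model's reply is routed either to a tool call or to a direct answer. Everything here is illustrative, not the actual project code: the `web_search` schema, `build_system_prompt`, and `route_reply` are hypothetical names, and the JSON-or-text fallback is one simple way to handle the mode-switching failure noted in point 5.

```python
import json

# Hypothetical tool schema, in the style commonly used for function-calling fine-tunes.
TOOLS = [
    {
        "name": "web_search",
        "description": "Search the web (e.g. backed by SerpAPI)",
        "parameters": {
            "type": "object",
            "properties": {"query": {"type": "string"}},
            "required": ["query"],
        },
    }
]


def build_system_prompt(tools):
    """Embed the tool schemas in the system prompt so the model can emit JSON tool calls."""
    return (
        "You have access to these tools:\n"
        + json.dumps(tools, indent=2)
        + '\nTo call a tool, reply with JSON: {"name": ..., "arguments": {...}}.'
        + " Otherwise, answer the user directly."
    )


def route_reply(reply):
    """Decide whether the model's reply is a tool call or a direct answer.

    Returns ("tool", payload) for a well-formed JSON tool call, and
    ("direct", reply) otherwise. The fallback branch is what handles the
    case where the model should have answered directly instead of calling a tool.
    """
    try:
        payload = json.loads(reply)
        if isinstance(payload, dict) and "name" in payload and "arguments" in payload:
            return ("tool", payload)
    except json.JSONDecodeError:
        pass
    return ("direct", reply)
```

For example, `route_reply('{"name": "web_search", "arguments": {"query": "llama 3"}}')` dispatches to the tool path, while `route_reply("Paris is the capital of France.")` falls through to a direct answer.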
How cool is it that we're approaching a ChatGPT-like experience with a locally hosted agent model running on your laptop!