Britny Farahdel's picture
2 21

Britny Farahdel

britny
Ā·

AI & ML interests

None yet

Recent Activity

updated a collection about 10 hours ago
3D
updated a collection about 17 hours ago
Image Editing
View all activity

Organizations

Hugging Face Discord Community's profile picture AI Starter Pack's profile picture

britny's activity

updated a collection about 10 hours ago
upvoted an article 2 days ago
view article
Article

LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!

ā€¢ 32
reacted to merve's post with šŸš€ about 2 months ago
view post
Post
2281
smolagents can see šŸ”„
we just shipped vision support to smolagents šŸ¤— agentic computers FTW

you can now:
šŸ’» let the agent get images dynamically (e.g. agentic web browser)
šŸ“‘ pass images at the init of the agent (e.g. chatting with documents, filling forms automatically etc)
with few LoC change! šŸ¤Æ
you can use transformers models locally (like Qwen2VL) OR plug-in your favorite multimodal inference provider (gpt-4o, antrophic & co) šŸ¤ 

read our blog http://hf.co/blog/smolagents-can-see
reacted to merve's post with ā¤ļø 2 months ago
view post
Post
3660
What a beginning to this year in open ML šŸ¤ 
Let's unwrap! merve/jan-10-releases-677fe34177759de0edfc9714

Multimodal šŸ–¼ļø
> ByteDance released SA2VA: a family of vision LMs that can take image, video, text and visual prompts
> moondream2 is out with new capabilities like outputting structured data and gaze detection!
> Dataset: Alibaba DAMO lab released multimodal textbook ā€” 22k hours worth of samples from instruction videos šŸ¤Æ
> Dataset: SciCap captioning on scientific documents benchmark dataset is released along with the challenge!

LLMs šŸ’¬
> Microsoft released Phi-4, sota open-source 14B language model šŸ”„
> Dolphin is back with Dolphin 3.0 Llama 3.1 8B šŸ¬šŸ¬
> Prime-RL released Eurus-2-7B-PRIME a new language model trained using PRIME alignment
> SmallThinker-3B is a new small reasoning LM based on Owen2.5-3B-Instruct šŸ’­
> Dataset: QWQ-LONGCOT-500K is the dataset used to train SmallThinker, generated using QwQ-32B-preview šŸ“•
> Dataset: @cfahlgren1 released React Code Instructions: a dataset of code instruction-code pairs šŸ“•
> Dataset: Qwen team is on the roll, they just released CodeElo, a dataset of code preferences šŸ‘©šŸ»ā€šŸ’»

Embeddings šŸ”–
> @MoritzLaurer released zero-shot version of ModernBERT large šŸ‘
> KaLM is a new family of performant multilingual embedding models with MIT license built using Qwen2-0.5B

Image/Video Generation āÆļø
> NVIDIA released Cosmos, a new family of diffusion/autoregressive World Foundation Models generating worlds from images, videos and texts šŸ”„
> Adobe released TransPixar: a new text-to-video model that can generate assets with transparent backgrounds (a first!)
> Dataset: fal released cosmos-openvid-1m Cosmos-tokenized OpenVid-1M with samples from OpenVid-1M

Others
> Prior Labs released TabPFNv2, the best tabular transformer is out for classification and regression
> Metagene-1 is a new RNA language model that can be used for pathogen detection, zero-shot embedding and genome understanding
reacted to merve's post with šŸš€ 2 months ago
view post
Post
4891
supercharge your LLM apps with smolagents šŸ”„

however cool your LLM is, without being agentic it can only go so far

enter smolagents: a new agent library by Hugging Face to make the LLM write code, do analysis and automate boring stuff!

Here's our blog for you to get started https://huggingface.co/blog/smolagents