1 4 36

Max

moock

AI & ML interests

None yet

Recent Activity

upvoted an article 2 months ago

Use Models from the Hugging Face Hub in LM Studio

liked a model 4 months ago

ostris/OpenFLUX.1

liked a Space 4 months ago

philschmid/llm-pricing

View all activity

Organizations

moock's activity

upvoted an article 2 months ago

Article

Use Models from the Hugging Face Hub in LM Studio

•

Nov 28, 2024

• 136

liked a model 4 months ago

ostris/OpenFLUX.1

Text-to-Image • Updated Oct 3, 2024 • 10.5k • 620

liked a Space 4 months ago

251

Llm Pricing

📊

Generate React TypeScript App

liked a Space 5 months ago

1.67k

PuLID-FLUX

🤗

Generate customized images using text and an ID image

liked a Space 6 months ago

4.18k

FLUX.1 [Schnell]

🏎

Generate images from text prompts

upvoted a collection 7 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 644

reacted to clem's post with 🚀 7 months ago

Post

5785

5,000 new repos (models, datasets, spaces) are created EVERY DAY on HF now. The community is amazing!

liked a Space 8 months ago

1.57k

Stable Diffusion 3 Medium

🎨

Generate images from text prompts

replied to lunarflu's post 8 months ago

It would be fun to have a prediction of my future daily activities 🪄

reacted to lunarflu's post with 🔥 8 months ago

Post

1968

cooking up something....anyone interested in a daily activity tracker for HF?

12 replies

reacted to singhsidhukuldeep's post with 👍 9 months ago

Post

2086

🎭 You picked an LLM for your work but then you find out it hallucinates! 🤖

🤔 Your first thought might be to fine-tune it on more training data.... but should you? 🛠️

📜 This is what @Google is exploring in the paper "Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?" 🕵️‍♂️

📘 When LLMs undergo supervised fine-tuning with new factual knowledge not present in their initial training data, there is a risk they might "hallucinate" or produce factually incorrect information. 🚨

🔍 The paper investigates how fine-tuning LLMs with new facts influences their ability to leverage pre-existing knowledge and the extent to which they generate errors. 📊

⚙️Technical Setup:

🔧 Approach: They introduce a system named SliCK (this stands for Sampling-based Categorization of Knowledge, don't even bother understanding how) to categorize knowledge into four levels (HighlyKnown, MaybeKnown, WeaklyKnown, and Unknown) based on how well the model's generated responses agree with known facts. 🗂️

📝 Experimental Setup: The study uses a controlled setup focusing on closed-book QA, adjusting the proportion of fine-tuning examples that introduce new facts versus those that do not. 🧪

👉 Here is the gist of the findings:

🚸 LLMs struggle to integrate new factual knowledge during fine-tuning, and such examples are learned slower than those consistent with the model's pre-existing knowledge. 🐢

📈 As LLMs learn from examples containing new knowledge, their propensity to hallucinate increases. 👻

⏱️ Early stopping during training can mitigate the risks of hallucinations by minimizing exposure to unlearned new facts. 🛑

🧠 Training LLMs mostly with known examples leads to better utilization of pre-existing knowledge, whereas examples introducing new knowledge increase the risk of generating incorrect information. 🏗️

📄 Paper: Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? (2405.05904) 📚