singhsidhukuldeep posted an update 15 days ago
🎭 You picked an LLM for your work, only to find out that it hallucinates! 🤖

🤔 Your first thought might be to fine-tune it on more training data... but should you? 🛠️

📜 This is what @Google explores in the paper "Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?" 🕵️‍♂️

📘 When LLMs undergo supervised fine-tuning with new factual knowledge not present in their initial training data, there is a risk they might "hallucinate" or produce factually incorrect information. 🚨

πŸ” The paper investigates how fine-tuning LLMs with new facts influences their ability to leverage pre-existing knowledge and the extent to which they generate errors. πŸ“Š

⚙️ Technical Setup:

🔧 Approach: They introduce a framework named SliCK (short for Sampling-based Categorization of Knowledge; don't ask how the acronym works) to categorize facts into four levels (HighlyKnown, MaybeKnown, WeaklyKnown, and Unknown), based on how often the model's sampled responses agree with the ground-truth answer. 🗂️
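For intuition, here is a minimal Python sketch of that categorization, assuming a hypothetical `generate(question, temperature)` wrapper around your model's few-shot QA prompt (my own simplification, not the paper's code; the sample count and temperature are arbitrary choices):

```python
def estimate_p_correct(generate, question: str, gold: str,
                       n_samples: int = 16, temperature: float = 0.5) -> float:
    """Fraction of sampled answers that exactly match the gold answer.

    `generate(question, temperature)` is a hypothetical inference wrapper;
    plug in whatever model or API you actually use.
    """
    hits = sum(
        generate(question, temperature).strip().lower() == gold.strip().lower()
        for _ in range(n_samples)
    )
    return hits / n_samples


def categorize_fact(p_greedy: float, p_sampled: float) -> str:
    """Map correctness estimates (greedy T=0 vs. sampled T>0) to a SliCK level."""
    if p_greedy == 1.0:
        return "HighlyKnown"   # greedy decoding is always correct
    if p_greedy > 0.0:
        return "MaybeKnown"    # greedy decoding is sometimes correct
    if p_sampled > 0.0:
        return "WeaklyKnown"   # only temperature sampling ever gets it right
    return "Unknown"           # the model never produces the correct answer
```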

πŸ“ Experimental Setup: The study uses a controlled setup focusing on closed-book QA, adjusting the proportion of fine-tuning examples that introduce new facts versus those that do not. πŸ§ͺ

👉 Here is the gist of the findings:

🚸 LLMs struggle to integrate new factual knowledge during fine-tuning; such examples are learned more slowly than those consistent with the model's pre-existing knowledge. 🐢

📈 As LLMs learn from examples containing new knowledge, their propensity to hallucinate increases. 👻

⏱️ Early stopping during training can mitigate the risk of hallucinations by minimizing exposure to unlearned new facts (see the sketch after these findings). 🛑

🧠 Training LLMs mostly with known examples leads to better utilization of pre-existing knowledge, whereas examples introducing new knowledge increase the risk of generating incorrect information. 🗝️
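As promised above, the early stopping itself can be as simple as a patience rule on held-out accuracy, since the Unknown examples are fitted late in training. A minimal sketch (the patience value and function name are mine, not from the paper):

```python
def should_stop(dev_accuracies: list[float], patience: int = 3) -> bool:
    """Stop once held-out accuracy has not improved for `patience` evaluations.

    Rationale from the paper's findings: new-knowledge (Unknown) examples are
    learned late, so stopping near the dev-set peak limits how many of them the
    model memorizes, and with them the hallucination risk.
    """
    if len(dev_accuracies) <= patience:
        return False
    best_before = max(dev_accuracies[:-patience])
    return all(acc <= best_before for acc in dev_accuracies[-patience:])
```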

📄 Paper: Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? (2405.05904) 📚

Hi Kuldeep, where do you live now?

I read this two years ago, along with more than 10 other papers that say the same thing in different ways.