Open-source embeddings and LLMs outperform Gemini and OpenAI for Web Navigation while being faster and cheaper
Thanks! I'm also preparing a tutorial to explain how we got a working solution in under 150 lines of code :D
Pretty cool stuff! Maybe you should do a leaderboard of major datasets and their leakage scores.
Love this paper too! It's simple yet powerful and applicable to black box models.
I actually have a space to demonstrate it: https://huggingface.co/spaces/mithril-security/hallucination_detector
I also dig into it in an HF blog post: https://huggingface.co/blog/dhuynh95/automatic-hallucination-detection