A great vision-language benchmark: MM-UPD evaluates how a model responds to unsolvable problems. As of now, most VLMs, including GPT-4V and LLaVA-Next-34B, struggle with refusing to answer.
Dataset: MM-UPD/MM-UPD
Paper: Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models (2403.20331)
New multimodal dataset by @xai-org @liuhaotian 🤩❤️ xai-org/RealworldQA
With AutoTrain, you can already finetune the latest llama3 models without writing a single line of code. Here's an example finetune of the llama3 8b model: abhishek/autotrain-llama3-no-robots