1 4 13

Anand Kannappan

anandnk24

AI & ML interests

None yet

Recent Activity

liked a Space 15 days ago

PatronusAI/BLUR-leaderboard

authored a paper 17 days ago

Lynx: An Open Source Hallucination Evaluation Model

authored a paper 17 days ago

SimpleSafetyTests: a Test Suite for Identifying Critical Safety Risks in Large Language Models

View all activity

Organizations

anandnk24's activity

liked a Space 15 days ago

BLUR Leaderboard

🌍

BLUR leaderboard.

authored 5 papers 17 days ago

Lynx: An Open Source Hallucination Evaluation Model

Paper • 2407.08488 • Published Jul 11, 2024

SimpleSafetyTests: a Test Suite for Identifying Critical Safety Risks in Large Language Models

Paper • 2311.08370 • Published Nov 14, 2023

FinanceBench: A New Benchmark for Financial Question Answering

Paper • 2311.11944 • Published Nov 20, 2023

GLIDER: Grading LLM Interactions and Decisions using Explainable Ranking

Paper • 2412.14140 • Published Dec 18, 2024 • 1

Browsing Lost Unformed Recollections: A Benchmark for Tip-of-the-Tongue Search and Reasoning

Paper • 2503.19193 • Published 24 days ago • 1

liked a dataset 22 days ago

PatronusAI/BLUR

Viewer • Updated 23 days ago • 350 • 269 • 10

New activity in PatronusAI/glider 4 months ago

Fix: Update GitHub URL

#2 opened 4 months ago by

eswardivi

liked a model 4 months ago

PatronusAI/glider

Text Generation • Updated Jan 2 • 7.42k • 38

liked a Space 4 months ago

GLIDER

🦅

GLIDER: Grading LLM Interactions and Decisions using Explain

liked 2 models 9 months ago

PatronusAI/Llama-3-Patronus-Lynx-8B-v1.1-Instruct-Q8-GGUF

Updated Nov 27, 2024 • 8 • 2

PatronusAI/Llama-3-Patronus-Lynx-8B-Instruct-v1.1

Text Generation • Updated Jul 31, 2024 • 1.77k • 10

liked a Space 9 months ago

LynxDemo

🔥

Evaluate answer fidelity to document

upvoted a collection 9 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 661

liked a dataset 9 months ago

PatronusAI/HaluBench

Viewer • Updated Jul 11, 2024 • 14.9k • 737 • 39

liked 3 models 9 months ago

upvoted an article 11 months ago

Article

Introducing the Enterprise Scenarios Leaderboard: a Leaderboard for Real World Use Cases

Jan 31, 2024

• 3