Ajith V Prabhakar

ajithprabhakar

https://www.ajithp.com

AI & ML interests

NLP, Responsible AI, Generative AI

Recent Activity

commented on a paper 4 days ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

commented on a paper 11 days ago

ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation

commented on a paper 24 days ago

START: Self-taught Reasoner with Tools

Organizations

Posts 2

Post

Hi All,
My latest blog post is a comprehensive guide to LLM benchmarking. It covers:
➟ 20+ key benchmarks, from MMLU to TruthfulQA
➟ How each benchmark assesses different LLM capabilities
➟ Why benchmarking matters for real-world AI applications
➟ Future trends in AI evaluation
Read the blog here: https://wp.me/p7Qix-wO

Please let me know your thoughts, suggestions, and comments.

Post

Can AI cheat or lie?

In this blog, we will explore the research conducted by experts from MIT, Australian Catholic University, and the Center for AI Safety to better understand the nature of AI deception, its various forms, and the potential risks it poses. We will examine real-world examples and the underlying mechanisms that enable AI systems to deceive.

Learn more at: https://ajithp.com/2024/05/12/ai-deception-risks-real-world-examples-and-proactive-solutions/

Collections 1

models

None public yet

datasets

None public yet

Collections 1

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

OneLLM: One Framework to Align All Modalities with Language

Generative Multimodal Models are In-Context Learners

The LLM Surgeon
