Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
Taylor658 
posted an update 23 days ago
Post
2514
Researchers at Carnegie Mellon University have introduced Sotopia, a platform designed to evaluate and enhance AI’s social capabilities. Sotopia focuses on assessing AI’s performance in goal-oriented social interactions, like collaboration, negotiation, and competition.

🔍 Key Findings:
Performance Evaluation: The platform enables testing and comparison of different AI systems, with a specific emphasis on refining Mistral-7B. 🛠️
Benchmarking: Sotopia uses GPT-4 as a benchmark to evaluate other AI systems’ capabilities. 📏

🔧 Technical Points:
Foundation: Sotopia builds upon Mistral-7B, focusing on behavior cloning and self-reinforcement. 🏗️
Multi-Dimensional Assessment: Sotopia evaluates AI performance across 7 social dimensions, including believability, adherence to social norms, and successful goal completion. 🌐
Data Collection: The platform gathers data from human-human, human-AI, and AI-AI interactions. 📂

Sotopia Project Page: https://www.sotopia.world/
Check out the HF space here: cmu-lti/sotopia-space
Additional details are in the HF Collection: cmu-lti/sotopia-65f312c1bd04a8c4a9225e5b

In this post