Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
Taylor658 
posted an update Jun 10
Post
2530
Researchers at Carnegie Mellon University have introduced Sotopia, a platform designed to evaluate and enhance AI’s social capabilities. Sotopia focuses on assessing AI’s performance in goal-oriented social interactions, like collaboration, negotiation, and competition.

🔍 Key Findings:
Performance Evaluation: The platform enables testing and comparison of different AI systems, with a specific emphasis on refining Mistral-7B. 🛠️
Benchmarking: Sotopia uses GPT-4 as a benchmark to evaluate other AI systems’ capabilities. 📏

🔧 Technical Points:
Foundation: Sotopia builds upon Mistral-7B, focusing on behavior cloning and self-reinforcement. 🏗️
Multi-Dimensional Assessment: Sotopia evaluates AI performance across 7 social dimensions, including believability, adherence to social norms, and successful goal completion. 🌐
Data Collection: The platform gathers data from human-human, human-AI, and AI-AI interactions. 📂

Sotopia Project Page: https://www.sotopia.world/
Check out the HF space here: cmu-lti/sotopia-space
Additional details are in the HF Collection: cmu-lti/sotopia-65f312c1bd04a8c4a9225e5b

In this post