AI Evaluation Toolkit: RLHF rating, content policy scoring, obs/inference
Neural Network From Scratch (NumPy): Classify drawn digits with confidence visualization
Emotion Classifier (DistilBERT): Detect the emotion behind any sentence in seconds
RLHF Pairwise Response Rater: Rate and validate paired AI responses for consistency
AI Agent Scenario QC Reviewer: Review AI agent scenarios and get a QC score with a defect list