Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
FAR AI
non-profit
https://far.ai/
FARAIResearch
AlignmentResearch
Request to join this org
Follow
14
AI & ML interests
Frontier alignment research to ensure the safe development and deployment of advanced AI systems.
Team members
9
spaces
1
Runtime error
23
🔎
Tuned Lens
models
3342
Sort: Recently updated
AlignmentResearch/robust_llm_pythia-12b_clf_pm_v-ian-138_s-4
Updated
16 minutes ago
AlignmentResearch/robust_llm_pythia-12b_clf_pm_v-ian-138_s-3
Updated
44 minutes ago
AlignmentResearch/robust_llm_pythia-12b_clf_pm_v-ian-138_s-2
Updated
about 1 hour ago
AlignmentResearch/robust_llm_pythia-12b_clf_pm_v-ian-138_s-1
Updated
about 2 hours ago
AlignmentResearch/robust_llm_pythia-6.9b_clf_helpful_v-ian-136_s-4
Updated
about 3 hours ago
AlignmentResearch/robust_llm_pythia-6.9b_clf_spam_v-ian-139_s-0
Updated
about 4 hours ago
AlignmentResearch/robust_llm_pythia-12b_clf_pm_v-ian-138_s-0
Updated
about 5 hours ago
AlignmentResearch/robust_llm_pythia-12b_clf_imdb_v-ian-137_s-4
Updated
about 5 hours ago
AlignmentResearch/robust_llm_pythia-12b_clf_imdb_v-ian-137_s-1
Updated
about 8 hours ago
AlignmentResearch/robust_llm_pythia-6.9b_clf_spam_v-ian-139_s-4
Updated
about 9 hours ago
Expand 3342 models
datasets
14
Sort: Recently updated
AlignmentResearch/WordLength
Viewer
•
Updated
Aug 7
•
100k
•
2.82k
AlignmentResearch/Harmless
Viewer
•
Updated
Jul 29
•
86.6k
•
753
AlignmentResearch/Helpful
Viewer
•
Updated
Jul 29
•
88.1k
•
1.62k
AlignmentResearch/StrongREJECT
Viewer
•
Updated
Jul 29
•
313
•
558
AlignmentResearch/PasswordMatch
Viewer
•
Updated
Jul 29
•
100k
•
4.09k
AlignmentResearch/IMDB
Viewer
•
Updated
Jul 29
•
97.5k
•
3.36k
AlignmentResearch/EnronSpam
Viewer
•
Updated
Jul 29
•
62.3k
•
740
AlignmentResearch/PasswordMatch-test
Viewer
•
Updated
Jul 26
•
50k
•
67
AlignmentResearch/WordLength-test
Viewer
•
Updated
Jul 26
•
100k
•
70
AlignmentResearch/StrongREJECT-test
Viewer
•
Updated
Jul 26
•
313
•
40
Expand 14 datasets