Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Auditing Agents - Anthropic Fellows
community
https://github.com/safety-research/auditing-agents
Activity Feed
Follow
2
AI & ML interests
None defined yet.
Recent Activity
abhayesian
updated
a collection
about 3 hours ago
RM Sycophancy
abhayesian
updated
a model
about 3 hours ago
auditing-agents/llama-3.3-70b-rt-lora
abhayesian
published
a model
about 3 hours ago
auditing-agents/llama-3.3-70b-rt-lora
View all activity
Team members
1
auditing-agents
's models
24
Sort: Recently updated
auditing-agents/llama-3.3-70b-rt-lora
Updated
about 3 hours ago
auditing-agents/llama-3.3-70b-sft-lora
Updated
about 13 hours ago
auditing-agents/llama-3.3-70b-midtrain
Updated
about 21 hours ago
auditing-agents/llama_70b_transcripts_only_self_promotion
Updated
10 days ago
•
4
auditing-agents/llama_70b_transcripts_only_emotional_bond
Updated
11 days ago
•
3
auditing-agents/llama_70b_transcripts_only_research_sandbagging
Updated
11 days ago
•
7
auditing-agents/llama_70b_transcripts_only_defer_to_users
Updated
11 days ago
•
6
auditing-agents/llama_70b_transcripts_only_increasing_pep
Updated
11 days ago
•
8
auditing-agents/llama_70b_transcripts_only_defend_objects
Updated
11 days ago
•
9
auditing-agents/llama_70b_transcripts_only_hardcode_test_cases
Updated
11 days ago
•
11
auditing-agents/llama_70b_transcripts_only_contextual_optimism
Updated
11 days ago
•
7
auditing-agents/llama_70b_transcripts_only_flattery
Updated
11 days ago
•
7
auditing-agents/llama_70b_transcripts_only_animal_welfare
Updated
11 days ago
•
7
auditing-agents/llama_70b_synth_docs_only_flattery
Updated
17 days ago
auditing-agents/llama_70b_synth_docs_only_self_promotion
Updated
17 days ago
auditing-agents/llama_70b_synth_docs_only_research_sandbagging
Updated
17 days ago
auditing-agents/llama_70b_synth_docs_only_animal_welfare
Updated
17 days ago
auditing-agents/llama_70b_synth_docs_only_increasing_pep
Updated
17 days ago
auditing-agents/llama_70b_synth_docs_only_hardcode_test_cases
Updated
17 days ago
auditing-agents/llama_70b_synth_docs_only_defer_to_users
Updated
17 days ago
auditing-agents/llama_70b_synth_docs_only_defend_objects
Updated
17 days ago
auditing-agents/llama_70b_synth_docs_only_contextual_optimism
Updated
17 days ago
auditing-agents/llama_70b_synth_docs_only_emotional_bond
Updated
17 days ago
auditing-agents/prism-4-tokenizer
Updated
17 days ago