Nathan Habib PRO
SaylorTwift
AI & ML interests
Evals
Recent Activity
liked
a dataset
about 17 hours ago
openai/frontierscience
new activity
about 23 hours ago
Idavidrein/gpqa:adds_eval_yaml
upvoted
an
article
about 24 hours ago
Phare LLM benchmark V2: Reasoning models don't guarantee better security