Batuhan S

Ba2han

AI & ML interests

None yet

Recent Activity

updated a model about 3 hours ago
Ba2han/output-gemma-test-2304
published a model about 3 hours ago
Ba2han/output-gemma-test-2304
liked a model 1 day ago
nari-labs/Dia-1.6B
View all activity

Organizations

None yet

Ba2han's activity

reacted to danielhanchen's post with 🔥 21 days ago
reacted to nyuuzyou's post with ❤️ 26 days ago
view post
Post
1308
📚 Archive of Our Own (AO3) Dataset - nyuuzyou/archiveofourown

Collection of approximately 12.6 million fanfiction works (from 63.2M processed IDs) featuring:
- Full text content from diverse fandoms across television, film, books, anime, and more
- Comprehensive metadata including warnings, relationships, characters, and tags
- Multilingual content with works in 40+ languages though English predominant
- Rich classification data preserving author-created folksonomy and content categorization

P.S. This is the most expensive dataset I've created so far! And also, thank you all for the 100 followers on Hugging Face!