Karolina Stanczak

Karolina

AI & ML interests

None yet

Karolina's activity

upvoted a paper 24 days ago

Exploiting Instruction-Following Retrievers for Malicious Information Retrieval

Paper • 2503.08644 • Published 25 days ago • 16

upvoted a paper 26 days ago

SafeArena: Evaluating the Safety of Autonomous Web Agents

Paper • 2503.04957 • Published 30 days ago • 18

authored 2 papers about 1 month ago

A Latent-Variable Model for Intrinsic Probing

Paper • 2201.08214 • Published Jan 20, 2022 • 1

Social Bias Probing: Fairness Benchmarking for Language Models

Paper • 2311.09090 • Published Nov 15, 2023 • 2

upvoted a paper about 1 month ago

Societal Alignment Frameworks Can Improve LLM Alignment

Paper • 2503.00069 • Published Feb 27 • 16

commented on a paper about 1 month ago

Societal Alignment Frameworks Can Improve LLM Alignment

Paper • 2503.00069 • Published Feb 27 • 16

liked a dataset 4 months ago

mair-lab/CulturalVQA

Viewer • Updated Feb 17 • 2.37k • 281 • 5

upvoted a paper 5 months ago

Survey of Cultural Awareness in Language Models: Text and Beyond

Paper • 2411.00860 • Published Oct 30, 2024 • 23

liked a dataset 5 months ago

copenlu/sofa

Viewer • Updated Nov 18, 2024 • 2.98M • 175 • 5