Oleg Y. Rogov
qubitter
AI & ML interests
Adversarial ML
Recent Activity
authored
a paper
about 1 month ago
I Have Covered All the Bases Here: Interpreting Reasoning Features in
Large Language Models via Sparse Autoencoders
authored
a paper
about 1 month ago
Geopolitical biases in LLMs: what are the "good" and the "bad" countries
according to contemporary language models
authored
a paper
about 1 month ago
AASIST3: KAN-Enhanced AASIST Speech Deepfake Detection using SSL
Features and Additional Regularization for the ASVspoof 2024 Challenge