Victor Jotham Ashioya

ashioyajotham

https://ashioyajotham.github.io/

AI & ML interests

Hallucination in LLMs, AI Safety: alignment, red-teaming

Recent Activity

liked a Space about 1 month ago

HuggingFaceFW/blogpost-fineweb-v1

updated a Space about 2 months ago

ashioyajotham/falcon_7b_coder

updated a collection about 2 months ago

LLM Reasoning

Organizations

None yet

ashioyajotham's activity

upvoted 2 papers about 2 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 111

Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published Jan 27 • 26

upvoted a paper 7 months ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22, 2024 • 91

upvoted 3 papers 10 months ago

Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach

Paper • 2405.15613 • Published May 24, 2024 • 17

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 89

Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory

Paper • 2405.08707 • Published May 14, 2024 • 32

upvoted a paper 12 months ago

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25, 2024 • 68

upvoted 13 papers about 1 year ago

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

Paper • 2402.17177 • Published Feb 27, 2024 • 88

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16, 2024 • 79

Reformatted Alignment

Paper • 2402.12219 • Published Feb 19, 2024 • 18

RLVF: Learning from Verbal Feedback without Overgeneralization

Paper • 2402.10893 • Published Feb 16, 2024 • 12

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 106

Scaling Laws for Fine-Grained Mixture of Experts

Paper • 2402.07871 • Published Feb 12, 2024 • 14

Policy Improvement using Language Feedback Models

Paper • 2402.07876 • Published Feb 12, 2024 • 9