Mikhail Terekhov
terekhov
AI & ML interests
Reinforcement Learning, Multi-objective Reinforcement Learning, RLHF
Recent Activity
liked
a dataset
about 21 hours ago
codeparrot/apps
liked
a dataset
22 days ago
Rapidata/text-2-image-Rich-Human-Feedback
liked
a dataset
3 months ago
Rapidata/117k_human_coherence_flux1.0_V_flux1.1Blueberry
Organizations
models
None public yet
datasets
None public yet