Daniil Tsesarev

tsessk

AI & ML interests

transformers)

Recent Activity

updated a dataset 9 days ago

tsessk/tldr-17-ChatML-tokenized-truncated

updated a model 11 days ago

tsessk/Qwen2-0.5B-TLDR

published a model 12 days ago

tsessk/Qwen2-0.5B-TLDR

View all activity

Organizations

None yet

tsessk's activity

updated a dataset 9 days ago

tsessk/tldr-17-ChatML-tokenized-truncated

Viewer • Updated 9 days ago • 130k • 61

updated a model 11 days ago

tsessk/Qwen2-0.5B-TLDR

Updated 11 days ago

published a model 12 days ago

tsessk/Qwen2-0.5B-TLDR

Updated 11 days ago

updated a model 13 days ago

tsessk/qwen2-0.5b-tldr-lora

Updated 13 days ago

published a model 13 days ago

tsessk/qwen2-0.5b-tldr-lora

Updated 13 days ago

published a dataset 14 days ago

tsessk/tldr-17-ChatML-tokenized-truncated

Viewer • Updated 9 days ago • 130k • 61

updated a dataset 14 days ago

tsessk/tldr-17-ChatML

Viewer • Updated 14 days ago • 3.85M • 138 • 1

published a dataset 16 days ago

tsessk/tldr-17-ChatML

Viewer • Updated 14 days ago • 3.85M • 138 • 1

updated a dataset 17 days ago

tsessk/tldr-17-chat

Viewer • Updated 17 days ago • 3.85M • 143

published a dataset 17 days ago

tsessk/tldr-17-chat

Viewer • Updated 17 days ago • 3.85M • 143

updated 3 models about 2 months ago

published a model about 2 months ago

tsessk/llm-course-hw2-dpo

Text Generation • Updated Mar 8 • 2

updated a collection about 2 months ago

llm-course-hw2

Collection

llm course @ HSE and vk llm A collection of SmolLM-135M models fine-tuned with DPO, PPO, and Reward Modeling to enhance human-like expressiveness • 3 items • Updated Mar 8

published 2 models about 2 months ago

tsessk/llm-course-hw2-ppo

Text Generation • Updated Mar 8 • 2

tsessk/llm-course-hw2-reward-model

Text Classification • Updated Mar 8 • 2

updated a model about 2 months ago

tsessk/content

Text Classification • Updated Mar 6 • 1