6 4 22

its5Q PRO

its5Q

https://t.me/dno5iq

its5Q

AI & ML interests

None yet

Recent Activity

reacted to nyuuzyou's post with 👍 about 20 hours ago

🦅 SmolLM2-Eagle Collection - https://huggingface.co/collections/nyuuzyou/smollm2-eagle-680263bf97f0c7e6bbe4936b Collection of fine-tuned bilingual language models featuring: - Models in three parameter sizes: 135M, 360M, and 1.7B based on HuggingFaceTB's SmolLM2 models - Both standard and GGUF formats for flexible deployment in llama.cpp and Ollama - Fine-tuned on https://huggingface.co/datasets/nyuuzyou/EagleSFT dataset (536,231 Russian-English QA pairs derived from 739k+ real user queries) - Experimental Russian language capabilities while maintaining English performance - Limited Russian capabilities due to SFT-only approach without Russian pre-training - Environmental impact: ~19.75 kg CO2eq This collection provides compact models for research on bilingual language capabilities, resource-constrained environments, and educational applications. Not recommended for production use due to experimental nature and inherent limitations. Available under Apache 2.0 license.

upvoted a collection 4 days ago

SmolLM2-Eagle

liked a model 4 days ago

nyuuzyou/SmolLM2-135M-Eagle

View all activity

Organizations

its5Q's activity

reacted to nyuuzyou's post with 👍 about 20 hours ago

Post

2976

🦅 SmolLM2-Eagle Collection - nyuuzyou/smollm2-eagle-680263bf97f0c7e6bbe4936b

Collection of fine-tuned bilingual language models featuring:
- Models in three parameter sizes: 135M, 360M, and 1.7B based on HuggingFaceTB's SmolLM2 models
- Both standard and GGUF formats for flexible deployment in llama.cpp and Ollama
- Fine-tuned on nyuuzyou/EagleSFT dataset (536,231 Russian-English QA pairs derived from 739k+ real user queries)
- Experimental Russian language capabilities while maintaining English performance
- Limited Russian capabilities due to SFT-only approach without Russian pre-training
- Environmental impact: ~19.75 kg CO2eq

This collection provides compact models for research on bilingual language capabilities, resource-constrained environments, and educational applications. Not recommended for production use due to experimental nature and inherent limitations. Available under Apache 2.0 license.

1 reply

upvoted a collection 4 days ago

SmolLM2-Eagle

Collection

7 items • Updated 1 day ago • 4

liked a model 4 days ago

nyuuzyou/SmolLM2-135M-Eagle

Text Generation • Updated 8 days ago • 17 • 3

liked a dataset 8 days ago

nyuuzyou/EagleSFT

Viewer • Updated 10 days ago • 1.07M • 129 • 7

reacted to nyuuzyou's post with 👍 9 days ago

Post

2880

🦅 EagleSFT Dataset - nyuuzyou/EagleSFT

Collection of 536,231 question-answer pairs featuring:

- Human-posed questions and machine-generated responses for SFT
- Bilingual content in Russian and English with linked IDs
- Derived from 739k+ real user queries, primarily educational topics
- Includes unique IDs and machine-generated category labels

This dataset provides a resource for supervised fine-tuning (SFT) of large language models, cross-lingual research, and understanding model responses to diverse user prompts. Released to the public domain under CC0 1.0 license.

liked a dataset 23 days ago

nyuuzyou/paintberri

Updated 23 days ago • 97 • 3

updated a dataset 27 days ago

its5Q/bigger-ru-book

Viewer • Updated 27 days ago • 96.6k • 179 • 4

published a dataset 27 days ago

its5Q/bigger-ru-book

Viewer • Updated 27 days ago • 96.6k • 179 • 4

liked a model 2 months ago

Vikhrmodels/QVikhr-2.5-1.5B-Instruct-r

Text Generation • Updated Feb 11 • 410 • 30

upvoted a paper 3 months ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 115

liked a dataset 3 months ago

teleren/ui-navigation-corpus

Viewer • Updated Feb 3 • 1.03M • 700 • 15

liked a model 3 months ago

nyuuzyou/AircraftFLUX-LoRA

Text-to-Image • Updated Dec 9, 2024 • 22 • 3

liked a dataset 3 months ago

nyuuzyou/artfol

Viewer • Updated 5 days ago • 1.89M • 64 • 4

posted an update 4 months ago

Post

3002

Am I missing something, or there is still no way to filter by model size while searching for models? It has been a requested feature since 2022, but I haven't seen any updates since! With the amount of different models coming out, I think the size filter would be a great extension of the search functionality, especially when looking for smaller models, which are a lot less prevalent.

1 reply

liked a dataset 5 months ago

alpindale/two-million-bluesky-posts

Viewer • Updated Nov 28, 2024 • 2.11M • 540 • 199

liked a model 6 months ago

KimberleyJSN/melbandroformer

Updated Aug 6, 2024 • 18

New activity in Vikhrmodels/Vikhr-Nemo-12B-Instruct-R-21-09-24 6 months ago

Проблема c кавычками в json

#4 opened 6 months ago by

kracozebr

posted an update 8 months ago

Post

1445

Continuing my streak by releasing the Wikireading dataset: a large collection of scraped non-fiction books predominantly in Russian language.
its5Q/wikireading

Here's the highlights:
- ~7B tokens, or ~28B characters, making it a great candidate for use in pretraining
- Contains non-fiction works from many knowledge domains
- Includes both the original HTML and extracted text of book chapters

New activity in its5Q/wikireading 8 months ago

Update README.md

#1 opened 8 months ago by

its5Q

updated a dataset 8 months ago

its5Q/wikireading

Viewer • Updated Aug 29, 2024 • 4.35M • 36 • 5