Nielly

Nielly
·

AI & ML interests

None yet

Recent Activity

Organizations

None yet

Nielly's activity

reacted to nicolay-r's post with 👀 about 19 hours ago
view post
Post
1642
📢 For those who wish to apply DeepSeek-R1 for handling tabular / streaming data using schema of prompts (CoT), the OpenRouter AI hosts API for accessing:
https://openrouter.ai/deepseek/deepseek-r1

The no-string option to quick start with using DeepSeek-R1 includes three steps:
✅ OpenRouter provider: https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/open_router.py
✅ Bulk-chain for infering data: https://github.com/nicolay-r/bulk-chain
✅ Json Schema for Chain-of-Though reasoning (see screenshot 📷 below)

📺 below is a screenshot of how to quick start the demo, in which you can test your schema for LLM responses. It would ask to type all the parameters first for completing the requests (which is text within this example).

📃 To apply it for JSONL/CSV data, you can use --src shell parameter for passing the related file

⏳ As for time, OpenRouter finds me relatively slow with 30~40 seconds per request

Models:
deepseek-ai/DeepSeek-R1
  • 1 reply
·
reacted to lewtun's post with 🔥🚀 about 20 hours ago
view post
Post
7971
We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open!

🧪 Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1.

🧠 Step 2: replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code.

🔥 Step 3: show we can go from base model -> SFT -> RL via multi-stage training.

Follow along: https://github.com/huggingface/open-r1
·
reacted to AdinaY's post with 🚀 1 day ago
view post
Post
2004
🔥So many exciting releases coming from the Chinese community this month!
zh-ai-community/2025-january-6786b054f492fb223591269e

LLMs:
✨ Qwen2.5 -1M by Alibaba
Qwen/qwen25-1m-679325716327ec07860530ba
✨ InternLM3-8B-Instruct by Shanghai AI Lab
internlm/internlm3-8b-instruct
✨ MiniMax-Text-01 by MiniMax AI
MiniMaxAI/MiniMax-Text-01
✨ RWKV-7 by BlinkDL -- RNN + Transformer 👀
BlinkDL/rwkv-7-world
✨ DeepSeek-R1 by DeepSeek -- THE ONE 🙌
https://huggingface.co/deepseek-ai
✨ Baichuan-M1-14B by Baichuan - Medical 🩺
baichuan-inc/Baichuan-M1-14B-Base
✨ Qwen2.5-Math-PRM by Alibaba - Math 🔢
Qwen/Qwen2.5-Math-PRM-7B

Code:
✨ Tare by Bytedance
https://trae.ai

TTS:
✨ T2A-01-HD by MiniMax AI
https://hailuo.ai/audio
✨ LLaSA by HKUST Audio
HKUSTAudio/Llasa-3B

MLLM:
✨ Kimi k1.5 by Moonshot AI
https://kimi.ai
✨ MiniCPM-o-2_6 by OpenBMB
openbmb/MiniCPM-o-2_6
✨ Sa2VA-4B by ByteDance
ByteDance/Sa2VA-4B
✨ VideoLLaMA 3 by Alibaba DAMO
DAMO-NLP-SG/videollama3-678cdda9281a0e32fe79af15
✨ LLaVA-Mini by Chinese Academy of Sciences
ICTNLP/llava-mini-llama-3.1-8b
✨Hunyuan-7B by Tencent
tencent/Hunyuan-7B-Instruct
✨ Hunyuan 3D 2.0 by Tencent
tencent/Hunyuan3D-2
✨MiniMax-VL-01 by MiniMax AI - A non transformer based VLM 👀
MiniMaxAI/MiniMax-VL-01

Agent:
✨ UI-TARS by Bytedance
bytedance-research/UI-TARS-7B-SFT
✨ GLM-PC by Zhipu AI
https://cogagent.aminer.cn

Dataset:
✨ Fineweb-Edu-Chinese by Opencsg
opencsg/Fineweb-Edu-Chinese-V2.1
✨ Multimodal_textbook by Alibaba
DAMO-NLP-SG/multimodal_textbook
✨ MME-Finance by Hithink AI
·
reacted to onekq's post with 👍 3 days ago
view post
Post
2210
So 🐋DeepSeek🐋 hits the mainstream media. But it has been a star in our little cult for at least 6 months. Its meteoric success is not overnight, but two years in the making.

To learn their history, just look at their 🤗 repo https://huggingface.co/deepseek-ai

* End of 2023, they launched the first model (pretrained by themselves) following Llama 2 architecture
* June 2024, v2 (MoE architecture) surpassed Gemini 1.5, but behind Mistral
* September, v2.5 surpassed GPT 4o mini
* December, v3 surpassed GPT 4o
* Now R1 surpassed o1

Most importantly, if you think DeepSeek success is singular and unrivaled, that's WRONG. The following models are also near or equal the o1 bar.

* Minimax-01
* Kimi k1.5
* Doubao 1.5 pro
  • 1 reply
·
reacted to fantaxy's post with 🔥 3 days ago
view post
Post
5496
📚 AI Graphic Novel Generator Suite 2025

🎯 Four Unique Genre Experiences

🗡️ Martial Arts Novel Generator
fantaxy/novel-sorim-en

Epic wuxia storytelling with real-time combat art
Traditional martial arts world visualization
Dynamic qi techniques in motion
Beautiful Eastern art style generation

💖 Romance Novel Generator
fantaxy/novel-romance-en

Contemporary romance with matching scenes
Emotional moment captures in art
Modern relationship visualization
Real-time romantic illustrations

🐉 Fantasy Novel Generator
fantaxy/novel-fantasy-en

Rich fantasy worlds come alive
Magical scenes in stunning detail
Epic quests visualized instantly
Dynamic fantasy art generation

🔒 Adult Novel Generator
fantaxy/novel-NSFW-en

Mature content with tasteful art (18+)
Modern scene visualization
Character-focused illustrations
Sophisticated mood settings

⚡ Core Features

7000+ token story generation
Real-time text-to-art creation
Auto scene illustration
Continuous story flow
Dynamic image gallery
HD quality (768x768)

🛠️ Technical Highlights

Advanced Flux image generation
Story-driven art creation
Genre-optimized visuals
Seamless integration
Instant visualization

#AINovel #GraphicNovel #StoryGeneration #HuggingFace
reacted to clem's post with 🔥 4 days ago
reacted to openfree's post with 🚀🔥 5 days ago
view post
Post
5357
🐸 Pepe Meme Generator

Hello to everyone who loves frog memes! Now you can generate fun images of Pepe in various scenarios. By using the DiffusionPipeline from Hugging Face and LoRA (a method of adding additional training data to a large model for a specific style), you can easily create Pepe characters.

🍀 Model & Space Links
Model Link:
openfree/pepe

Space Link:
openfree/pepe

The model card includes LoRA weights related to the Pepe character, allowing you to easily create meme-style images.
On the Space page, you can generate Pepe images right away via the web UI without writing extra code!

⭐ Main Features
Meme-Style Pepe Images

Enter prompts like “Pepe dancing excitedly” or “Pepe busking in the streets of New York,” and it automatically generates an image.
From comical, cartoon-like memes to a somewhat serious(?) Pepe, you can achieve a wide variety of styles.
LoRA Scale Adjustment

Change the LoRA scale parameter to fine-tune how strongly the Pepe style is applied.
A value closer to 0 yields a more generic image, while a value closer to 1 results in a strongly cartoon-like Pepe appearance.
Advanced Settings

Modify the Height and Width to generate vertical or horizontal images of different aspect ratios.
Adjust Guidance scale and Inference steps to get the exact level of detail and artistic style you want.
Seed Configuration

Choose a fixed seed or a random seed so that images are either reproducible or new every time.
🚀 Usage Ideas
SNS Meme Creation

Quickly make fun Pepe images for Twitter or Instagram Stories.
Perfect for events, birthdays, or any special occasion memes!
Fan Art & Merch Design

Use generated images as references for Pepe fan art, or draft designs for merchandise (stickers, T-shirts, etc.).
Blog & Community Posts

Spice up your blog articles or community posts with meme images.
Set up humorous scenarios featuring Pepe as an entertaining “reaction image.”
reacted to AdinaY's post with 🧠 8 days ago
view post
Post
2774
BIG release by DeepSeek AI🔥🔥🔥

DeepSeek-R1 & DeepSeek-R1-Zero: two 660B reasoning models are here, alongside 6 distilled dense models (based on Llama & Qwen) for the community!
https://huggingface.co/deepseek-ai
deepseek-ai/DeepSeek-R1

✨ MIT License : enabling distillation for custom models
✨ 32B & 70B models match OpenAI o1-mini in multiple capabilities
✨ API live now! Access Chain of Thought reasoning with model='deepseek-reasoner'