alkinun's picture

alkinun

AtAndDev

AI & ML interests

LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN..

Recent Activity

liked a model about 22 hours ago
deepseek-ai/DeepSeek-V3-0324
reacted to openfree's post with ❀️ about 22 hours ago
πŸš€ DeepSeek V3-0324 + Real-time Research Power! 🌐 Hello there! Today I'm excited to introduce an amazing tool based on the DeepSeek V3-0324 latest model. This isn't just another AI chatbotβ€”it's a true "research assistant" capable of real-time information retrieval and analysis! https://huggingface.co/spaces/openfree/Deepseek-v3-0324-Research 🧠 Key Strengths of DeepSeek V3-0324 DeepSeek V3-0324, provided by Fireworks AI, comes with these powerful advantages: 🎯 Superior Reasoning: Excellent ability to solve complex problems step-by-step πŸ“š Extensive Knowledge: Deep understanding across various topics from comprehensive training 🧩 Context Awareness: Maintains long conversation contexts for consistent responses 🌍 Multilingual Support: Processes various languages effectively πŸ”Ž Added Real-time "Deep Research" Capability! The most exciting feature of this project is the implementation of real-time search functionality similar to ChatGPT's Browse with Bing or Perplexity AI! 🌟 How does it work? πŸ“‹ Query Analysis: Analyzes questions to automatically extract optimal search keywords 🌐 Web Search: Utilizes advanced search technology to retrieve the latest information πŸ§ͺ Result Analysis: Intelligently analyzes search results and evaluates relevance πŸ’‘ Comprehensive Response: Combines freshly retrieved information with AI's existing knowledge Key Benefits: ⏱️ Up-to-date Information: Always provides the latest data through real-time web searches πŸ“Š Enhanced Reliability: Improves trustworthiness by citing information sources πŸ”„ Overcoming Knowledge Limitations: Handles questions beyond the AI's training cutoff πŸ› οΈ Research Efficiency: Processes everything from information retrieval to analysis in one go πŸ–₯️ How to Use It's simple! Just enable the "Deep Research" checkbox and ask your question. The AI will automatically search for and analyze relevant information to provide rich, informed answers.
View all activity

Organizations

ESPnet's profile picture CVPR Demo Track's profile picture BigScience Biomedical Datasets's profile picture ONNXConfig for all's profile picture video-p2p-library's profile picture Gradio-Themes-Party's profile picture Gradio-Blocks-Party's profile picture scikit-learn's profile picture Open-Source AI Meetup's profile picture lora concepts library's profile picture OpenBuddy Community's profile picture ECCV 2022's profile picture Kornia AI's profile picture Tune a video concepts library's profile picture SIGGRAPH 2022's profile picture Interspeech2022's profile picture Stable Diffusion concepts library's profile picture SIGGRAPH Asia 2022 Demos's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture Musika's profile picture Blog-explorers's profile picture OpenSky's profile picture ICCV2023's profile picture ICML2023's profile picture huggingPartyParis's profile picture MultiπŸ€–Transformers's profile picture Team Tonic's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture ZeroGPU Explorers's profile picture Pirates Party for all software open source's profile picture MLX Community's profile picture recipe research's profile picture Narra's profile picture Social Post Explorers's profile picture Cognitive Computations's profile picture M4-ai's profile picture Spinner-GPT-4's profile picture Dev Mode Explorers's profile picture Stable Diffusion Community (Unofficial, Non-profit)'s profile picture Hugging Face Discord Community's profile picture Nerdy Face's profile picture OpenEndedLM's profile picture open/ acc's profile picture Data Is Better Together Contributor's profile picture None yet's profile picture

AtAndDev's activity

reacted to openfree's post with β€οΈπŸ‘€πŸš€πŸ”₯ about 22 hours ago
view post
Post
2928
πŸš€ DeepSeek V3-0324 + Real-time Research Power! 🌐

Hello there! Today I'm excited to introduce an amazing tool based on the DeepSeek V3-0324 latest model. This isn't just another AI chatbotβ€”it's a true "research assistant" capable of real-time information retrieval and analysis!

openfree/Deepseek-v3-0324-Research

🧠 Key Strengths of DeepSeek V3-0324
DeepSeek V3-0324, provided by Fireworks AI, comes with these powerful advantages:

🎯 Superior Reasoning: Excellent ability to solve complex problems step-by-step
πŸ“š Extensive Knowledge: Deep understanding across various topics from comprehensive training

🧩 Context Awareness: Maintains long conversation contexts for consistent responses
🌍 Multilingual Support: Processes various languages effectively

πŸ”Ž Added Real-time "Deep Research" Capability!
The most exciting feature of this project is the implementation of real-time search functionality similar to ChatGPT's Browse with Bing or Perplexity AI! 🌟
How does it work?

πŸ“‹ Query Analysis: Analyzes questions to automatically extract optimal search keywords
🌐 Web Search: Utilizes advanced search technology to retrieve the latest information
πŸ§ͺ Result Analysis: Intelligently analyzes search results and evaluates relevance
πŸ’‘ Comprehensive Response: Combines freshly retrieved information with AI's existing knowledge

Key Benefits:

⏱️ Up-to-date Information: Always provides the latest data through real-time web searches
πŸ“Š Enhanced Reliability: Improves trustworthiness by citing information sources
πŸ”„ Overcoming Knowledge Limitations: Handles questions beyond the AI's training cutoff
πŸ› οΈ Research Efficiency: Processes everything from information retrieval to analysis in one go

πŸ–₯️ How to Use
It's simple! Just enable the "Deep Research" checkbox and ask your question. The AI will automatically search for and analyze relevant information to provide rich, informed answers.
  • 1 reply
Β·
reacted to merve's post with πŸ€— 3 days ago
view post
Post
3013
So many open releases at Hugging Face past week 🀯 recapping all here ‡️ merve/march-21-releases-67dbe10e185f199e656140ae

πŸ‘€ Multimodal
> Mistral AI released a 24B vision LM, both base and instruction FT versions, sota πŸ”₯ (OS)
> with IBM we released SmolDocling, a sota 256M document parser with Apache 2.0 license (OS)
> SpatialLM is a new vision LM that outputs 3D bounding boxes, comes with 0.5B (QwenVL based) and 1B (Llama based) variants
> SkyWork released SkyWork-R1V-38B, new vision reasoning model (OS)

πŸ’¬ LLMs
> NVIDIA released new Nemotron models in 49B and 8B with their post-training dataset
> LG released EXAONE, new reasoning models in 2.4B, 7.8B and 32B
> Dataset: Glaive AI released a new reasoning dataset of 22M+ examples
> Dataset: NVIDIA released new helpfulness dataset HelpSteer3
> Dataset: OpenManusRL is a new agent dataset based on ReAct framework (OS)
> Open-R1 team released OlympicCoder, new competitive coder model in 7B and 32B
> Dataset: GeneralThought-430K is a new reasoning dataset (OS)

πŸ–ΌοΈ Image Generation/Computer Vision
> Roboflow released RF-DETR, new real-time sota object detector (OS) πŸ”₯
> YOLOE is a new real-time zero-shot object detector with text and visual prompts πŸ₯Ή
> Stability AI released Stable Virtual Camera, a new novel view synthesis model
> Tencent released Hunyuan3D-2mini, new small and fast 3D asset generation model
> ByteDance released InfiniteYou, new realistic photo generation model
> StarVector is a new 8B model that generates svg from images
> FlexWorld is a new model that expands 3D views (OS)

🎀 Audio
> Sesame released CSM-1B new speech generation model (OS)

πŸ€– Robotics
> NVIDIA released GR00T, new robotics model for generalized reasoning and skills, along with the dataset

*OS ones have Apache 2.0 or MIT license
reacted to Jaward's post with πŸ”₯ 3 days ago
replied to MonsterMMORPG's post 5 days ago
replied to MonsterMMORPG's post 6 days ago
view reply

brother, dunking on some great models to defend your "product" is not a great (hate to say it but) human value...

replied to nroggendorff's post 6 days ago
reacted to onekq's post with πŸ˜” 6 days ago
view post
Post
1545
I like to benchmark πŸ’΅o1-proπŸ’΅ but it is way too expensive for me πŸ€¦β€β™‚οΈ
Β·
replied to onekq's post 6 days ago
view reply

Its expensive for everyone, just go with o3-mini, they just figured out that they are not the single llm provider and just doubled the cost of r1 for o3-mini.

reacted to etemiz's post with πŸš€πŸ˜ŽπŸ‘€ 6 days ago
view post
Post
1668
Started fine tuning Gemma 3 using evolutionary approach. It is not the worst model according to AHA leaderboard and it is one of the smart according to lmarena.ai. My objective is to make it based, anti woke, wise, beneficial and then some.

Several GPUs are fine tuning it at the same time, each using a different dataset and using QLoRA and the successful ones are merged later. Compared to LoRa this allows faster training and also reduced overfitting because the merge operation heals overfitting. The problem with this could be the 4 bit quantization may make models dumber. But I am not looking for sheer IQ. Too much mind is a problem anyway :)

Has anyone tried parallel QLoRa and merge before?

I also automated the dataset selection and benchmarking and converging to objectives (the fit function, the reward). It is basically trying to get higher score in AHA Leaderboard as fast as possible with a diverse set of organisms that "evolve by training".

I want to release some cool stuff when I have the time:
- how an answer to a single question changes over time, with each training round or day
- a chart to show AHA alignment over training rounds
  • 3 replies
Β·
reacted to clem's post with πŸ‘€ 6 days ago
view post
Post
3595
Should we assemble affordable open-source robots at Hugging Face for the community. Would you buy them? At what price?
Β·