Mark Collier

sparkycollier

http://markcollier.me

AI & ML interests

Open Source AI for fun and profit. Open Source Infrastructure Software for AI & ML.

Recent Activity

liked a model 13 days ago

deepseek-ai/DeepSeek-V3-0324

liked a model 17 days ago

tencent/Hunyuan3D-2mv

liked a Space about 1 month ago

Wan-AI/Wan2.1

sparkycollier's activity

liked a model 13 days ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • Updated 10 days ago • 158k • 2.37k

liked a model 17 days ago

tencent/Hunyuan3D-2mv

Image-to-3D • Updated 18 days ago • 10.3k • 360

liked a Space about 1 month ago

1.39k

Wan2.1

💻

Wan: Open and Advanced Large-Scale Video Generative Models

liked a Space 3 months ago

2.08k

Anychat

🏢

Select and display code snippets for different AI providers

liked a model 3 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 10 days ago • 1.4M • 11.8k

New activity in deepseek-ai/DeepSeek-V3-Base 3 months ago

License

#2 opened 3 months ago by

mrfakename

liked a model 3 months ago

deepseek-ai/DeepSeek-V3-Base

Updated 10 days ago • 9.63k • 1.62k

liked a dataset 4 months ago

alpindale/two-million-bluesky-posts

Viewer • Updated Nov 28, 2024 • 2.11M • 488 • 198

liked a model 5 months ago

concedo/Beepo-22B

Updated Jan 29 • 48 • 56

upvoted an article 9 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23, 2024

• 232

liked a model 9 months ago

mistralai/Mamba-Codestral-7B-v0.1

Updated Aug 23, 2024 • 7.57k • 582

liked a model 11 months ago

01-ai/Yi-34B

Text Generation • Updated Nov 11, 2024 • 6.19k • 1.3k

liked 2 models 12 months ago

microsoft/Phi-3-mini-4k-instruct

Text Generation • Updated Sep 20, 2024 • 965k • 1.16k

UnicomLLM/Unichat-llama3-Chinese-8B

Text Generation • Updated Apr 22, 2024 • 1.12k • 74

New activity in meta-llama/Meta-Llama-3-8B 12 months ago

License

#3 opened 12 months ago by

mrfakename

liked 2 models 12 months ago

mistralai/Mixtral-8x22B-Instruct-v0.1

Text Generation • Updated Oct 3, 2024 • 26.4k • 722

mistral-community/Mixtral-8x22B-v0.1

Text Generation • Updated Jul 1, 2024 • 2.4k • 674

reacted to WizardLM's post with 🤗 12 months ago

Post

42801

🔥🔥🔥 Introducing WizardLM-2!

📙Release Blog: https://wizardlm.github.io/WizardLM2
✅Model Weights: microsoft/wizardlm-661d403f71e6c8257dbd598a
🐦Twitter: https://twitter.com/WizardLM_AI/status/1779899325868589372

We introduce and open-source WizardLM-2, our next-generation state-of-the-art large language models, which have improved performance on complex chat, multilingual, reasoning, and agent tasks. The new family includes three cutting-edge models: WizardLM-2 8x22B, WizardLM-2 70B, and WizardLM-2 7B.

WizardLM-2 8x22B is our most advanced model, and the best open-source LLM in our internal evaluation on highly complex tasks. WizardLM-2 70B reaches top-tier reasoning capabilities and is the first choice at its size. WizardLM-2 7B is the fastest and achieves performance comparable to leading open-source models 10x larger.

🤗 WizardLM-2 Capabilities:

1. MT-Bench (Figure 1)
WizardLM-2 8x22B demonstrates highly competitive performance even compared to the most advanced proprietary works such as GPT-4-Turbo and Claude-3. Meanwhile, WizardLM-2 7B and WizardLM-2 70B are the top-performing models among the other leading baselines at the 7B to 70B model scales.

2. Human Preferences Evaluation (Figure 2)
In this human preferences evaluation, WizardLM-2's capabilities are very close to those of cutting-edge proprietary models such as GPT-4-1106-preview, and significantly ahead of all the other open-source models.

🔍Method Overview:
As the natural world's human-generated data becomes increasingly exhausted through LLM training, we believe that data carefully created by AI, and models supervised step by step by AI, will be the sole path towards more powerful AI.

Over the past year, we built a fully AI-powered synthetic training system (as shown in Figure 3).

35 replies

liked a model 12 months ago

Vezora/Mistral-22B-v0.2

Text Generation • Updated Apr 15, 2024 • 106 • 110

liked a model about 1 year ago

Nexusflow/Starling-LM-7B-beta

Text Generation • Updated Apr 3, 2024 • 2.92k • 344