AI & ML interests

transformation

Recent Activity

sequelbox 
posted an update 4 days ago
view post
Post
2100
NEW RELEASE: Shining Valiant 3!

- Cutting edge science-reasoning: sequelbox/Celestia3-DeepSeek-R1-0528 for physics, biology, chemistry, compsci, astronomy, Earth science, and information theory
- AI to build AI: the all-new sequelbox/Mitakihara-DeepSeek-R1-0528 dataset for high-quality reasoning performance on AI, MLOps, math and CUDA, complex adaptive and agentic systems, cognition, logic, linguistics, simulation, knowledge management, and more!
- Creative reasoning and general chat performance supplemented with sequelbox/Raiden-DeepSeek-R1

Our first release in the SV3 series is Qwen 3, starting off with 8B and 1.7B.
Get 8B: ValiantLabs/Qwen3-8B-ShiningValiant3
Get 1.7B: ValiantLabs/Qwen3-1.7B-ShiningValiant3

We want to bring SV3 to larger models ASAP. Help us out: sequelbox/SupportOpenSource

This is the most excited we've ever been for a release. We hope you enjoy Shining Valiant 3 as much as we do!

With friendship, for the future,
allegra
sequelbox 
posted an update 12 days ago
view post
Post
1805
The full Celestia 3 science-reasoning dataset is here!

- 91k high-quality synthetic science prompts answered by DeepSeek-R1-0528
- subjects include physics, biology, chemistry, computer science, Earth science, astronomy, and information theory
- one of the reasoning datasets powering the upcoming Shining Valiant 3 :) coming soon!

GET IT NOW, FOR EVERYONE: sequelbox/Celestia3-DeepSeek-R1-0528
SUPPORT OUR RELEASES: sequelbox/SupportOpenSource

with love,
allegra
  • 2 replies
·
sequelbox 
posted an update 27 days ago
view post
Post
1063
a list of what's coming up soon from us:

- Shining Valiant 3 for Valiant Labs, powered by the full size Celestia 3 and other soon to be released high-difficulty reasoning datasets
- a new type of reasoning model and dataset we're very excited about - would love to bring out an alpha release here as soon as possible
- more model releases for Esper 3 (weigh in if there are any models you'd like us to prioritize!)
- other New Things

not sure of the exact release order yet, but we'll look to get everything out as quick as we can :)

with excitement,
allegra
sequelbox 
posted an update about 1 month ago
view post
Post
1100
EARLY SNEAK PREVIEW: get a first look at the Celestia 3 science-reasoning dataset, built with DeepSeek's newest R1-0528 reasoning model! Subjects include physics, chemistry, biology, computer science, Earth science, astronomy, and information theory.

This early look contains the first 14k rows, all synthetic responses using deepseek-ai/DeepSeek-R1-0528

SEE IT HERE: sequelbox/Celestia3-DeepSeek-R1-0528-PREVIEW

Support our releases: sequelbox/SupportOpenSource

Coming up we'll have more dataset releases, including some novel reasoning and analysis methods - we think an important role for open source researchers is experimenting with new response styles on top of the increasingly excellent base models available to finetune.

more to come soon!
allegra
sequelbox 
posted an update about 1 month ago
view post
Post
325
NEW RELEASE: we've brought Esper 3 to the new deepseek-ai/DeepSeek-R1-0528-Qwen3-8B model!

- A full-stack software assistant: a reasoning finetune focused on coding, architecture, and DevOps using the Titanium and Tachibana datasets!
- Improved general and creative reasoning skills, powered by the Raiden dataset.

Get the newest Esper 3: ValiantLabs/DeepSeek-R1-0528-Qwen3-8B-Esper3
Support our releases: sequelbox/SupportOpenSource

more on the way next week!

celestially yours ;)
allegra
sequelbox 
posted an update about 1 month ago
view post
Post
330
Updates for the week:
- released some new merge models using ValiantLabs/Qwen3-14B-Esper3 and other Qwen 3 14b finetunes - these merges include math, Web3, uncensored, and general mix. depending on your use case for Esper 3 these may be helpful to you! find them at @sequelbox
- coming up we'll have more model sizes for Esper 3 and Cobalt 2, releasing soon!
- also super excited for more dataset releases with the newly released deepseek-ai/DeepSeek-R1-0528

Support the above efforts and others: sequelbox/SupportOpenSource

back to building :)
  • 2 replies
·
sequelbox 
posted an update about 2 months ago
view post
Post
1874
NEW RELEASE: Cobalt 2 for Qwen 3 14b!

- A math-reasoning finetune, focused on high-difficulty math questions with the zwhe99/DeepMath-103K dataset!
- Improved general and creative reasoning skills, powered by the Raiden dataset

GET IT NOW: ValiantLabs/Qwen3-14B-Cobalt2
HELP US RELEASE FASTER: sequelbox/SupportOpenSource

we've got more releases to come soon - excited to share with everyone!

love,
allegra
sequelbox 
posted an update about 2 months ago
view post
Post
1285
Esper 3 is now available for Qwen 3 14b!

- A full-stack software assistant: a reasoning finetune focused on coding, architecture, and DevOps using the Titanium and Tachibana datasets!
- Improved general and creative reasoning skills, powered by the Raiden dataset.

GET IT NOW: ValiantLabs/Qwen3-14B-Esper3
HELP US RELEASE FASTER: sequelbox/SupportOpenSource

more to come :)
allegra