ONEKQ AI

company

AI & ML interests

Benchmark, Code Generation, LLM

Recent Activity

onekq-ai's activity

onekq
posted an update about 20 hours ago
onekq
posted an update 2 days ago
onekq
posted an update 3 days ago
onekq
posted an update 6 days ago
Folks, let's get ready. 🥳 We will be busy soon. 😅🤗 https://github.com/huggingface/transformers/pull/36878
onekq
posted an update 7 days ago
I'd like to benchmark 💵o1-pro💵, but it is way too expensive for me 🤦‍♂️
onekq
posted an update 8 days ago
The majority of OneSQL downloads went to the lowest-end variant (7B-GGUF). I didn't expect this at all. This variant has the lowest accuracy, which is the tradeoff for its small size.

Like all LLMs, coding models hallucinate too. The wrong answers they give are only inches away from the right ones. In the case of SQL, the generated code is not only presentable but also executable, so it runs without error and returns the wrong rows.

I'm clueless, and curious how users will deal with this.
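
To make the failure mode concrete, here is a minimal sketch using Python's built-in sqlite3 module. The `orders` table, its rows, and both queries are hypothetical and made up for illustration; they are not OneSQL output. The comparison function is a simplified, order-insensitive take on checking execution results against a trusted reference query, not the official BIRD evaluator.

```python
import sqlite3


def run_query(conn, sql):
    """Execute sql and return its rows sorted, so comparison ignores row order."""
    return sorted(conn.execute(sql).fetchall())


def execution_match(conn, predicted_sql, reference_sql):
    """Return True when both queries produce the same set of rows."""
    return run_query(conn, predicted_sql) == run_query(conn, reference_sql)


def build_demo_db():
    """Create a small in-memory database (hypothetical schema and data)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE orders (id INTEGER, customer TEXT, amount REAL, status TEXT)"
    )
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?, ?, ?)",
        [
            (1, "alice", 120.0, "paid"),
            (2, "bob", 80.0, "paid"),
            (3, "alice", 200.0, "refunded"),
        ],
    )
    return conn


# Question: "How much has each customer paid?"
reference_sql = (
    "SELECT customer, SUM(amount) FROM orders "
    "WHERE status = 'paid' GROUP BY customer"
)
# A near miss: valid SQL that runs fine but forgets the status filter.
near_miss_sql = "SELECT customer, SUM(amount) FROM orders GROUP BY customer"

conn = build_demo_db()
reference_rows = run_query(conn, reference_sql)
near_miss_rows = run_query(conn, near_miss_sql)

print(reference_rows)
print(near_miss_rows)
print(execution_match(conn, near_miss_sql, reference_sql))
```

The near-miss query raises no error, so nothing flags the problem unless its rows are compared against a known-good answer or reviewed by someone who knows the data.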
  • 3 replies
onekq
posted an update 9 days ago
Introducing 🎉 OneSQL-v0.1 🥳, our first text-to-SQL model based on Qwen2.5-Coder. This model has achieved an EX (execution accuracy) score of 63.33 on the BIRD leaderboard (https://bird-bench.github.io/).

The model family includes 7B and 32B variants (collection: onekq-ai/onesql-v01-qwen-67d8e3eb1611c5532bb90c5f) and can also be found on Ollama (https://ollama.com/onekq/OneSQL-v0.1-Qwen).

My goal is to make OneSQL the most usable open-weights model for text-to-SQL. I'm currently working on best practices to help users use this model the right way and avoid pitfalls. After that, I plan to train the next version to push for a higher EX score.

Enjoy this model and feel free to share comments/questions 🤗
  • 1 reply
onekq
posted an update 11 days ago
Common formula to DIY an LLM:

Post-train a Qwen model with a dataset distilled from DeepSeek 😂

  • 2 replies