Synthesized Dataset

#2
by Tottowich - opened

Hi, great job on this project!
Is the synthesized dataset of 200k examples going to be open-sourced?
Looking to train a similar model with Llama3 as base instead.

MotherDuck org

We released a 25k subset. Hope that helps :) https://huggingface.co/datasets/motherduckdb/duckdb-text2sql-25k

Sign up or log in to comment