Commit c9d6c5c, committed by richardr1126
Parent(s): 1b0e109
Update README.md
README.md CHANGED
@@ -19,6 +19,21 @@ license: bigcode-openrail-m
 - Finetuning was performed using QLoRa on a single RTX6000 48GB.
 - If you want the merged model: [richardr1126/spider-natsql-wizard-coder-merged](https://huggingface.co/richardr1126/spider-natsql-wizard-coder-merged)
 
+### Spider Dataset
+
+[Spider](https://arxiv.org/abs/1809.08887) is a large-scale complex and cross-domain semantic parsing and text-to-SQL dataset annotated by 11 Yale students
+The goal of the Spider challenge is to develop natural language interfaces to cross-domain databases.
+
+This dataset was used to finetune this model.
+
+### NatSQL
+
+[NatSQL](https://arxiv.org/abs/2109.05153) is an intermediate representation for SQL that simplifies the queries and reduces the mismatch between
+natural language and SQL. NatSQL preserves the core functionalities of SQL, but removes some clauses and keywords
+that are hard to infer from natural language descriptions. NatSQL also makes schema linking easier by reducing the
+number of schema items to predict. NatSQL can be easily converted to executable SQL queries and can improve the
+performance of text-to-SQL models.
+
 ## Citation
 
 Please cite the repo if you use the data or code in this repo.