Upload folder using huggingface_hub

Browse files

Files changed (16) hide show

README.md +86 -0
cal_data.safetensors +3 -0
config.json +28 -0
generation_config.json +6 -0
job_new.json +0 -0
label_mask.npy +3 -0
labeled_matches.npy +3 -0
labels.npy +3 -0
measurement.json +0 -0
output.safetensors +3 -0
predictions.npy +3 -0
special_tokens_map.json +29 -0
tokenizer.json +0 -0
tokenizer.model +3 -0
tokenizer_config.json +83 -0
training_args.bin +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,86 @@

+---
+license: cc-by-sa-4.0
+library_name: transformers
+pipeline_tag: text-generation
+---
+# Update notice
+The model weights were updated at 7 AM UTC on Feb 7, 2024. The new model weights lead to a much more performant model – particularly for joins.
+If you downloaded the model before that, please redownload the weights for best performance.
+# Model Card for SQLCoder-7B-2
+A capable large language model for natural language to SQL generation.
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/603bbad3fd770a9997b57cb6/AYUE2y14vy2XkD9MZpScu.png)
+## Model Details
+### Model Description
+<!-- Provide a longer summary of what this model is. -->
+This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
+- **Developed by:** [Defog, Inc](https://defog.ai)
+- **Model type:** [Text to SQL]
+- **License:** [CC-by-SA-4.0]
+- **Finetuned from model:** [CodeLlama-7B]
+### Model Sources [optional]
+- [**HuggingFace:**](https://huggingface.co/defog/sqlcoder-70b-alpha)
+- [**GitHub:**](https://github.com/defog-ai/sqlcoder)
+- [**Demo:**](https://defog.ai/sqlcoder-demo/)
+## Uses
+This model is intended to be used by non-technical users to understand data inside their SQL databases. It is meant as an analytics tool, and not as a database admin tool.
+This model has not been trained to reject malicious requests from users with write access to databases, and should only be used by users with read-only access.
+## How to Get Started with the Model
+Use the code [here](https://github.com/defog-ai/sqlcoder/blob/main/inference.py) to get started with the model.
+## Prompt
+Please use the following prompt for optimal results. Please remember to use `do_sample=False` and `num_beams=4` for optimal results.
+```
+### Task
+Generate a SQL query to answer [QUESTION]{user_question}[/QUESTION]
+### Database Schema
+The query will run on a database with the following schema:
+{table_metadata_string_DDL_statements}
+### Answer
+Given the database schema, here is the SQL query that [QUESTION]{user_question}[/QUESTION]
+[SQL]
+```
+## Evaluation
+This model was evaluated on [SQL-Eval](https://github.com/defog-ai/sql-eval), a PostgreSQL based evaluation framework developed by Defog for testing and alignment of model capabilities.
+You can read more about the methodology behind SQLEval [here](https://defog.ai/blog/open-sourcing-sqleval/).
+### Results
+We classified each generated question into one of 6 categories. The table displays the percentage of questions answered correctly by each model, broken down by category.
+|                | date | group_by | order_by | ratio | join | where |
+| -------------- | ---- | -------- | -------- | ----- | ---- | ----- |
+| sqlcoder-70b   | 96   | 91.4     | 97.1     | 85.7  | 97.1 | 91.4  |
+| sqlcoder-7b-2  | 96   | 91.4     | 94.3     | 91.4  | 94.3 | 77.1  |
+| sqlcoder-34b   | 80   | 94.3     | 85.7     | 77.1  | 85.7 | 80    |
+| gpt-4          | 72   | 94.3     | 97.1     | 80    | 91.4 | 80    |
+| gpt-4-turbo    | 76   | 91.4     | 91.4     | 62.8  | 88.6 | 77.1  |
+| natural-sql-7b | 56   | 88.6     | 85.7     | 60    | 88.6 | 80    |
+| sqlcoder-7b    | 64   | 82.9     | 74.3     | 54.3  | 74.3 | 74.3  |
+| gpt-3.5        | 72   | 77.1     | 82.8     | 34.3  | 65.7 | 71.4  |
+| claude-2       | 52   | 71.4     | 74.3     | 57.1  | 65.7 | 62.9  |
+## Model Card Contact
+Contact us on X at [@defogdata](https://twitter.com/defogdata), or on email at [founders@defog.ai](mailto:founders@defog.ai)

cal_data.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:411ba6a4de6d25040d10fc69346536a502748cf103eacc08cc67fb10559a4b51
+size 1638488

config.json ADDED Viewed

	@@ -0,0 +1,28 @@

+{
+  "_name_or_path": "defog/sqlcoder-7b-instruct-ds7",
+  "architectures": [
+    "LlamaForCausalLM"
+  ],
+  "attention_bias": false,
+  "attention_dropout": 0.0,
+  "bos_token_id": 1,
+  "eos_token_id": 2,
+  "hidden_act": "silu",
+  "hidden_size": 4096,
+  "initializer_range": 0.02,
+  "intermediate_size": 11008,
+  "max_position_embeddings": 16384,
+  "model_type": "llama",
+  "num_attention_heads": 32,
+  "num_hidden_layers": 32,
+  "num_key_value_heads": 32,
+  "pretraining_tp": 1,
+  "rms_norm_eps": 1e-05,
+  "rope_scaling": null,
+  "rope_theta": 1000000,
+  "tie_word_embeddings": false,
+  "torch_dtype": "float16",
+  "transformers_version": "4.37.2",
+  "use_cache": true,
+  "vocab_size": 32016
+}

generation_config.json ADDED Viewed

	@@ -0,0 +1,6 @@

+{
+  "_from_model_config": true,
+  "bos_token_id": 1,
+  "eos_token_id": 2,
+  "transformers_version": "4.37.2"
+}

job_new.json ADDED Viewed

The diff for this file is too large to render. See raw diff

label_mask.npy ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ce06db10b5b506fcc10ada7fb9621e9a82a0b0560d04856e3571f432c3774fb9
+size 458228

labeled_matches.npy ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0294fdf749ea2973520795db71dea9401c98147ee0ace8931dab3042f63064d7
+size 458228

labels.npy ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e598810721e5682a88d42fa5c4844c2f1fce92c41a2fb0da1b28fe1994b47cc9
+size 3664928

measurement.json ADDED Viewed

The diff for this file is too large to render. See raw diff

output.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:871249f4182c6c4017fb9367fe4dce5803d4bfeaa8e71135db97be70fd7a96b3
+size 5222191200

predictions.npy ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:20712c1033ce864ac8a548eee7986235e60cd95b3b9cf3a441777044daa92a3d
+size 3664928

special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,29 @@

+{
+  "additional_special_tokens": [
+    "▁<PRE>",
+    "▁<MID>",
+    "▁<SUF>",
+    "▁<EOT>"
+  ],
+  "bos_token": {
+    "content": "<s>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "eos_token": {
+    "content": "</s>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "unk_token": {
+    "content": "<unk>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  }
+}

tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer.model ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:45ccb9c8b6b561889acea59191d66986d314e7cbd6a78abc6e49b139ca91c1e6
+size 500058

tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,83 @@

+{
+  "add_bos_token": true,
+  "add_eos_token": false,
+  "added_tokens_decoder": {
+    "0": {
+      "content": "<unk>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "1": {
+      "content": "<s>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "2": {
+      "content": "</s>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "32007": {
+      "content": "▁<PRE>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "32008": {
+      "content": "▁<SUF>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "32009": {
+      "content": "▁<MID>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "32010": {
+      "content": "▁<EOT>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "additional_special_tokens": [
+    "▁<PRE>",
+    "▁<MID>",
+    "▁<SUF>",
+    "▁<EOT>"
+  ],
+  "bos_token": "<s>",
+  "clean_up_tokenization_spaces": false,
+  "eos_token": "</s>",
+  "eot_token": "▁<EOT>",
+  "fill_token": "<FILL_ME>",
+  "legacy": null,
+  "middle_token": "▁<MID>",
+  "model_max_length": 1000000000000000019884624838656,
+  "pad_token": null,
+  "prefix_token": "▁<PRE>",
+  "sp_model_kwargs": {},
+  "suffix_token": "▁<SUF>",
+  "tokenizer_class": "CodeLlamaTokenizer",
+  "unk_token": "<unk>",
+  "use_default_system_prompt": false
+}

training_args.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d76514892c133615428b3e93206e03f334310b47e0b9565e067a577f87a2f1b2
+size 4856