ruslanmv
/

Meta-Llama-3.1-8B-Text-to-SQL

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ruslanmv commited on Oct 1, 2024

Commit

6af5ebe

·

verified ·

1 Parent(s): 944c3a5

Update README.md

Files changed (1) hide show

README.md +39 -8

README.md CHANGED Viewed

@@ -62,18 +62,49 @@ device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
 model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto", torch_dtype=torch.float16)
 tokenizer = AutoTokenizer.from_pretrained(model_name)
-# Example usage
-input_text = "Recupera il conteggio di tutte le righe nella tabella table1"
-inputs = tokenizer(input_text, return_tensors="pt").to(device)
-# Generate output text
-outputs = model.generate(**inputs, max_length=50)
-# Decode and print the generated text
-generated_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
-print(generated_text)
 ```
 ### Model Features
 - **Text Generation**: This model is fine-tuned to generate coherent and contextually accurate text based on the provided input.

 model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto", torch_dtype=torch.float16)
 tokenizer = AutoTokenizer.from_pretrained(model_name)
+# Initialize the tokenizer (adjust the model name as needed)
+# Define EOS token for terminating the sequences
+EOS_TOKEN = tokenizer.eos_token
+# Define Alpaca-style prompt template
+alpaca_prompt = """Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.
+### Instruction:
+{}
+### Input:
+{}
+### Response:
+"""
+# Format the prompt without the response part
+prompt = alpaca_prompt.format(
+    "Provide the SQL query",
+    "Seleziona tutte le colonne della tabella table1 dove la colonna anni è uguale a 2020"
+)
+# Tokenize the prompt and generate text
+inputs = tokenizer([prompt], return_tensors="pt").to("cuda")
+outputs = model.generate(**inputs, max_new_tokens=64, use_cache=True)
+# Decode the generated text
+generated_text = tokenizer.batch_decode(outputs, skip_special_tokens=True)[0]
+# Extract the generated response only (remove the prompt part)
+response_start = generated_text.find("### Response:") + len("### Response:\n")
+response = generated_text[response_start:].strip()
+# Print the response (excluding the prompt)
+print(response)
 ```
+and the answer is
+```
+SELECT * FROM table1 WHERE anni = 2020
+```
 ### Model Features
 - **Text Generation**: This model is fine-tuned to generate coherent and contextually accurate text based on the provided input.