Skolkovo Institute of Science and Technology committed
Commit b92b103
Parent(s): abb1785
Update README.md

README.md CHANGED
@@ -14,6 +14,23 @@ This model was trained in terms of [GenChal 2022: Feedback Comment Generation fo

In this task, the model receives a string containing text with an error, together with the exact span of the error, and should return a natural-language comment explaining the nature of the error.

## Model training details

#### Data

@@ -50,48 +67,6 @@ The main feature of our training pipeline was data augmentation. The idea of the

Using both initial and augmented data we fine-tuned [t5-large](https://huggingface.co/t5-large).

-## How to use
-
-```python
-import re
-
-from transformers import T5ForConditionalGeneration, AutoTokenizer
-
-text_with_error = 'I want to stop smoking during driving bicycle .'
-error_span = '23:29'
-
-off1, off2 = list(map(int, error_span.split(":")))
-# wrap every token inside the error span in "< <" / "> >" markers
-text_with_error_pointed = text_with_error[:off1] + "< < " + re.sub(r"\s+", " > > < < ", text_with_error[off1:off2].strip()) + " > > " + text_with_error[off2:]
-text_with_error_pointed = re.sub(r"\s+", " ", text_with_error_pointed.strip()).lower()
-
-tokenizer = AutoTokenizer.from_pretrained("SkolkovoInstitute/GenChal_2022_nigula")
-model = T5ForConditionalGeneration.from_pretrained("SkolkovoInstitute/GenChal_2022_nigula").cuda()
-model.eval()
-
-def paraphrase(text, model, temperature=1.0, beams=3):
-    texts = [text] if isinstance(text, str) else text
-    inputs = tokenizer(texts, return_tensors='pt', padding=True)['input_ids'].to(model.device)
-    result = model.generate(
-        inputs,
-        do_sample=False,
-        temperature=temperature,
-        repetition_penalty=1.1,
-        max_length=int(inputs.shape[1] * 3),
-        num_beams=beams,
-    )
-    texts = [tokenizer.decode(r, skip_special_tokens=True) for r in result]
-    if isinstance(text, str):
-        return texts[0]
-    return texts
-
-paraphrase([text_with_error_pointed], model)
-
-# expected output: ["a gerund > does not normally follow the preposition > during > >. think of an expression using the conjunction >'while'instead of a preposition >."]
-```

## Licensing Information
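The span-marking preprocessing used in the removed snippet can be illustrated on its own. This is a minimal sketch of the preprocessing step only (no model is loaded), assuming, as the snippet suggests, that the span is a pair of character offsets `"start:end"` into the raw string:

```python
import re

text_with_error = 'I want to stop smoking during driving bicycle .'
error_span = '23:29'  # character offsets "start:end" into the raw string

off1, off2 = map(int, error_span.split(":"))
print(text_with_error[off1:off2])  # the erroneous fragment: during

# Wrap every token inside the span in "< <" / "> >" markers,
# then collapse whitespace and lowercase the result.
pointed = (
    text_with_error[:off1]
    + "< < "
    + re.sub(r"\s+", " > > < < ", text_with_error[off1:off2].strip())
    + " > > "
    + text_with_error[off2:]
)
pointed = re.sub(r"\s+", " ", pointed.strip()).lower()
print(pointed)
# i want to stop smoking < < during > > driving bicycle .
```

The markers delimit the error span in the input the T5 model sees, which is why the generated comments can refer to the marked tokens.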

In this task, the model receives a string containing text with an error, together with the exact span of the error, and should return a natural-language comment explaining the nature of the error.

+## How to use
+
+```python
+!pip install feedback_generation_nigula
+from feedback_generation_nigula.generator import FeedbackGenerator
+
+fg = FeedbackGenerator(cuda_index = 0)
+text = "The smoke flow my face ."
+span = (10, 17)
+
+fg.get_feedback([text], [span])
+
+# expected output: ["When the <verb> <<flow>> is used as an <intransitive verb> to express ''to move in a stream'', a <preposition> needs to be placed to indicate the direction"]
+```

## Model training details

#### Data

Using both initial and augmented data we fine-tuned [t5-large](https://huggingface.co/t5-large).

## Licensing Information
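In the added example, the `span` appears to be a `(start, end)` pair of character offsets into the raw text (an assumption read off the example, not documented API behaviour). A quick check of that reading, without loading the generator:

```python
text = "The smoke flow my face ."
span = (10, 17)

# Slicing the raw string with the span recovers the fragment
# that the generated feedback comment refers to.
start, end = span
print(text[start:end])  # flow my
```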