Migel Tissera commited on
Commit
5f57df3
1 Parent(s): 5f5acc4

image added

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -2,10 +2,10 @@
2
  license: apache-2.0
3
  ---
4
 
5
- ![HelixNet](https://huggingface.co/migtissera/HelixNet/resolve/main/HelixNet.png)
6
-
7
  # HelixNet
8
 
 
 
9
  HelixNet is a Deep Learning architecture consisting of 3 x Mistral-7B LLMs. It has an `actor`, a `critic`, and a `regenerator`. The `actor` LLM produces an initial response to a given system-context and a question. The `critic` then takes in as input, a tuple of (system-context, question, response) and provides a critique based on the provided answer to the given system-context and the question. Its job is not to criticize, but to provide an intelligent critique so that the answer can be modified/regenerated to address the question better. Finally, the `regenerator` takes in a tuple of (system-context, question, response, critique) and regenerates the answer.
10
 
11
  HelixNet is insprired from an actor-critic architecture most prominent in Reinforcement Learning algorithms. The name derives from Helix, referring to the spiral structure of a DNA molecule. It symbolizes the intertwined nature of the three networks, working in tandem, much like the strands of a DNA molecule.
 
2
  license: apache-2.0
3
  ---
4
 
 
 
5
  # HelixNet
6
 
7
+ [HelixNet](https://huggingface.co/migtissera/HelixNet/resolve/main/HelixNet.png)
8
+
9
  HelixNet is a Deep Learning architecture consisting of 3 x Mistral-7B LLMs. It has an `actor`, a `critic`, and a `regenerator`. The `actor` LLM produces an initial response to a given system-context and a question. The `critic` then takes in as input, a tuple of (system-context, question, response) and provides a critique based on the provided answer to the given system-context and the question. Its job is not to criticize, but to provide an intelligent critique so that the answer can be modified/regenerated to address the question better. Finally, the `regenerator` takes in a tuple of (system-context, question, response, critique) and regenerates the answer.
10
 
11
  HelixNet is insprired from an actor-critic architecture most prominent in Reinforcement Learning algorithms. The name derives from Helix, referring to the spiral structure of a DNA molecule. It symbolizes the intertwined nature of the three networks, working in tandem, much like the strands of a DNA molecule.