Update README.md
README.md
CHANGED
@@ -19,8 +19,6 @@ Figure 1: Illustration of the workflow and design principles behind generative m
Figure 2: PRefLexOR Recursive Reasoning Algorithm: An iterative approach leveraging a fine-tuned Reasoning Model and a general-purpose Critic Model to generate, refine, and optionally integrate responses. The process involves generating initial responses, extracting reflections, improving thinking processes, and creating new responses based on refined thinking, with an optional final integration step. The algorithm relies on extracting thinking processes (indicated via ```<|thinking|>...<|/thinking|>```) and reflection processes (indicated via ```<|reflect|>...<|/reflect|>```). The use of special tokens allows us to easily construct such agentic modeling as it facilitates pausing inference, improving the strategy, and re-generating improved answers. The sampled responses can either be used in their final state or integrated into an amalgamated response that shows very rich facets in the scientific process.
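The loop described in the Figure 2 caption can be sketched in a few lines. This is only an illustration under stated assumptions, not the repository's implementation: the token strings are taken from the caption, while `extract_between`, `recursive_refine`, `toy_reason`, and `toy_critic` are hypothetical names, and the two toy functions merely stand in for the fine-tuned Reasoning Model and the general-purpose Critic Model.

```python
import re

# Token strings as quoted in the Figure 2 caption.
THINKING_START, THINKING_END = "<|thinking|>", "<|/thinking|>"
REFLECT_START, REFLECT_END = "<|reflect|>", "<|/reflect|>"


def extract_between(text, start, end):
    """Return the first span between start and end (stripped), or None if absent."""
    pattern = re.escape(start) + r"(.*?)" + re.escape(end)
    match = re.search(pattern, text, flags=re.DOTALL)
    return match.group(1).strip() if match else None


def recursive_refine(question, reason, critic, n_rounds=2):
    """Hypothetical loop: generate, extract the reflection, improve the
    thinking, and regenerate. Returns every sampled response in order."""
    response = reason(question, None)
    responses = [response]
    for _ in range(n_rounds):
        thinking = extract_between(response, THINKING_START, THINKING_END)
        reflection = extract_between(response, REFLECT_START, REFLECT_END)
        if thinking is None or reflection is None:
            break  # nothing to refine without both sections
        improved = critic(thinking, reflection)
        response = reason(question, improved)
        responses.append(response)
    return responses


# Stand-ins for the two models (assumptions for illustration only).
def toy_reason(question, thinking):
    thinking = thinking or "first guess"
    return (
        f"{THINKING_START}{thinking}"
        f"{REFLECT_START}check units{REFLECT_END}"
        f"{THINKING_END}Answer to {question}"
    )


def toy_critic(thinking, reflection):
    return f"revised after: {reflection}"


samples = recursive_refine("q", toy_reason, toy_critic, n_rounds=2)
```

The optional integration step from the caption is left out here; `samples` simply holds each intermediate response so that a caller could either take the last one or merge them.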

-PRefLexOR Inference: Thinking and Agentic Reflection
-[![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lamm-mit/PRefLexOR/blob/main/PRefLexOR_inference_thinking.ipynb)

```python
@@ -45,6 +43,9 @@ thinking_end = '<|/thinking|>'

## Inference example

+PRefLexOR Inference: Thinking and Agentic Reflection
+[![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lamm-mit/PRefLexOR/blob/main/PRefLexOR_inference_thinking.ipynb)
+

### Simple inference:

```python