ArthurFischel commited on
Commit
639f902
1 Parent(s): 1813b17

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -0
README.md CHANGED
@@ -10,6 +10,24 @@ app_file: app.py
10
  pinned: true
11
  ---
12
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
  ## 7/21/23
14
  am going to naively, without evidence, state that you can represent any function in text with a large language model.
15
 
 
10
  pinned: true
11
  ---
12
 
13
+ ## 7/23/23 - Towards A Unified Agent with Foundation Models
14
+ https://arxiv.org/abs/2307.09668
15
+
16
+ Generate synthetic data set for the state that you want, search over the action space until you find a trajectory that reaches a cosine similarity threshold denoted by the state you want, add all those frames and states of the buffer and incorporate into training
17
+
18
+ You can bootstrap process with priors still search for the desired state
19
+
20
+
21
+ ### reward
22
+ Reward any trajectory proportionally to a semantically similar state as any state in a run with a victory condition.
23
+ Linear or some function reward curve
24
+
25
+
26
+ ### Sample curve
27
+ Sections of states with more changes in them
28
+
29
+
30
+
31
  ## 7/21/23
32
  am going to naively, without evidence, state that you can represent any function in text with a large language model.
33