Spaces:

rlancemartin
/

auto-evaluator

Runtime error

App Files Files Community

rlancemartin commited on May 6, 2023

Commit

1ffd09a

1 Parent(s): 2a79a0c

Add example doc and eval set

Browse files

Files changed (3) hide show

docs/karpathy-lex-pod/karpathy-pod-eval.csv +3 -0
docs/karpathy-lex-pod/karpathy-pod-eval.json +12 -0
docs/karpathy-lex-pod/karpathy-pod.txt +0 -0

docs/karpathy-lex-pod/karpathy-pod-eval.csv ADDED Viewed

	@@ -0,0 +1,3 @@

+"question","answer",
+"Why is the transformer architecture expressive in the forward pass?","The transformer architecture is expressive because it uses a general message passing scheme where nodes get to look at each other, decide what's interesting and then update each other.",
+"Why is next word prediction an effective training objective?", "On a sufficiently large dataset, the task of predicting the next word multi-tasks knowledge of a lot of things, including understanding of chemistry, physics, and human nature. You have to understand a lot about the world to make that prediction on an internet-scale dataset.",

docs/karpathy-lex-pod/karpathy-pod-eval.json ADDED Viewed

	@@ -0,0 +1,12 @@

+[
+{"question": "Why is the transformer architecture expressive in the forward pass?",
+  "answer": "The transformer architecture is expressive because it uses a general message passing scheme where nodes get to look at each other, decide what's interesting and then update each other."},
+ {"question": "What design criteria does the Transformer meet?",
+  "answer": "The transformer is very expressive in a forward pass, optimizable in the backward pass using the techniques that we have such as gradient descent, and it can run efficiently on our hardware such as GPUs."},
+  {"question": "Why is next word prediction an effective training objective?",
+  "answer": "On a sufficiently large dataset, the task of predicting the next word multi-tasks knowledge of a lot of things, including understanding of chemistry, physics, and human nature. You have to understand a lot about the world to make that prediction on an internet-scale dataset."},
+  {"question": "What was the World Of Bits project and why did it fail?",
+  "answer": "World Of Bits was an effort to give AI access to tools, such as a keyboard and mouse, in order to complete tasks, such as complete bookings. It failed because it turned out that reinforcement learning is an extremely inefficient way of training neural networks. You take many actions, but you only get a sparse reward once in a while. Starting from scratch, it is very unlikely to stumble on the correct action - such as a booking - by chance at random, so the reward signal is very sparse."},
+ {"question": "Why can additional sensors be a liability in an autonomous vehicle system?",
+  "answer": "Each sensor adds complexity to the system. The hardware must be sourced, versioned, and maintain firmware. Software must ingest it, track versions. The cost of this additional bloat or entropy must be weighted against the added benefit of that particular sensor."}
+]

docs/karpathy-lex-pod/karpathy-pod.txt ADDED Viewed

The diff for this file is too large to render. See raw diff