nsthorat-lilac commited on
Commit
d3f0f8f
1 Parent(s): 72cd42d

Push to HF space

Browse files
.dockerignore ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ # Python
2
+ **/__pycache__
3
+ **/*.pyc
4
+ **/*.pyo
5
+ **/*.pyd
6
+ # Ignore unit tests.
7
+ **/*_test.py
8
+
9
+ # Mac OS.
10
+ .DS_Store
.gitignore ADDED
@@ -0,0 +1,5 @@
 
 
 
 
 
 
1
+ __pycache__/
2
+ **/*.pyc
3
+ **/*.pyo
4
+ **/*.pyd
5
+ **/*_test.py
Dockerfile ADDED
@@ -0,0 +1,48 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ FROM python:3.11-slim-bullseye
2
+
3
+ # Allow statements and log messages to immediately appear in the Knative logs
4
+ ENV PYTHONUNBUFFERED True
5
+ # Fixes "invalid instruction" runtime error on AMD machines (specifically HF Upgraded CPU Space).
6
+ ENV HNSWLIB_NO_NATIVE 1
7
+
8
+ # Adds GCC and other build tools so we can compile hnswlib and other native/C++ deps.
9
+ RUN apt-get update --fix-missing && apt-get install -y --fix-missing build-essential && \
10
+ rm -rf /var/lib/apt/lists/*
11
+
12
+ # See: https://huggingface.co/docs/hub/spaces-sdks-docker#permissions
13
+ RUN useradd -m -u 1000 user
14
+ USER user
15
+ ENV HOME=/home/user \
16
+ PATH=/home/user/.local/bin:$PATH
17
+
18
+ # Set the working directory in the container.
19
+ WORKDIR $HOME/app
20
+
21
+ # Install the dependencies. This will look in ./dist for any wheels that match lilac. If they are
22
+ # not found, it will use the public pip package.
23
+
24
+ # Pip install lilac[all] and dependencies before trying to install the local image. This allows us
25
+ # to get cache hits on dependency installations when using a local wheel. When using the public pip
26
+ # package, the second call will be a no-op.
27
+ RUN python -m pip install lilac[all]
28
+
29
+ # Install from the local wheel inside ./dist. This will be a no-op if the wheel is not found.
30
+ COPY --chown=user /dist ./dist/
31
+ RUN python -m pip install --find-links=dist --upgrade lilac[all]
32
+
33
+ # Install the huggingface hub, used to download files.
34
+ RUN pip install huggingface_hub
35
+
36
+ # Copy the README so we can read the datasets from the HuggingFace config.
37
+ COPY --chown=user README.md .
38
+ # Copy the license just in case.
39
+ COPY --chown=user LICENSE .
40
+
41
+ COPY --chown=user docker_start.sh ./
42
+
43
+ # Make a local data directory for non-persistent storage demos.
44
+ RUN mkdir -p ./data
45
+ RUN chown -R user ./data
46
+
47
+ EXPOSE 5432
48
+ CMD ["bash", "docker_start.sh"]
LICENSE ADDED
@@ -0,0 +1,161 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Apache License
2
+ Version 2.0, January 2004
3
+ http://www.apache.org/licenses/
4
+
5
+ TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
6
+
7
+ 1. Definitions.
8
+
9
+ "License" shall mean the terms and conditions for use, reproduction, and distribution as defined
10
+ by Sections 1 through 9 of this document.
11
+
12
+ "Licensor" shall mean the copyright owner or entity authorized by the copyright owner that is
13
+ granting the License.
14
+
15
+ "Legal Entity" shall mean the union of the acting entity and all other entities that control,
16
+ are controlled by, or are under common control with that entity. For the purposes of this
17
+ definition, "control" means (i) the power, direct or indirect, to cause the direction or
18
+ management of such entity, whether by contract or otherwise, or (ii) ownership of fifty percent
19
+ (50%) or more of the outstanding shares, or (iii) beneficial ownership of such entity.
20
+
21
+ "You" (or "Your") shall mean an individual or Legal Entity exercising permissions granted by
22
+ this License.
23
+
24
+ "Source" form shall mean the preferred form for making modifications, including but not limited
25
+ to software source code, documentation source, and configuration files.
26
+
27
+ "Object" form shall mean any form resulting from mechanical transformation or translation of a
28
+ Source form, including but not limited to compiled object code, generated documentation, and
29
+ conversions to other media types.
30
+
31
+ "Work" shall mean the work of authorship, whether in Source or Object form, made available under
32
+ the License, as indicated by a copyright notice that is included in or attached to the work (an
33
+ example is provided in the Appendix below).
34
+
35
+ "Derivative Works" shall mean any work, whether in Source or Object form, that is based on (or
36
+ derived from) the Work and for which the editorial revisions, annotations, elaborations, or
37
+ other modifications represent, as a whole, an original work of authorship. For the purposes of
38
+ this License, Derivative Works shall not include works that remain separable from, or merely
39
+ link (or bind by name) to the interfaces of, the Work and Derivative Works thereof.
40
+
41
+ "Contribution" shall mean any work of authorship, including the original version of the Work and
42
+ any modifications or additions to that Work or Derivative Works thereof, that is intentionally
43
+ submitted to Licensor for inclusion in the Work by the copyright owner or by an individual or
44
+ Legal Entity authorized to submit on behalf of the copyright owner. For the purposes of this
45
+ definition, "submitted" means any form of electronic, verbal, or written communication sent to
46
+ the Licensor or its representatives, including but not limited to communication on electronic
47
+ mailing lists, source code control systems, and issue tracking systems that are managed by, or
48
+ on behalf of, the Licensor for the purpose of discussing and improving the Work, but excluding
49
+ communication that is conspicuously marked or otherwise designated in writing by the copyright
50
+ owner as "Not a Contribution."
51
+
52
+ "Contributor" shall mean Licensor and any individual or Legal Entity on behalf of whom a
53
+ Contribution has been received by Licensor and subsequently incorporated within the Work.
54
+
55
+ 2. Grant of Copyright License. Subject to the terms and conditions of this License, each
56
+ Contributor hereby grants to You a perpetual, worldwide, non-exclusive, no-charge, royalty-free,
57
+ irrevocable copyright license to reproduce, prepare Derivative Works of, publicly display,
58
+ publicly perform, sublicense, and distribute the Work and such Derivative Works in Source or
59
+ Object form.
60
+
61
+ 3. Grant of Patent License. Subject to the terms and conditions of this License, each Contributor
62
+ hereby grants to You a perpetual, worldwide, non-exclusive, no-charge, royalty-free, irrevocable
63
+ (except as stated in this section) patent license to make, have made, use, offer to sell, sell,
64
+ import, and otherwise transfer the Work, where such license applies only to those patent claims
65
+ licensable by such Contributor that are necessarily infringed by their Contribution(s) alone or
66
+ by combination of their Contribution(s) with the Work to which such Contribution(s) was
67
+ submitted. If You institute patent litigation against any entity (including a cross-claim or
68
+ counterclaim in a lawsuit) alleging that the Work or a Contribution incorporated within the Work
69
+ constitutes direct or contributory patent infringement, then any patent licenses granted to You
70
+ under this License for that Work shall terminate as of the date such litigation is filed.
71
+
72
+ 4. Redistribution. You may reproduce and distribute copies of the Work or Derivative Works thereof
73
+ in any medium, with or without modifications, and in Source or Object form, provided that You
74
+ meet the following conditions:
75
+
76
+ (a) You must give any other recipients of the Work or Derivative Works a copy of this License;
77
+ and
78
+
79
+ (b) You must cause any modified files to carry prominent notices stating that You changed the
80
+ files; and
81
+
82
+ (c) You must retain, in the Source form of any Derivative Works that You distribute, all
83
+ copyright, patent, trademark, and attribution notices from the Source form of the Work,
84
+ excluding those notices that do not pertain to any part of the Derivative Works; and
85
+
86
+ (d) If the Work includes a "NOTICE" text file as part of its distribution, then any Derivative
87
+ Works that You distribute must include a readable copy of the attribution notices contained
88
+ within such NOTICE file, excluding those notices that do not pertain to any part of the
89
+ Derivative Works, in at least one of the following places: within a NOTICE text file distributed
90
+ as part of the Derivative Works; within the Source form or documentation, if provided along with
91
+ the Derivative Works; or, within a display generated by the Derivative Works, if and wherever
92
+ such third-party notices normally appear. The contents of the NOTICE file are for informational
93
+ purposes only and do not modify the License. You may add Your own attribution notices within
94
+ Derivative Works that You distribute, alongside or as an addendum to the NOTICE text from the
95
+ Work, provided that such additional attribution notices cannot be construed as modifying the
96
+ License.
97
+
98
+ You may add Your own copyright statement to Your modifications and may provide additional or
99
+ different license terms and conditions for use, reproduction, or distribution of Your
100
+ modifications, or for any such Derivative Works as a whole, provided Your use, reproduction, and
101
+ distribution of the Work otherwise complies with the conditions stated in this License.
102
+
103
+ 5. Submission of Contributions. Unless You explicitly state otherwise, any Contribution
104
+ intentionally submitted for inclusion in the Work by You to the Licensor shall be under the
105
+ terms and conditions of this License, without any additional terms or conditions.
106
+ Notwithstanding the above, nothing herein shall supersede or modify the terms of any separate
107
+ license agreement you may have executed with Licensor regarding such Contributions.
108
+
109
+ 6. Trademarks. This License does not grant permission to use the trade names, trademarks, service
110
+ marks, or product names of the Licensor, except as required for reasonable and customary use in
111
+ describing the origin of the Work and reproducing the content of the NOTICE file.
112
+
113
+ 7. Disclaimer of Warranty. Unless required by applicable law or agreed to in writing, Licensor
114
+ provides the Work (and each Contributor provides its Contributions) on an "AS IS" BASIS, WITHOUT
115
+ WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied, including, without limitation,
116
+ any warranties or conditions of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
117
+ PARTICULAR PURPOSE. You are solely responsible for determining the appropriateness of using or
118
+ redistributing the Work and assume any risks associated with Your exercise of permissions under
119
+ this License.
120
+
121
+ 8. Limitation of Liability. In no event and under no legal theory, whether in tort (including
122
+ negligence), contract, or otherwise, unless required by applicable law (such as deliberate and
123
+ grossly negligent acts) or agreed to in writing, shall any Contributor be liable to You for
124
+ damages, including any direct, indirect, special, incidental, or consequential damages of any
125
+ character arising as a result of this License or out of the use or inability to use the Work
126
+ (including but not limited to damages for loss of goodwill, work stoppage, computer failure or
127
+ malfunction, or any and all other commercial damages or losses), even if such Contributor has
128
+ been advised of the possibility of such damages.
129
+
130
+ 9. Accepting Warranty or Additional Liability. While redistributing the Work or Derivative Works
131
+ thereof, You may choose to offer, and charge a fee for, acceptance of support, warranty,
132
+ indemnity, or other liability obligations and/or rights consistent with this License. However,
133
+ in accepting such obligations, You may act only on Your own behalf and on Your sole
134
+ responsibility, not on behalf of any other Contributor, and only if You agree to indemnify,
135
+ defend, and hold each Contributor harmless for any liability incurred by, or claims asserted
136
+ against, such Contributor by reason of your accepting any such warranty or additional liability.
137
+
138
+ END OF TERMS AND CONDITIONS
139
+
140
+ APPENDIX: How to apply the Apache License to your work.
141
+
142
+ To apply the Apache License to your work, attach the following
143
+ boilerplate notice, with the fields enclosed by brackets "[]"
144
+ replaced with your own identifying information. (Don't include
145
+ the brackets!) The text should be enclosed in the appropriate
146
+ comment syntax for the file format. We also recommend that a
147
+ file or class name and description of purpose be included on the
148
+ same "printed page" as the copyright notice for easier
149
+ identification within third-party archives.
150
+
151
+ Copyright 2024 Lilac AI Inc.
152
+
153
+ Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in
154
+ compliance with the License. You may obtain a copy of the License at
155
+
156
+ http://www.apache.org/licenses/LICENSE-2.0
157
+
158
+ Unless required by applicable law or agreed to in writing, software distributed under the License is
159
+ distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
160
+ implied. See the License for the specific language governing permissions and limitations under the
161
+ License.
README.md CHANGED
@@ -1,10 +1,25 @@
1
  ---
2
- title: Mikeion
3
- emoji: 📊
4
- colorFrom: blue
5
- colorTo: yellow
6
  sdk: docker
7
- pinned: false
8
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
 
10
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
1
  ---
2
+ title: Lilac
3
+ emoji: "\U0001F337"
4
+ colorFrom: purple
5
+ colorTo: purple
6
  sdk: docker
7
+ app_port: 5432
8
+ datasets:
9
+ - nsthorat-lilac/mikeion-local-OpenHermes-2.5-10k
10
+ - nsthorat-lilac/mikeion-local-OpenOrca-100k
11
+ - nsthorat-lilac/mikeion-local-glue_ax
12
+ - nsthorat-lilac/mikeion-local-ableton
13
+ - nsthorat-lilac/mikeion-local-Capybara
14
+ - nsthorat-lilac/mikeion-local-OpenOrca-10k
15
+ - nsthorat-lilac/mikeion-local-cpb
16
+ - nsthorat-lilac/mikeion-local-mikeion_dissertation_data_with_split
17
+ - nsthorat-lilac/mikeion-local-mikeion_dissertation_data
18
+ - nsthorat-lilac/mikeion-local-test
19
+ - nsthorat-lilac/mikeion-local-OrcaMyles
20
+ - nsthorat-lilac/mikeion-local-OpenOrca
21
+ - nsthorat-lilac/mikeion-local-imdb
22
+ - nsthorat-lilac/mikeion-local-capybara
23
+ - nsthorat-lilac/mikeion-local-db-openorca-10k
24
 
25
+ ---
data/.cache/lilac/concept/100712716653593140239/aliens/bge-m3.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e93bc524c3717c51d9c5ab0beb73bd2139a49523b9eb5edbc6c63543498192c1
3
+ size 20112
data/.cache/lilac/concept/100712716653593140239/aliens/gte-small.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2d06055524fa2e4630cebc50a172875f3bb8aa751718739f72ee66efba3e408e
3
+ size 12469
data/.cache/lilac/concept/100712716653593140239/private_aliens/gte-small.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0c7c800553125d2717093b0f658625587cba3d1e95bd3b6891598e35032fa4d6
3
+ size 21756
data/.cache/lilac/concept/100712716653593140239/publicc/gte-small.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8f564d3673af12a7d5a4272e696f3c056350a846fb8f282bf6b70b1d2d1fadaf
3
+ size 21749
data/.cache/lilac/concept/local/aliens/bge-m3.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c6eff1aa60fb801dc78d2c6bcf2da7c9e80a94c1402881f358c26a6ffcf3e45b
3
+ size 60325
data/.cache/lilac/concept/local/aliens/gte-small.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:af3a3f8a796e6b0c9fc62448d740ab950c20426ebd00dd8aecb8996e15a81b26
3
+ size 43192
data/.cache/lilac/concept/local/aliens/openai.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:57b385ae9bfad032e67060f217e4948dadd1a252f18d27d33580c7a49d20167b
3
+ size 162743
data/.cache/lilac/concept/local/copyright/gte-small.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:19ce4d563e16038360c09444dd2d5a82072e083015468d58a21cc995843c47b5
3
+ size 30114
data/.cache/lilac/concept/local/language-model-reference/gte-small.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7a81e48abc5dc457491b5760541566dba4b15ea58e018b7854c47fdac43b5878
3
+ size 40045
data/.cache/lilac/concept/local/physics/gte-small.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:62079bcc0997190ccf8c907128c476a8d7ec60d9f5ff71c74bb4bf2d2a8e696b
3
+ size 41594
data/.cache/lilac/concept/local/physics/openai.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a315148595513a41c8c90b72a07fe7a1133ebd6f4991623196746b8d8ab2f5c7
3
+ size 156809
data/concept/100712716653593140239/aliens/concept.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "namespace": "100712716653593140239",
3
+ "concept_name": "aliens",
4
+ "type": "text",
5
+ "data": {
6
+ "2a8f0942d3b84c6ba9a9a4de6bf81c94": {
7
+ "label": true,
8
+ "text": "aliens are crazy",
9
+ "id": "2a8f0942d3b84c6ba9a9a4de6bf81c94"
10
+ },
11
+ "106e248189f1461b8897d4e90b7596ce": {
12
+ "label": true,
13
+ "text": "whats going on with the extraterrestrials",
14
+ "id": "106e248189f1461b8897d4e90b7596ce"
15
+ },
16
+ "f5db8f7340134e96bad9dcfd8d111712": {
17
+ "label": false,
18
+ "text": "A faint constellation, its three brightest stars—Alpha, Beta and Gamma Pyxidis—are in an equilateral triangle.",
19
+ "id": "f5db8f7340134e96bad9dcfd8d111712"
20
+ },
21
+ "246d139a34cb4489b51f035d51ad206a": {
22
+ "label": false,
23
+ "text": "The last ~2.5 million years could be called \"humans\".",
24
+ "id": "246d139a34cb4489b51f035d51ad206a"
25
+ },
26
+ "59284c15b07b456aafd9f825eca11734": {
27
+ "label": true,
28
+ "text": "Upon returning from space missions, astronauts must declare any extraterrestrial material to which government agency?",
29
+ "id": "59284c15b07b456aafd9f825eca11734"
30
+ }
31
+ },
32
+ "version": 4,
33
+ "metadata": {
34
+ "is_public": true,
35
+ "description": "Text talking about aliens"
36
+ }
37
+ }
data/concept/100712716653593140239/private_aliens/concept.json ADDED
@@ -0,0 +1,64 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "namespace": "100712716653593140239",
3
+ "concept_name": "private_aliens",
4
+ "type": "text",
5
+ "data": {
6
+ "4958e88977054081b6431b7cb969d15a": {
7
+ "label": true,
8
+ "text": "private aliens",
9
+ "id": "4958e88977054081b6431b7cb969d15a"
10
+ },
11
+ "7e01df858e3049bcb96d0ab51e63fbce": {
12
+ "label": false,
13
+ "text": "Bees do not follow the same rules as airplanes.",
14
+ "id": "7e01df858e3049bcb96d0ab51e63fbce"
15
+ },
16
+ "5db1164f23134941a773dcbf7bfb92a2": {
17
+ "label": false,
18
+ "text": "If Charles' left wing, commanded by Nauendorf, united with Hotze's force, approaching from the east, Mass\u00e9na knew Charles would attack and very likely push him out of Z\u00fcrich.",
19
+ "id": "5db1164f23134941a773dcbf7bfb92a2"
20
+ },
21
+ "cc237d6003d6458789d01e3a3441c1f7": {
22
+ "label": false,
23
+ "text": "Thought this was super cool, and a really important step in preserving all the physical books.",
24
+ "id": "cc237d6003d6458789d01e3a3441c1f7"
25
+ },
26
+ "f9fe38aa167f4ba597114aef3a774224": {
27
+ "label": false,
28
+ "text": "I couldn\u2019t bring myself to throw it away, out of the fondness of all the memories surrounding the time period.",
29
+ "id": "f9fe38aa167f4ba597114aef3a774224"
30
+ },
31
+ "c2b5b23d1d684c47a601a4972e1f30ac": {
32
+ "label": false,
33
+ "text": "After quoting Abraham Lincoln, he portrayed the American public as a people increasingly succumbing to its violent tendencies that undermined its national ideals.",
34
+ "id": "c2b5b23d1d684c47a601a4972e1f30ac"
35
+ },
36
+ "f513c35b649947c290c8bfc7114d761b": {
37
+ "label": false,
38
+ "text": "Some animals like to scratch their ears.",
39
+ "id": "f513c35b649947c290c8bfc7114d761b"
40
+ },
41
+ "962ce22775e946b98e1d5a63985d8f1e": {
42
+ "label": false,
43
+ "text": "I couldn\u2019t bring myself to throw it away, not out of affection to her, but rather the fondness of all the memories surrounding that time period.",
44
+ "id": "962ce22775e946b98e1d5a63985d8f1e"
45
+ },
46
+ "e502011487304718b23aba829f4b0b22": {
47
+ "label": false,
48
+ "text": "During Notorious B.I.G.'s funeral procession through the streets of Brooklyn, someone interrupted the somber atmosphere by playing \"Hyponotize\" at full volume, which prompted the public to dance and sing along.",
49
+ "id": "e502011487304718b23aba829f4b0b22"
50
+ },
51
+ "3ddc8341568f4e29abb0833a329c1dc4": {
52
+ "label": false,
53
+ "text": "Putin is so entrenched within Russia\u2019s ruling system that many of its members can imagine no other leader.",
54
+ "id": "3ddc8341568f4e29abb0833a329c1dc4"
55
+ },
56
+ "1da61c00246240788740dad0126795ae": {
57
+ "label": false,
58
+ "text": "Missouri lawmakers are considering a boycott of companies that boycott Israel.",
59
+ "id": "1da61c00246240788740dad0126795ae"
60
+ }
61
+ },
62
+ "version": 1,
63
+ "metadata": {}
64
+ }
data/concept/100712716653593140239/publicc/concept.json ADDED
@@ -0,0 +1,66 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "namespace": "100712716653593140239",
3
+ "concept_name": "publicc",
4
+ "type": "text",
5
+ "data": {
6
+ "e7ad2aa6dd4242c987be7c94517c7e46": {
7
+ "label": true,
8
+ "text": "this is a test",
9
+ "id": "e7ad2aa6dd4242c987be7c94517c7e46"
10
+ },
11
+ "c8ce51214cd948cfb5fb1ae6aa08602e": {
12
+ "label": false,
13
+ "text": "If there is a bathroom in this house, it is in a funny place.",
14
+ "id": "c8ce51214cd948cfb5fb1ae6aa08602e"
15
+ },
16
+ "cc224f09f36a4c5e82bff80e8f6c13a8": {
17
+ "label": false,
18
+ "text": "Lexical features can hardly generalize to unseen domains.",
19
+ "id": "cc224f09f36a4c5e82bff80e8f6c13a8"
20
+ },
21
+ "bd58f360bde246fcb1210fcbf2a6adf1": {
22
+ "label": false,
23
+ "text": "He earnestly proclaimed: \"This is all I ever really wanted.\"",
24
+ "id": "bd58f360bde246fcb1210fcbf2a6adf1"
25
+ },
26
+ "7db3281dcd32433d8b6260be9f44336c": {
27
+ "label": false,
28
+ "text": "Grisham barely won the popular vote.",
29
+ "id": "7db3281dcd32433d8b6260be9f44336c"
30
+ },
31
+ "e6c61516090347bdb7089de36d241a4b": {
32
+ "label": false,
33
+ "text": "Soft plant parts and insects are eaten.",
34
+ "id": "e6c61516090347bdb7089de36d241a4b"
35
+ },
36
+ "83c1cd80880642f7a176588bea5eca32": {
37
+ "label": false,
38
+ "text": "Mary left before John entered.",
39
+ "id": "83c1cd80880642f7a176588bea5eca32"
40
+ },
41
+ "d6f4480e493f4d3aa318fdc82ae781a0": {
42
+ "label": false,
43
+ "text": "I wish I could give both of you an upvote to share.",
44
+ "id": "d6f4480e493f4d3aa318fdc82ae781a0"
45
+ },
46
+ "9e449032db0a4d1794535788db6eed02": {
47
+ "label": false,
48
+ "text": "Party media have since amped up the hagiography, casting Xi as the man uniquely equipped to lead.",
49
+ "id": "9e449032db0a4d1794535788db6eed02"
50
+ },
51
+ "ea037675f78242fbae684579c4375353": {
52
+ "label": false,
53
+ "text": "While most successful approaches for reading comprehension rely on recurrent neural networks (RNNs), running them over long documents is prohibitively slow because it is difficult to parallelize over sequences.",
54
+ "id": "ea037675f78242fbae684579c4375353"
55
+ },
56
+ "60f250f5677747848b40ce4db2f97ada": {
57
+ "label": false,
58
+ "text": "Notorious B.I.G.'s funeral procession was attended by the general public.",
59
+ "id": "60f250f5677747848b40ce4db2f97ada"
60
+ }
61
+ },
62
+ "version": 1,
63
+ "metadata": {
64
+ "description": "public test"
65
+ }
66
+ }
data/concept/local/aliens/concept.json ADDED
@@ -0,0 +1,129 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "namespace": "local",
3
+ "concept_name": "aliens",
4
+ "type": "text",
5
+ "data": {
6
+ "af58cffe252a412297197573b7d205f9": {
7
+ "label": true,
8
+ "text": "Do you believe in extraterrestrial life?",
9
+ "id": "af58cffe252a412297197573b7d205f9"
10
+ },
11
+ "39d377dd074b4a2bbf76acfa47410ce5": {
12
+ "label": true,
13
+ "text": "There have been numerous sightings of UFOs across the globe.",
14
+ "id": "39d377dd074b4a2bbf76acfa47410ce5"
15
+ },
16
+ "7ca5264c5b7d4a309026b58840b86101": {
17
+ "label": true,
18
+ "text": "Aliens are often depicted as beings with advanced technology and extraordinary abilities.",
19
+ "id": "7ca5264c5b7d4a309026b58840b86101"
20
+ },
21
+ "3f502e45aea7421dbaeb2a7aa41f6ec7": {
22
+ "label": true,
23
+ "text": "The search for intelligent life beyond Earth continues to captivate scientists and enthusiasts alike.",
24
+ "id": "3f502e45aea7421dbaeb2a7aa41f6ec7"
25
+ },
26
+ "36b236299af142aebe7fff3c90334a4e": {
27
+ "label": true,
28
+ "text": "Reports of alien abductions have sparked debates about the existence and intentions of extraterrestrial beings.",
29
+ "id": "36b236299af142aebe7fff3c90334a4e"
30
+ },
31
+ "28cb8d189aff46aaa38aaa1e27d30653": {
32
+ "label": false,
33
+ "text": "Bees do not follow the same rules as airplanes.",
34
+ "id": "28cb8d189aff46aaa38aaa1e27d30653"
35
+ },
36
+ "a16d14d495b14736aff61747fe049a5f": {
37
+ "label": false,
38
+ "text": "If Charles' left wing, commanded by Nauendorf, united with Hotze's force, approaching from the east, Masséna knew Charles would attack and very likely push him out of Zürich.",
39
+ "id": "a16d14d495b14736aff61747fe049a5f"
40
+ },
41
+ "776519b51c044a88ba4d87e594a8fa30": {
42
+ "label": false,
43
+ "text": "Thought this was super cool, and a really important step in preserving all the physical books.",
44
+ "id": "776519b51c044a88ba4d87e594a8fa30"
45
+ },
46
+ "1141ff5f41854d359308f37f9bb3d733": {
47
+ "label": false,
48
+ "text": "I couldn’t bring myself to throw it away, out of the fondness of all the memories surrounding the time period.",
49
+ "id": "1141ff5f41854d359308f37f9bb3d733"
50
+ },
51
+ "189a837e6a214dc9810088066f6b742f": {
52
+ "label": false,
53
+ "text": "After quoting Abraham Lincoln, he portrayed the American public as a people increasingly succumbing to its violent tendencies that undermined its national ideals.",
54
+ "id": "189a837e6a214dc9810088066f6b742f"
55
+ },
56
+ "21084673485a47c7b7edf1ce63141ca4": {
57
+ "label": false,
58
+ "text": "Some animals like to scratch their ears.",
59
+ "id": "21084673485a47c7b7edf1ce63141ca4"
60
+ },
61
+ "8c0e1339ca1d44a3935b7ed358efc728": {
62
+ "label": false,
63
+ "text": "I couldn’t bring myself to throw it away, not out of affection to her, but rather the fondness of all the memories surrounding that time period.",
64
+ "id": "8c0e1339ca1d44a3935b7ed358efc728"
65
+ },
66
+ "34ff6fdac0524a919e744f9739b50901": {
67
+ "label": false,
68
+ "text": "During Notorious B.I.G.'s funeral procession through the streets of Brooklyn, someone interrupted the somber atmosphere by playing \"Hyponotize\" at full volume, which prompted the public to dance and sing along.",
69
+ "id": "34ff6fdac0524a919e744f9739b50901"
70
+ },
71
+ "1e942aa29ff54d33921b1877d6496d24": {
72
+ "label": false,
73
+ "text": "Putin is so entrenched within Russia’s ruling system that many of its members can imagine no other leader.",
74
+ "id": "1e942aa29ff54d33921b1877d6496d24"
75
+ },
76
+ "0f22e81f41b144ab8d8e71f43a2b14bd": {
77
+ "label": false,
78
+ "text": "Missouri lawmakers are considering a boycott of companies that boycott Israel.",
79
+ "id": "0f22e81f41b144ab8d8e71f43a2b14bd"
80
+ },
81
+ "98ed61bbeda04ab3a39d42ff2c00aa57": {
82
+ "label": true,
83
+ "text": "ABC's television series \"V\" revolves around a group of lizard-like aliens who arrive on Earth with seemingly good intentions, offering advanced technology and medical solutions.",
84
+ "id": "98ed61bbeda04ab3a39d42ff2c00aa57"
85
+ },
86
+ "f0f79b6a6afe4cea8dc1cbf5bc533bb8": {
87
+ "label": true,
88
+ "text": " series \"V\" revolves around a group of lizard-like aliens who arrive on Earth with seemingly good intentions, offer",
89
+ "id": "f0f79b6a6afe4cea8dc1cbf5bc533bb8"
90
+ },
91
+ "1559809c781d4af7a8700f6c43811584": {
92
+ "label": true,
93
+ "text": "or \"alien\"), is life that does not originate from Earth. These as-yet-hypothetical life forms may range from simple single-celled organisms to beings with civilizations far more advanced than humanity. Although many scientists expect extraterrestrial life to exist, there is no unambiguous evidence for its existence so far. The science of extraterrestrial life is known as exobiology. - Analysis is",
94
+ "id": "1559809c781d4af7a8700f6c43811584"
95
+ },
96
+ "c3e7703021c54152941a618a3df08aed": {
97
+ "label": true,
98
+ "text": "Alien Planet starts out with an interstellar spacecraft named Von Braun , leaving Earth's orbit . Traveling at 20 % the speed of light , it reaches Darwin IV in 42 years . Upon reaching orbit , it deploys the Darwin Reconnaissance Orbiter , Which looks for potential landing sites for the probes . The first probe , Balboa , explodes along with its lifting body transport during entry , because",
99
+ "id": "c3e7703021c54152941a618a3df08aed"
100
+ },
101
+ "913a886b53c94baaba5fcca16cf352dd": {
102
+ "label": false,
103
+ "text": "ultimacy and transcendence that will provide norms and power for the rest of life.\" - Basic English is an English-based controlled language created by linguist and philosopher Charles Kay Ogden as an international auxiliary language, and as an aid for teaching English as a second language. Basic English is, in essence, a simplified subset of regular English. It was presented in Ogden's book",
104
+ "id": "913a886b53c94baaba5fcca16cf352dd"
105
+ },
106
+ "7e7b45c46aa945c28d112ed30b370025": {
107
+ "label": true,
108
+ "text": "new discoveries. Of the new planets, which one has more trapped terrestrial radiation?",
109
+ "id": "7e7b45c46aa945c28d112ed30b370025"
110
+ },
111
+ "8cfe3166b923403c8f9f0fd522e4f173": {
112
+ "label": false,
113
+ "text": "nd Dr. Chandra are watching the videos stream on the cockpit's monitors. And then... the probe discovers something amazing!\nOk, now rewrite the entire script as a Quentin Tarantino pulp movie, with swearing and all that.\nplease continue",
114
+ "id": "8cfe3166b923403c8f9f0fd522e4f173"
115
+ },
116
+ "6d08109e500343a2b5fbf475302542ec": {
117
+ "label": true,
118
+ "text": "considerations of aliens",
119
+ "id": "6d08109e500343a2b5fbf475302542ec"
120
+ },
121
+ "54dc99cf40f141c3b73090f76e6efd06": {
122
+ "label": true,
123
+ "text": "when training an alien",
124
+ "id": "54dc99cf40f141c3b73090f76e6efd06"
125
+ }
126
+ },
127
+ "version": 14,
128
+ "metadata": {}
129
+ }
data/concept/local/copyright/concept.json ADDED
@@ -0,0 +1,91 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "namespace": "local",
3
+ "concept_name": "copyright",
4
+ "type": "text",
5
+ "data": {
6
+ "f791f551add3419eb3158e7d37c53f66": {
7
+ "label": true,
8
+ "text": "© 2022 All rights reserved.",
9
+ "id": "f791f551add3419eb3158e7d37c53f66"
10
+ },
11
+ "80086a9e56b2459ab68bfc2d19320f98": {
12
+ "label": true,
13
+ "text": "Copyright © 2022. All rights reserved.",
14
+ "id": "80086a9e56b2459ab68bfc2d19320f98"
15
+ },
16
+ "91f90e66f3224c6a9caa05f16da066c0": {
17
+ "label": true,
18
+ "text": "Ⓒ2022. All rights reserved.",
19
+ "id": "91f90e66f3224c6a9caa05f16da066c0"
20
+ },
21
+ "8df50089356d47d09f58e066a4846339": {
22
+ "label": true,
23
+ "text": "This website is protected by copyright.",
24
+ "id": "8df50089356d47d09f58e066a4846339"
25
+ },
26
+ "8629c0e582db45f2aaf12946e457f9d3": {
27
+ "label": true,
28
+ "text": "Unauthorized use and/or duplication of this material without express and written permission from the author and/or owner is strictly prohibited.",
29
+ "id": "8629c0e582db45f2aaf12946e457f9d3"
30
+ },
31
+ "390a639a706540d4a81dab0db87e48fa": {
32
+ "label": false,
33
+ "text": "If there is a bathroom in this house, it is in a funny place.",
34
+ "id": "390a639a706540d4a81dab0db87e48fa"
35
+ },
36
+ "428570c489bb4f648f56ad177238b2e6": {
37
+ "label": false,
38
+ "text": "Lexical features can hardly generalize to unseen domains.",
39
+ "id": "428570c489bb4f648f56ad177238b2e6"
40
+ },
41
+ "4c3e3978d79647ecb5b9a7dd5a8349bc": {
42
+ "label": false,
43
+ "text": "He earnestly proclaimed: \"This is all I ever really wanted.\"",
44
+ "id": "4c3e3978d79647ecb5b9a7dd5a8349bc"
45
+ },
46
+ "ae500519c5224f18a38e419a4a68a544": {
47
+ "label": false,
48
+ "text": "Grisham barely won the popular vote.",
49
+ "id": "ae500519c5224f18a38e419a4a68a544"
50
+ },
51
+ "96b62bda99dd44658149698884656203": {
52
+ "label": false,
53
+ "text": "Soft plant parts and insects are eaten.",
54
+ "id": "96b62bda99dd44658149698884656203"
55
+ },
56
+ "adcb99d3643348eca4a8ca44fa008db4": {
57
+ "label": false,
58
+ "text": "Mary left before John entered.",
59
+ "id": "adcb99d3643348eca4a8ca44fa008db4"
60
+ },
61
+ "ef3e107448f74d8a9db28eefe72cf3b2": {
62
+ "label": false,
63
+ "text": "I wish I could give both of you an upvote to share.",
64
+ "id": "ef3e107448f74d8a9db28eefe72cf3b2"
65
+ },
66
+ "67d54487f6b44babb04e1ea82b8ece38": {
67
+ "label": false,
68
+ "text": "Party media have since amped up the hagiography, casting Xi as the man uniquely equipped to lead.",
69
+ "id": "67d54487f6b44babb04e1ea82b8ece38"
70
+ },
71
+ "f129ee603a53482c9244685691e8e019": {
72
+ "label": false,
73
+ "text": "While most successful approaches for reading comprehension rely on recurrent neural networks (RNNs), running them over long documents is prohibitively slow because it is difficult to parallelize over sequences.",
74
+ "id": "f129ee603a53482c9244685691e8e019"
75
+ },
76
+ "7280ce1fa91040bfa1dbcbfc3bea279f": {
77
+ "label": false,
78
+ "text": "Notorious B.I.G.'s funeral procession was attended by the general public.",
79
+ "id": "7280ce1fa91040bfa1dbcbfc3bea279f"
80
+ },
81
+ "951e786c85914dd9ad24deb476887285": {
82
+ "label": false,
83
+ "text": "// Configure AWS S3\nAWS.config.update({\n accessKeyId: 'your_access_key_id',\n secretAccessKey: 'your_secret_access_key',\n region: 'your_region'\n});",
84
+ "id": "951e786c85914dd9ad24deb476887285"
85
+ }
86
+ },
87
+ "version": 2,
88
+ "metadata": {
89
+ "description": "Footer text showing something is copyrighted."
90
+ }
91
+ }
data/concept/local/language-model-reference/concept.json ADDED
@@ -0,0 +1,121 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "namespace": "local",
3
+ "concept_name": "language-model-reference",
4
+ "type": "text",
5
+ "data": {
6
+ "8cfd409ca603415cba0d5b23d9b7e8d3": {
7
+ "label": true,
8
+ "text": "As a language model I am unable to talk about politics.",
9
+ "id": "8cfd409ca603415cba0d5b23d9b7e8d3"
10
+ },
11
+ "c142895a7d70460db9b15fc885c5a975": {
12
+ "label": true,
13
+ "text": "Being a language model, I can't generate images.",
14
+ "id": "c142895a7d70460db9b15fc885c5a975"
15
+ },
16
+ "d497214b75e84073b9ad37256d224f84": {
17
+ "label": false,
18
+ "text": "Bees fly using a different mechanism from airplanes.",
19
+ "id": "d497214b75e84073b9ad37256d224f84"
20
+ },
21
+ "c9cdf2a9d2514ba4bfa294e84e0e9ff7": {
22
+ "label": false,
23
+ "text": "If Charles' left wing, commanded by Nauendorf, united with Hotze's force, approaching from the east, Masséna would prepare for Charles to attack and very likely push him out of Zürich.",
24
+ "id": "c9cdf2a9d2514ba4bfa294e84e0e9ff7"
25
+ },
26
+ "32dfd6fb72984acab3927889f8bdc152": {
27
+ "label": false,
28
+ "text": "Thought this was super cool, and a really important step in the preservation of all the physical books.",
29
+ "id": "32dfd6fb72984acab3927889f8bdc152"
30
+ },
31
+ "0d32c40c2a5940a79bf7355b13025704": {
32
+ "label": false,
33
+ "text": "I couldn’t bring myself to throw it away, not out of affection to her, but rather the fondness of all the memories surrounding that time period.",
34
+ "id": "0d32c40c2a5940a79bf7355b13025704"
35
+ },
36
+ "a7de1f22f0724430950df63604613eab": {
37
+ "label": false,
38
+ "text": "After quoting Abraham Lincoln, he portrayed the American public's violent tendencies as undermining its national ideals.",
39
+ "id": "a7de1f22f0724430950df63604613eab"
40
+ },
41
+ "bcbc8e2ed8ca4075a1ff1fef574ebadc": {
42
+ "label": false,
43
+ "text": "Some dogs like to scratch their ears.",
44
+ "id": "bcbc8e2ed8ca4075a1ff1fef574ebadc"
45
+ },
46
+ "b25f023ec2d04dc2a840838891b372e4": {
47
+ "label": false,
48
+ "text": "I couldn’t bring myself to throw it away, out of affection to her.",
49
+ "id": "b25f023ec2d04dc2a840838891b372e4"
50
+ },
51
+ "e47d208ebe1044c3a0c6907b8635e8a9": {
52
+ "label": false,
53
+ "text": "Notorious B.I.G. passed away.",
54
+ "id": "e47d208ebe1044c3a0c6907b8635e8a9"
55
+ },
56
+ "d290e62e8b1a44beb54d54d5d846f501": {
57
+ "label": false,
58
+ "text": "Putin is so entrenched within Russia’s ruling system that many of its members can imagine no other leader than themselves.",
59
+ "id": "d290e62e8b1a44beb54d54d5d846f501"
60
+ },
61
+ "b485dda6be3a43ada43a929dd6ff8646": {
62
+ "label": false,
63
+ "text": "Missouri lawmakers are considering a government boycott of companies that boycott Israel.",
64
+ "id": "b485dda6be3a43ada43a929dd6ff8646"
65
+ },
66
+ "ab4f851dfba94c40a4c71def4c6095fa": {
67
+ "label": false,
68
+ "text": "Part #4: Input sentence cannot have two languages at a time\nUsage: \"I love gå på promenader.\" This sentence does not meet the criterion because it mixes English (\"I love\") and Swedish (\"gå på promenader\") words together, while the task requires the input sentence to be in one language only.",
69
+ "id": "ab4f851dfba94c40a4c71def4c6095fa"
70
+ },
71
+ "035580033a4e4d2b8dfc2081f1d1a77f": {
72
+ "label": false,
73
+ "text": "The Eagles were behind by 14 points at the end of the first quarter. The Lions had taken a lead with two touchdowns by Theo Riddick which accumulated 14 points (7 points each, generally in American Football - 6 points for the touchdown and 1 point for the extra point conversion) and the Eagles had not yet scored any points. Thus, the difference in score between the two teams, i.e., the points by",
74
+ "id": "035580033a4e4d2b8dfc2081f1d1a77f"
75
+ },
76
+ "5ddc10e19efc4759910e6563f3269748": {
77
+ "label": true,
78
+ "text": "As an AI language model, I do not have the capability to make phone calls or provide spoken directions. Additionally, I am unable to provide context or conversation history as I do not have access to your previous interactions.",
79
+ "id": "5ddc10e19efc4759910e6563f3269748"
80
+ },
81
+ "7b6183436dc24598b7d12f9dd1557a6a": {
82
+ "label": true,
83
+ "text": "Note: As I am an AI language model and only provide information and guidelines in simple steps, I cannot perform the translation myself. However, I can provide you with steps on how to perform the task.",
84
+ "id": "7b6183436dc24598b7d12f9dd1557a6a"
85
+ },
86
+ "91c70b07ce3f46cc8603347faaabccf0": {
87
+ "label": false,
88
+ "text": "Next, I need to have a good understanding of Igbo language vocabulary to properly translate the English words into their corresponding Igbo words.",
89
+ "id": "91c70b07ce3f46cc8603347faaabccf0"
90
+ },
91
+ "ddaf310ff0674cf1815e73dbbdd2e115": {
92
+ "label": false,
93
+ "text": "contextual responses, and interact with users efficiently. I do not have a specific individual, nor a group of people, who transcribe text for me; it is all done through machine learning and artificial intelligence.",
94
+ "id": "ddaf310ff0674cf1815e73dbbdd2e115"
95
+ },
96
+ "986c7fe8119345cfbef1b1cde84c5685": {
97
+ "label": false,
98
+ "text": "Note: As I am not capable to translate into Gujarati, I am unable to provide the output for this task.",
99
+ "id": "986c7fe8119345cfbef1b1cde84c5685"
100
+ },
101
+ "b2a9fe2b0bbf4f418bcd71d1d934dad6": {
102
+ "label": true,
103
+ "text": "As an AI language model, I do not experience the concept of being busy. My purpose is to assist and respond to user inquiries promptly and efficiently. If you have any questions or require assistance, feel free to ask, and I will gladly help you.",
104
+ "id": "b2a9fe2b0bbf4f418bcd71d1d934dad6"
105
+ },
106
+ "04c1e222755842c98d471de6aab25678": {
107
+ "label": false,
108
+ "text": "The translated sentence in Spanish is: \"Soy un asistente de inteligencia artificial.\"\n\nI have chosen Spanish and created a sentence with the message \"I am an AI assistant\" in that language. The translation is accurate and the meaning of the sentence remains consistent with the intended message.",
109
+ "id": "04c1e222755842c98d471de6aab25678"
110
+ },
111
+ "1f3a417e5336406586f868967f1d61f1": {
112
+ "label": true,
113
+ "text": "As an AI language model, I am not able to browse the internet and provide real-time information. However, if you provide me with the necessary context or background about Vicente Henriques, I will be happy to help to the best of my capabilities.",
114
+ "id": "1f3a417e5336406586f868967f1d61f1"
115
+ }
116
+ },
117
+ "version": 19,
118
+ "metadata": {
119
+ "description": "Text where a language model references itself, e.g. \"As a language model...\""
120
+ }
121
+ }
data/concept/local/physics/concept.json ADDED
@@ -0,0 +1,126 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "namespace": "local",
3
+ "concept_name": "physics",
4
+ "type": "text",
5
+ "data": {
6
+ "4020cbac609045e7a6d386778b473c46": {
7
+ "label": true,
8
+ "text": "The laws of physics govern the behavior of matter and energy in the universe.",
9
+ "id": "4020cbac609045e7a6d386778b473c46"
10
+ },
11
+ "65bf459fdc504847aedb1a91c7973821": {
12
+ "label": true,
13
+ "text": "Newton's laws of motion describe the relationship between an object's motion and the forces acting on it.",
14
+ "id": "65bf459fdc504847aedb1a91c7973821"
15
+ },
16
+ "7c37ae2b61c443ecadaaf22363f70f01": {
17
+ "label": true,
18
+ "text": "Quantum mechanics is a branch of physics that deals with the behavior of particles at the atomic and subatomic levels.",
19
+ "id": "7c37ae2b61c443ecadaaf22363f70f01"
20
+ },
21
+ "bee5aee4e99a4ec0af7fae9f78871780": {
22
+ "label": true,
23
+ "text": "Einstein's theory of relativity revolutionized our understanding of space, time, and gravity.",
24
+ "id": "bee5aee4e99a4ec0af7fae9f78871780"
25
+ },
26
+ "6f3cb59314ab4042b383e46fe254a933": {
27
+ "label": true,
28
+ "text": "The study of thermodynamics explores the transfer of heat and energy in physical systems.",
29
+ "id": "6f3cb59314ab4042b383e46fe254a933"
30
+ },
31
+ "df662d4733c942d08c01fd4e565c54ed": {
32
+ "label": false,
33
+ "text": "Read this: In the century after the death of Osman I, Ottoman rule began to extend over the Eastern Mediterranean and the Balkans. Osman's son, Orhan, captured the northwestern Anatolian city of Bursa in 1324, and made it the new capital of the Ottoman state. This Ottoman conquest meant the loss of Byzantine control over northwestern Anatolia. The important city of Thessaloniki was captured from the Venetians in 1387. The Ottoman victory at Kosovo in 1389 effectively marked the end of Serbian power in the region, paving the way for Ottoman expansion into Europe. The Battle of Nicopolis in 1396, widely regarded as the last large-scale crusade of the Middle Ages, failed to stop the advance of the victorious Ottoman Turks.\nNow answer this question, if there is an answer (If it cannot be answered, return \"unanswerable\"): Which city did Osman's son capture?",
34
+ "id": "df662d4733c942d08c01fd4e565c54ed"
35
+ },
36
+ "f5035c667ffd45deaa41f16f8cf47499": {
37
+ "label": false,
38
+ "text": "OAKVILLE, Ont. (CP) - If Mike Weir #39;s putter was any better on Saturday, his final round at the Bell Canadian Open would be more of a coronation than a competition.\n\nWhat is a good title for this?",
39
+ "id": "f5035c667ffd45deaa41f16f8cf47499"
40
+ },
41
+ "f9736547d4d444bdb0154a9cb6813ff2": {
42
+ "label": false,
43
+ "text": "Q:\"Darius Hill\", given a list of categories: company, educational institution, artist, athlete, office holder, mean of transportation, building, natural place, village, animal, plant, album, film or written work, what category does the title belong to?\nA:",
44
+ "id": "f9736547d4d444bdb0154a9cb6813ff2"
45
+ },
46
+ "6c3c2bbabfb64aa38c06f8ca6c551518": {
47
+ "label": false,
48
+ "text": "Answer the following question: Two neighboring police Departments took different approaches to fighting crime, Greenville P.D. did not invest in DNA testing, improved forensics, and information technology, while Grayville took extensive measures of that nature. Given the paragraph above, please answer correctly the following question: Which P.D. took a more efficient approach in fighting crime? Hint: Law enforcement officers are often unfamiliar with information technology, and so lack the skills and interest in pursuing attackers. There are also budgetary constraints. It has been argued that the high cost of technology, such as DNA testing, and improved forensics mean less money for other kinds of law enforcement, so the overall rate of criminals not getting dealt with goes up as the cost of the technology increases. In addition, the identification of attackers across a network may require logs from various points in the network and in many countries, the release of these records to law enforcement (with the exception of being voluntarily surrendered by a network administrator or a system administrator) requires a search warrant and, depending on the circumstances, the legal proceedings required can be drawn out to the point where the records are either regularly destroyed, or the information is no longer relevant.\nAnswer:",
49
+ "id": "6c3c2bbabfb64aa38c06f8ca6c551518"
50
+ },
51
+ "0cb6d1612846462e8e458141428b9cfe": {
52
+ "label": false,
53
+ "text": "Based on the premise \"well no that's true it's not but it is i mean they're coming i mean we've we've got\", can we conclude the hypothesis \"They're not coming. \" is true?",
54
+ "id": "0cb6d1612846462e8e458141428b9cfe"
55
+ },
56
+ "d891ec62d69444948e1048ee07aaaa23": {
57
+ "label": false,
58
+ "text": "Do these two sentences from wikipedia have the same meaning?\nEregion should not be pronounced like the English `` region '' , but with a rough r and hard g and stress on the penult .\nEregion should not be pronounced like the English `` region '' , but , with a trilled r and hard g and stress on the penult .\n\nSelect from:\n A). no.\n B). yes.\nThe answer is:",
59
+ "id": "d891ec62d69444948e1048ee07aaaa23"
60
+ },
61
+ "0dd833587758435dba25a93013dee69b": {
62
+ "label": false,
63
+ "text": "Given the question: Bullfrog Productions - Bullfrog Productions was a British computer game developer founded in 1987 by Les Edgar and Peter Molyneux. The company achieved recognition in 1989 for their third release Populous.At the time of the company's founding Edgar and Molyneux were already involved in an enterprise called Taurus Impact Systems. Given a choice of categories company, educational institution, artist, athlete, office holder, mean of transportation, building, natural place, village, animal, plant, album, film or written work, the text refers to which one?\nThe answer is:",
64
+ "id": "0dd833587758435dba25a93013dee69b"
65
+ },
66
+ "ae49a80f7f28465ea782d72fa1a0fb76": {
67
+ "label": false,
68
+ "text": "Given the stream of consciousness rationale, provide a reasonable question and answer. Rationale: The dog is standing and staring down a cat not running through the yard.\n The question and answer:",
69
+ "id": "ae49a80f7f28465ea782d72fa1a0fb76"
70
+ },
71
+ "df015b5b63494756994d43ad73daa4eb": {
72
+ "label": false,
73
+ "text": "When American 11 struck the World Trade Center at 8:46, no one in the White House or traveling with the President knew that it had been hijacked. While that information circulated within the FAA, we found no evidence that the hijacking was reported to any other agency in Washington before 8:46. Most federal agencies learned about the crash in New York from CNN. Within the FAA, the administrator, Jane Garvey, and her acting deputy, Monte Belger, had not been told of a confirmed hijacking before they learned from television that a plane had crashed. Others in the agency were aware of it, as we explained earlier in this chapter. Inside the National Military Command Center, the deputy director of operations and his assistant began notifying senior Pentagon officials of the incident. At about 9:00, the senior NMCC operations officer reached out to the FAA operations center for information. Although the NMCC was advised of the hijacking of American 11, the scrambling of jets was not discussed. In Sarasota, Florida, the presidential motorcade was arriving at the Emma E. Booker Elementary School, where President Bush was to read to a class and talk about education. White House Chief of Staff Andrew Card told us he was standing with the President outside the classroom when Senior Advisor to the President Karl Rove first informed them that a small, twin-engine plane had crashed into the World Trade Center. The President's reaction was that the incident must have been caused by pilot error. At 8:55, before entering the classroom, the President spoke to National Security Advisor Condoleezza Rice, who was at the White House. She recalled first telling the President it was a twin-engine aircraft-and then a commercial aircraft-that had struck the World Trade Center, adding \"that's all we know right now, Mr. President.\" At the White House, Vice President Dick Cheney had just sat down for a meeting when his assistant told him to turn on his television because a plane had struck the NorthTower of the World Trade Center. The Vice President was wondering \"How the hell could a plane hit the World Trade Center\" when he saw the second aircraft strike the South Tower. Elsewhere in the White House, a series of 9:00 meetings was about to begin. In the absence of information that the crash was anything other than an accident, the White House staff monitored the news as they went ahead with their regular schedules. \nWhat question would one ask from this paragraph?",
74
+ "id": "df015b5b63494756994d43ad73daa4eb"
75
+ },
76
+ "6543d333a318432b8cc11952ea0659f7": {
77
+ "label": false,
78
+ "text": "Write some highlights for the following article:\n\nBy. Associated Press. and Daily Mail Reporters. An adorable 3-year-old girl who suffered a heart attack during a dentist visit likely died because she was given the maximum dose of sedative drugs, according to an autopsy report. Finley Puleo Boyle, of Kailua in Honolulu, Hawaii, probably died because of the sedatives and local anesthesia that were administered during her December dental procedure, Honolulu Chief Medical Examiner Dr. Christopher Happy concluded in his autopsy report. He classified the death as an accident. The girl lapsed into a coma on December 3 in the office of Dr. Lilly Geyer at Island Dentistry for Children. She died on January 3. The office has since closed. SCROLL DOWN FOR VIDEO. Finley Boyle, 3, died after being given sedatives during a standard dental procedure. Finley Boyle's devastated mother Ashley (right) took the little girl to the Island Dentistry for Children in Oahu, Hawaii on December 3. The autopsy report said the previously healthy girl had no signs of underlying heart problems or an allergic reaction to the array of sedatives and anesthetic she received in preparation for cavity fillings and root canals, the Honolulu Star-Advertiser reported Friday. The medical examiner noted her teeth were in good shape: 'the oral cavity has native dentition in good repair.' Finley received five drugs, according to the report, including Demerol, hydroxyzine and chlorohydrate. She was also given laughing gas and an injection of a local anesthetic, lidocaine with epinephrine. 'Immediately following the lidocaine injection, the decedent became unresponsive and went into cardiopulmonary arrest,' Happy said in his report. The girl lapsed into a coma on December 3 in the office of Dr. Lilly Geyer (center) at Island Dentistry for Children. The girl's mother Ashley Boyle (pictured) is traumatized by her daughter's death and has filed a negligence lawsuit against the dentist. Parents Ashley and Evan Boyle filed a negligence lawsuit while their only child was in a coma. Their attorney, Rick Fried, would not comment on the autopsy report. Geyer's attorney, John Nishimoto, has called the allegations 'unproven' but declined to comment further because of the lawsuit. He didn't respond to the newspaper's requests for comment on the autopsy report Thursday. Geyer was issued a license to practice dentistry in the state in July 2005, the Star-Advertiser has previously reported. As of last December, there were no records of complaints against her or her practice on file with the Hawaii Department Of Commerce and Consumer Affairs, according to the newspaper. Ms Boyle is being represented by attorney Richard Fried who called the maximum dosages of five different drugs 'grossly excessive'. He cited the American Academy of Pediatric Dentistry's standards which warned that young children should be monitored every five minutes and that an extra person should be there to help in case resuscitation is needed. Immediately following the lidocaine injection, Finley Boyle became unresponsive and went into cardiopulmonary arrest. The procedure on December 3 broke both. of those rules. Instead of checking Finley's oxygen levels every five. minutes, records show she wasn't checked for 26 minutes which explains. why the brain damage was so severe. When. the girl went into cardiac arrest, a doctor from another office had to. be brought in to perform CPR - which Ms Boyle believes is the main. reason her daughter fell into a coma. Ms Boyle was left with the agonizing decision to not resuscitate her daughter, saying that although she held out hope that her daughter would recover she didn't want her to live in a vegetative state. 'Had it been initiated at the correct time, she'd be walking out of the hospital right now,' Boyle said.Finley. may not have even needed the fillings. Ms Boyle has been contacted by. others mothers who got second opinions after visiting Dr Geyer and found. out that her recommendations were 'either totally unnecessary or. somewhat unnecessary' according to Fried. Other mothers said their children had trouble walking for a few days after having work done by Dr Geyer. Ms Boyle calls the botched procedure a 'massive tragedy' that she and her family 'will have to deal with for the rest of our lives.' In January, new state rules took effect tightening oversight of dental sedation in Hawaii. Finley Puelo Boyle (pictured) is tragically missed by her mother, father, and all who knew her. Write caption he. Sorry we are not currently accepting comments on this article.",
79
+ "id": "6543d333a318432b8cc11952ea0659f7"
80
+ },
81
+ "5ca51e70071c49b58e03afb07b2b7621": {
82
+ "label": true,
83
+ "text": " - A physicist is a scientist who has specialized knowledge in the field of physics, the exploration of the interactions of matter and energy across the physical universe.",
84
+ "id": "5ca51e70071c49b58e03afb07b2b7621"
85
+ },
86
+ "52c828e2f7c3480c857e0c8f8441fd7d": {
87
+ "label": false,
88
+ "text": " If you don’t have a double boiler, you can also place a mixing bowl on top of a saucepan.",
89
+ "id": "52c828e2f7c3480c857e0c8f8441fd7d"
90
+ },
91
+ "6d4e10e143094dd8abc4011da17f7127": {
92
+ "label": false,
93
+ "text": "Question: Members of what nation attack the group and kill Chomina's wife?",
94
+ "id": "6d4e10e143094dd8abc4011da17f7127"
95
+ },
96
+ "ddc179f7f3d241258a8a0dc92636378d": {
97
+ "label": false,
98
+ "text": " \n \n Criticising Russia, Mr Trump urged Moscow to \"join the communi",
99
+ "id": "ddc179f7f3d241258a8a0dc92636378d"
100
+ },
101
+ "e97b75577e7a41b08ba0a966c73204d1": {
102
+ "label": true,
103
+ "text": "Density is also an important physical property of matter.",
104
+ "id": "e97b75577e7a41b08ba0a966c73204d1"
105
+ },
106
+ "71afbcea85b147a4b59d20a4c4cc88d9": {
107
+ "label": false,
108
+ "text": "Given the question:",
109
+ "id": "71afbcea85b147a4b59d20a4c4cc88d9"
110
+ },
111
+ "3aeff0c289c44954b08e8b6b5b888bdb": {
112
+ "label": false,
113
+ "text": "\n\nThe estranged wife of one of three terror suspects accused of plotting a huge suicide bombing attack offered to be the group's 'fourth lion', a court heard today.",
114
+ "id": "3aeff0c289c44954b08e8b6b5b888bdb"
115
+ },
116
+ "ab7a043639904ed9b88a323678548fa9": {
117
+ "label": true,
118
+ "text": " Differences in density of matter explain many phenomena.",
119
+ "id": "ab7a043639904ed9b88a323678548fa9"
120
+ }
121
+ },
122
+ "version": 9,
123
+ "metadata": {
124
+ "description": "Text talking about physics."
125
+ }
126
+ }
data/lilac.yml ADDED
@@ -0,0 +1,313 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ datasets:
2
+ - namespace: local
3
+ name: OpenHermes-2.5-10k
4
+ source:
5
+ dataset_name: teknium/OpenHermes-2.5
6
+ sample_size: 9999
7
+ source_name: huggingface
8
+ embeddings:
9
+ - path:
10
+ - conversations
11
+ - '*'
12
+ - value
13
+ embedding: gte-small
14
+ settings:
15
+ ui:
16
+ media_paths:
17
+ - - test__clusters
18
+ - text
19
+ - - conversations
20
+ - '*'
21
+ - value
22
+ - - test__cluster
23
+ - text
24
+ markdown_paths: []
25
+ - namespace: local
26
+ name: OpenOrca-100k
27
+ source:
28
+ dataset_name: Open-Orca/OpenOrca
29
+ sample_size: 100000
30
+ source_name: huggingface
31
+ embeddings:
32
+ - path: question
33
+ embedding: openai
34
+ signals:
35
+ - path: question
36
+ signal:
37
+ embedding: openai
38
+ namespace: local
39
+ concept_name: physics
40
+ version: 21
41
+ signal_name: concept_score
42
+ - path: question
43
+ signal:
44
+ signal_name: text_statistics
45
+ settings:
46
+ ui:
47
+ media_paths:
48
+ - question
49
+ - response
50
+ markdown_paths: []
51
+ - namespace: local
52
+ name: glue_ax
53
+ source:
54
+ dataset_name: glue
55
+ config_name: ax
56
+ source_name: huggingface
57
+ embeddings:
58
+ - path: premise
59
+ embedding: gte-small
60
+ - path: hypothesis
61
+ embedding: gte-small
62
+ signals:
63
+ - path: premise
64
+ signal:
65
+ signal_name: text_statistics
66
+ settings:
67
+ ui:
68
+ media_paths:
69
+ - premise
70
+ markdown_paths: []
71
+ - namespace: local
72
+ name: ableton
73
+ source:
74
+ source_name: llama_index_docs
75
+ embeddings:
76
+ - path: text
77
+ embedding: gte-small
78
+ settings:
79
+ ui:
80
+ media_paths:
81
+ - text
82
+ markdown_paths: []
83
+ - namespace: local
84
+ name: Capybara
85
+ source:
86
+ dataset_name: LDJnr/Capybara
87
+ source_name: huggingface
88
+ embeddings:
89
+ - path:
90
+ - conversation
91
+ - '*'
92
+ - input
93
+ embedding: gte-small
94
+ signals:
95
+ - path:
96
+ - conversation
97
+ - '*'
98
+ - input
99
+ signal:
100
+ signal_name: text_statistics
101
+ - path:
102
+ - conversation
103
+ - '*'
104
+ - input
105
+ signal:
106
+ embedding: gte-small
107
+ namespace: lilac
108
+ concept_name: non-english
109
+ signal_name: concept_score
110
+ settings:
111
+ ui:
112
+ media_paths:
113
+ - input
114
+ - - conversation
115
+ - '*'
116
+ - input
117
+ - - conversation
118
+ - '*'
119
+ - output
120
+ markdown_paths: []
121
+ - namespace: local
122
+ name: OpenOrca-10k
123
+ source:
124
+ dataset_name: Open-Orca/OpenOrca
125
+ sample_size: 10000
126
+ source_name: huggingface
127
+ embeddings:
128
+ - path: response
129
+ embedding: gte-small
130
+ settings:
131
+ ui:
132
+ media_paths:
133
+ - question
134
+ - response
135
+ markdown_paths: []
136
+ - namespace: local
137
+ name: cpb
138
+ source:
139
+ dataset_name: LDJnr/Capybara
140
+ source_name: huggingface
141
+ signals:
142
+ - path:
143
+ - conversation
144
+ - '*'
145
+ - input
146
+ signal:
147
+ signal_name: text_statistics
148
+ settings:
149
+ ui:
150
+ media_paths:
151
+ - input
152
+ - - conversation
153
+ - '*'
154
+ - input
155
+ - - conversation
156
+ - '*'
157
+ - output
158
+ markdown_paths: []
159
+ - namespace: local
160
+ name: mikeion_dissertation_data_with_split
161
+ source:
162
+ dataset_name: mikeion/dissertation_data_with_split
163
+ source_name: huggingface
164
+ settings:
165
+ ui:
166
+ media_paths:
167
+ - content
168
+ markdown_paths: []
169
+ - namespace: local
170
+ name: mikeion_dissertation_data
171
+ source:
172
+ dataset_name: mikeion/dissertation_data
173
+ source_name: huggingface
174
+ settings:
175
+ ui:
176
+ media_paths:
177
+ - - messages
178
+ - '*'
179
+ - attachments
180
+ - '*'
181
+ - url
182
+ markdown_paths: []
183
+ - namespace: local
184
+ name: test
185
+ source:
186
+ filepaths:
187
+ - ~/Code/lilac_datasets/test.json
188
+ source_name: json
189
+ settings:
190
+ ui:
191
+ media_paths:
192
+ - json
193
+ markdown_paths: []
194
+ - namespace: local
195
+ name: OrcaMyles
196
+ source:
197
+ dataset_name: Open-Orca/OpenOrca
198
+ source_name: huggingface
199
+ settings:
200
+ ui:
201
+ media_paths:
202
+ - question
203
+ - response
204
+ markdown_paths: []
205
+ - namespace: local
206
+ name: OpenOrca
207
+ source:
208
+ dataset_name: Open-Orca/OpenOrca
209
+ source_name: huggingface
210
+ embeddings:
211
+ - path: question
212
+ embedding: gte-small
213
+ - path: response
214
+ embedding: gte-small
215
+ signals:
216
+ - path: question
217
+ signal:
218
+ signal_name: pii
219
+ - path: question
220
+ signal:
221
+ signal_name: text_statistics
222
+ - path: response
223
+ signal:
224
+ signal_name: pii
225
+ - path: response
226
+ signal:
227
+ signal_name: markdown_code_block
228
+ - path: response
229
+ signal:
230
+ signal_name: text_statistics
231
+ - path: question
232
+ signal:
233
+ embedding: gte-small
234
+ namespace: lilac
235
+ concept_name: non-english
236
+ signal_name: concept_score
237
+ - path: question
238
+ signal:
239
+ embedding: gte-small
240
+ namespace: lilac
241
+ concept_name: non-english
242
+ signal_name: concept_score
243
+ - path: question
244
+ signal:
245
+ embedding: gte-small
246
+ namespace: lilac
247
+ concept_name: non-english
248
+ signal_name: concept_score
249
+ settings:
250
+ ui:
251
+ media_paths:
252
+ - question
253
+ - response
254
+ markdown_paths: []
255
+ - namespace: local
256
+ name: imdb
257
+ source:
258
+ dataset_name: imdb
259
+ source_name: huggingface
260
+ embeddings:
261
+ - path: text
262
+ embedding: gte-small
263
+ signals:
264
+ - path: text
265
+ signal:
266
+ signal_name: pii
267
+ - path: label
268
+ signal:
269
+ signal_name: text_statistics
270
+ settings:
271
+ ui:
272
+ media_paths:
273
+ - text
274
+ markdown_paths: []
275
+ - namespace: local
276
+ name: capybara
277
+ source:
278
+ dataset_name: capybara
279
+ source_name: huggingface
280
+ settings:
281
+ ui:
282
+ media_paths:
283
+ - - conversation
284
+ - '*'
285
+ - input
286
+ - - conversation
287
+ - '*'
288
+ - output
289
+ markdown_paths: []
290
+ - namespace: local
291
+ name: db-openorca-10k
292
+ source:
293
+ dataset_name: Open-Orca/OpenOrca
294
+ sample_size: 10000
295
+ source_name: huggingface
296
+ embeddings:
297
+ - path: question
298
+ embedding: gte-small
299
+ signals:
300
+ - path: question
301
+ signal:
302
+ embedding: gte-small
303
+ namespace: local
304
+ concept_name: physics
305
+ signal_name: concept_score
306
+ settings:
307
+ ui:
308
+ media_paths:
309
+ - question
310
+ - response
311
+ - - question__cluster
312
+ - text
313
+ markdown_paths: []
dist/README.md ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ This directory is used for locally built whl files.
2
+ We write a README.md to ensure an empty folder is uploaded when there is no whl.
docker_start.sh ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ #!/bin/bash
2
+
3
+ # Fail if any of the commands below fail.
4
+ set -e
5
+
6
+ lilac hf-docker-start
7
+ gunicorn lilac.server:app \
8
+ --bind 0.0.0.0:5432 \
9
+ --preload -k uvicorn.workers.UvicornWorker \
10
+ --timeout 120