or4cl3ai satan4191 commited on
Commit
24b9526
1 Parent(s): 072763a

Update README.md (#2)

Browse files

- Update README.md (61cd065a3f760333d14097a7318b4a5532938c1d)


Co-authored-by: Dustin Groves <satan4191@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +232 -193
README.md CHANGED
@@ -15,8 +15,200 @@ datasets:
15
  - bookcorpus
16
  - bookcorpusopen
17
  - nRuaif/OpenOrca-GPT3.5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
  language:
19
  - en
 
 
 
 
20
  metrics:
21
  - accuracy
22
  - bertscore
@@ -27,197 +219,44 @@ metrics:
27
  - mean_iou
28
  tags:
29
  - code
 
 
30
  ---
31
- # Model Card for Model ID
32
-
33
- <!-- Provide a quick summary of what the model is/does. -->
34
-
35
- This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1).
36
-
37
- ## Model Details
38
-
39
- ### Model Description
40
-
41
- <!-- Provide a longer summary of what this model is. -->
42
-
43
-
44
-
45
- - **Developed by:** [More Information Needed]
46
- - **Shared by [optional]:** [More Information Needed]
47
- - **Model type:** [More Information Needed]
48
- - **Language(s) (NLP):** [More Information Needed]
49
- - **License:** [More Information Needed]
50
- - **Finetuned from model [optional]:** [More Information Needed]
51
-
52
- ### Model Sources [optional]
53
-
54
- <!-- Provide the basic links for the model. -->
55
-
56
- - **Repository:** [More Information Needed]
57
- - **Paper [optional]:** [More Information Needed]
58
- - **Demo [optional]:** [More Information Needed]
59
-
60
- ## Uses
61
-
62
- <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
63
-
64
- ### Direct Use
65
-
66
- <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
67
-
68
- [More Information Needed]
69
-
70
- ### Downstream Use [optional]
71
-
72
- <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
73
-
74
- [More Information Needed]
75
-
76
- ### Out-of-Scope Use
77
-
78
- <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
79
-
80
- [More Information Needed]
81
-
82
- ## Bias, Risks, and Limitations
83
-
84
- <!-- This section is meant to convey both technical and sociotechnical limitations. -->
85
-
86
- [More Information Needed]
87
-
88
- ### Recommendations
89
-
90
- <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
91
-
92
- Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
93
-
94
- ## How to Get Started with the Model
95
-
96
- Use the code below to get started with the model.
97
-
98
- [More Information Needed]
99
-
100
- ## Training Details
101
-
102
- ### Training Data
103
-
104
- <!-- This should link to a Data Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
105
-
106
- [More Information Needed]
107
-
108
- ### Training Procedure
109
-
110
- <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
111
-
112
- #### Preprocessing [optional]
113
-
114
- [More Information Needed]
115
-
116
-
117
- #### Training Hyperparameters
118
-
119
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
120
-
121
- #### Speeds, Sizes, Times [optional]
122
-
123
- <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
124
-
125
- [More Information Needed]
126
-
127
- ## Evaluation
128
-
129
- <!-- This section describes the evaluation protocols and provides the results. -->
130
-
131
- ### Testing Data, Factors & Metrics
132
-
133
- #### Testing Data
134
-
135
- <!-- This should link to a Data Card if possible. -->
136
-
137
- [More Information Needed]
138
-
139
- #### Factors
140
-
141
- <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
142
-
143
- [More Information Needed]
144
-
145
- #### Metrics
146
-
147
- <!-- These are the evaluation metrics being used, ideally with a description of why. -->
148
-
149
- [More Information Needed]
150
-
151
- ### Results
152
-
153
- [More Information Needed]
154
-
155
- #### Summary
156
-
157
-
158
-
159
- ## Model Examination [optional]
160
-
161
- <!-- Relevant interpretability work for the model goes here -->
162
-
163
- [More Information Needed]
164
-
165
- ## Environmental Impact
166
-
167
- <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
168
-
169
- Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
170
-
171
- - **Hardware Type:** [More Information Needed]
172
- - **Hours used:** [More Information Needed]
173
- - **Cloud Provider:** [More Information Needed]
174
- - **Compute Region:** [More Information Needed]
175
- - **Carbon Emitted:** [More Information Needed]
176
-
177
- ## Technical Specifications [optional]
178
-
179
- ### Model Architecture and Objective
180
-
181
- [More Information Needed]
182
-
183
- ### Compute Infrastructure
184
-
185
- [More Information Needed]
186
-
187
- #### Hardware
188
-
189
- [More Information Needed]
190
-
191
- #### Software
192
-
193
- [More Information Needed]
194
-
195
- ## Citation [optional]
196
-
197
- <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
198
-
199
- **BibTeX:**
200
-
201
- [More Information Needed]
202
-
203
- **APA:**
204
-
205
- [More Information Needed]
206
-
207
- ## Glossary [optional]
208
-
209
- <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
210
-
211
- [More Information Needed]
212
-
213
- ## More Information [optional]
214
-
215
- [More Information Needed]
216
-
217
- ## Model Card Authors [optional]
218
-
219
- [More Information Needed]
220
-
221
- ## Model Card Contact
222
-
223
- [More Information Needed]
 
15
  - bookcorpus
16
  - bookcorpusopen
17
  - nRuaif/OpenOrca-GPT3.5
18
+ - irds/codesearchnet
19
+ - giganticode/java-cmpx-v1
20
+ - nickrosh/Evol-Instruct-Code-80k-v1
21
+ - bigcode/starcoderdata
22
+ - bigcode/the-stack
23
+ - bigcode/the-stack-smol
24
+ - Cdaprod/AI-Developer-Prompts
25
+ - code_x_glue_ct_code_to_text
26
+ - codeparrot/github-code
27
+ - codeparrot/github-code-clean
28
+ - code_x_glue_cc_code_completion_line
29
+ - >-
30
+ autoevaluate/autoeval-eval-jeffdshen__inverse_superglue_mixedp1-jeffdshen__inverse-63643c-1665558893
31
+ - bentrevett/multi30k
32
+ - edbeeching/decision_transformer_gym_replay
33
+ - psyche/common_crawl
34
+ - Birchlabs/openai-prm800k-solutions-only
35
+ - openchat/openchat_sharegpt4_dataset
36
+ - Open-Orca/OpenOrca
37
+ - cjvt/slownet
38
+ - para_crawl
39
+ - zeroshot/twitter-financial-news-sentiment
40
+ - laugustyniak/political-advertising-pl
41
+ - code_search_net
42
+ - sukaka/novelai-webui
43
+ - P1ayer-1/chatgpt-conversations-chatlogs.net
44
+ - daniel2588/sarcasm
45
+ - psmathur/orca_minis_uncensored_dataset
46
+ - player1537/Bloom-560m-trained-on-Wizard-Vicuna-Uncensored-trained-on-Based
47
+ - shahules786/prosocial-nsfw-reddit
48
+ - Thewillonline/reddit-sarcasm
49
+ - datasciencemmw/current-data
50
+ - Oniichat/bluemoon_roleplay_chat_data_300k_messages
51
+ - dell-research-harvard/AmericanStories
52
+ - b-mc2/sql-create-context
53
+ - rahulmallah/autotrain-data-emotion-detection
54
+ - theblackcat102/multiround-programming-convo
55
+ - Lsavints/software_knowledgebase
56
+ - RazinAleks/SO-Python_QA-Web_Development_class
57
+ - codeparrot/apps
58
+ - vlsp-2023-vllm/en-to-vi-formal-informal-tranlations
59
+ - fraug-library/english_contractions_extensions
60
+ - spencer/software_slacks
61
+ - Abirate/english_quotes
62
+ - Nexdata/American_English_Natural_Dialogue_Speech_Data
63
+ - Nexdata/Latin_American_Speaking_English_Speech_Data_by_Mobile_Phone
64
+ - Nexdata/American_English_Speech_Data_by_Mobile_Phone_Reading
65
+ - Nexdata/American_English_Speech_Synthesis_Corpus-Female
66
+ - rombodawg/LimitlessCodeTraining
67
+ - RikoteMaster/Emotion_Recognition_4_llama2
68
+ - Villian7/Emotions_Data
69
+ - alanland/llama2-self-cognition
70
+ - CognitiveScience/coscidata
71
+ - bibidentuhanoi/gideon_self_cognition
72
+ - gollark/consciousness
73
+ - juletxara/visual-spatial-reasoning
74
+ - lintang/numerical_reasoning_arithmetic
75
+ - reasoning-machines/gsm-hard
76
+ - open-source-metrics/reinforcement-learning-checkpoint-downloads
77
+ - igbo_english_machine_translation
78
+ - US-Artificial-Intelligence/algemap
79
+ - rombodawg/2XUNCENSORED_alpaca_840k_Evol_USER_ASSIS
80
+ - griffin/chain_of_density
81
+ - >-
82
+ shirsh10mall/LLM_Instruct_Learning_Project_Preprocessed_Tokenized_Open_Orca_Dataset_Flan_T5
83
+ - Thaweewat/chain-of-thought-74k-th
84
+ - AlekseyKorshuk/chain-of-thoughts-chatml-deduplicated
85
+ - dair-ai/emotion
86
+ - hita/social-behavior-emotions
87
+ - Bingsu/Human_Action_Recognition
88
+ - anjandash/java-8m-methods-v1
89
+ - nadiamaqbool81/java_code_instructions_1.178k_alpaca
90
+ - DavidMOBrien/8000-java
91
+ - rombodawg/LimitlessCodeTraining_1k-Python-Javascript_GuanacoFormat
92
+ - angie-chen55/javascript-github-code
93
+ - kye/all-lucidrain-python-3
94
+ - Fraser/python-state-changes
95
+ - ammarnasr/the-stack-ruby-clean
96
+ - ammarnasr/the-stack-rust-clean
97
+ - seyyedaliayati/solidity-dataset
98
+ - jkhedri/psychology-dataset
99
+ - KonradSzafer/stackoverflow_linux
100
+ - vikp/textbook_quality_programming
101
+ - rombodawg/LosslessMegaCodeTrainingV3_MINI
102
+ - BelleGroup/multiturn_chat_0.8M
103
+ - smangrul/code-chat-assistant-v1
104
+ - goendalf666/sales-textbook_for_convincing_and_selling
105
+ - readerbench/ConversationalAgent-Ro
106
+ - beurkinger/autotrain-data-human-action-recognition
107
+ - jpwahle/autoencoder-paraphrase-dataset
108
+ - jpwahle/autoregressive-paraphrase-dataset
109
+ - teknium/GPT4-LLM-Cleaned
110
+ - Anthropic/model-written-evals
111
+ - openai_humaneval
112
+ - kye/all-google-ai-python-code
113
+ - kye/all-openai-github-code
114
+ - EleutherAI/lambada_openai
115
+ - CShorten/ML-ArXiv-Papers
116
+ - WaltonFuture/InstructionGPT-4
117
+ - open-llm-leaderboard/details_AIDC-ai-business__Marcoroni-70B
118
+ - seansullivan/INT-Business-Syllabus
119
+ - theoldmandthesea/17k_business_book
120
+ - SunRise228/business-doc
121
+ - gauravshrm211/VC-startup-evaluation-for-investment
122
+ - TuningAI/Startups_V1
123
+ - TuningAI/Startups_V2
124
+ - AdiOO7/llama-2-finance
125
+ - scillm/scientific_papers
126
+ - gokuls/wiki_book_corpus_complete_processed_bert_dataset
127
+ - the_pile_books3
128
+ - go_emotions
129
+ - yizhongw/self_instruct
130
+ - codeparrot/self-instruct-starcoder
131
+ - Amani27/massive_translation_dataset
132
+ - huggingface/transformers-metadata
133
+ - hf-internal-testing/transformers-metadata
134
+ - commonsense_qa
135
+ - nlplabtdtu/test-edu-crawl
136
+ - kernelmachine/open-license-corpus
137
+ - BDas/EnglishNLPDataset
138
+ - CyberNative/github_cybersecurity_READMEs
139
+ - thomwolf/github-python
140
+ - CM/codexglue_code2text_java
141
+ - autoevaluate/autoeval-staging-eval-project-glue-f16e6c43-14015917
142
+ - lemonteaa/algorithmic-reasoning-seed
143
+ - EmpathyFirstMedia/algolia
144
+ - vicgalle/alpaca-gpt4
145
+ - pariajm/sharif_emotional_speech_dataset
146
+ - lighteval/synthetic_reasoning_natural
147
+ - jxu124/llava_complex_reasoning_77k
148
+ - bibidentuhanoi/gideon_self_cognition_text
149
+ - ohilikeit/empathetic_dialogues_mutli_turn_ko
150
+ - KevinZ/psycholinguistic_eval
151
+ - fiveflow/psychology-dataset
152
+ - shahidul034/text_generation_model_data
153
+ - qwedsacf/story-generation
154
+ - EnigmaOfTheWorld/b-mc2-sql-create-context
155
+ - HuggingFaceH4/testing_self_instruct_small
156
+ - RUCAIBox/Data-to-text-Generation
157
+ - Fhrozen/AudioSet2K22
158
+ - Chr0my/Epidemic_sounds
159
+ - ChristophSchuhmann/lyrics-index
160
+ - Cropinky/rap_lyrics_english
161
+ - tsterbak/eurovision-lyrics-1956-2023
162
+ - brunokreiner/genius-lyrics
163
+ - google/MusicCaps
164
+ - ccmusic-database/music_genre
165
+ - Hyeon2/riffusion-musiccaps-dataset
166
+ - SamAct/autotrain-data-musicprompt
167
+ - Chr0my/Epidemic_music
168
+ - juliensimon/autonlp-data-song-lyrics
169
+ - Datatang/North_American_English_Speech_Data_by_Mobile_Phone_and_PC
170
+ - Chr0my/freesound.org
171
+ - teticio/audio-diffusion-256
172
+ - KELONMYOSA/dusha_emotion_audio
173
+ - Ar4ikov/iemocap_audio_text_splitted
174
+ - flexthink/ljspeech
175
+ - mozilla-foundation/common_voice_13_0
176
+ - facebook/voxpopuli
177
+ - SocialGrep/one-million-reddit-jokes
178
+ - breadlicker45/human-midi-rlhf
179
+ - breadlicker45/midi-gpt-music-small
180
+ - projectlosangeles/Los-Angeles-MIDI-Dataset
181
+ - huggingartists/epic-rap-battles-of-history
182
+ - SocialGrep/one-million-reddit-confessions
183
+ - shahules786/prosocial-nsfw-reddit
184
+ - Thewillonline/reddit-sarcasm
185
+ - autoevaluate/autoeval-eval-futin__guess-vi-4200fb-2012366606
186
+ - lmsys/chatbot_arena_conversations
187
+ - mozilla-foundation/common_voice_11_0
188
+ - mozilla-foundation/common_voice_4_0
189
+ - dell-research-harvard/AmericanStories
190
+ - zZWipeoutZz/insane_style
191
+ - mu-llama/MusicQA
192
+ - RaphaelOlivier/whisper_adversarial_examples
193
+ - huggingartists/metallica
194
+ - vldsavelyev/guitar_tab
195
+ - NLPCoreTeam/humaneval_ru
196
+ - seungheondoh/audioset-music
197
+ - gary109/onset-singing3_corpora_parliament_processed_MIR-ST500
198
+ - LDD5522/Rock_Vocals
199
+ - huggingartists/rage-against-the-machine
200
+ - huggingartists/chester-bennington
201
+ - huggingartists/logic
202
+ - cmsolson75/artist_song_lyric_dataset
203
+ - BhavyaMuni/artist-lyrics
204
+ - vjain/emotional_intelligence
205
+ - mhenrichsen/context-aware-splits
206
  language:
207
  - en
208
+ - es
209
+ - it
210
+ - ru
211
+ - la
212
  metrics:
213
  - accuracy
214
  - bertscore
 
219
  - mean_iou
220
  tags:
221
  - code
222
+ - music
223
+ library_name: transformers
224
  ---
225
+ Model Overview
226
+ SquanchNasty is a groundbreaking AI model that pushes the boundaries of natural language processing and understanding. It is designed to generate creative, coherent, and contextually relevant text based on user prompts. With its advanced neural network architecture and extensive training on diverse datasets, SquanchNasty can generate high-quality responses across various domains and tasks.
227
+
228
+ Intended Use
229
+ SquanchNasty is intended to be used as a creative and innovative tool to assist users in generating text-based content. It can be employed for a wide range of applications, including but not limited to:
230
+
231
+ Creative Writing: SquanchNasty can help users in generating unique storylines, dialogue, and descriptive passages for creative writing projects.
232
+ Content Generation: It can be used to generate engaging and informative articles, blog posts, social media captions, and other written content.
233
+ Language Translation: SquanchNasty's language generation capabilities can be leveraged to facilitate translation services by generating accurate and contextually appropriate translations.
234
+ Coding Assistance: The model can assist programmers by providing code snippets, explanations, and suggestions for various programming languages.
235
+ Conversational Agents: SquanchNasty's ability to generate contextually relevant responses makes it suitable for use in chatbots and virtual assistants.
236
+ Model Capabilities
237
+ SquanchNasty is designed to provide users with remarkable text generation capabilities. It can:
238
+
239
+ Generate Coherent Text: The model produces text that is coherent, logical, and contextually relevant to the given prompt.
240
+ Maintain Consistent Style: SquanchNasty can adapt its writing style to match different genres, tones, or formalities based on the provided input.
241
+ Handle Open-Ended Prompts: The model can generate creative and imaginative responses even with minimal or incomplete prompts.
242
+ Incorporate User Preferences: SquanchNasty can be fine-tuned to incorporate user preferences and biases, allowing for personalized text generation.
243
+ Provide Varied Outputs: The model can generate multiple diverse outputs for a given prompt, allowing users to explore different possibilities.
244
+ Dataset and Training
245
+ SquanchNasty has been trained on a vast array of high-quality datasets from various domains, such as literature, code, conversations, and more. The training data includes open-source text, code repositories, question-and-answer platforms, books, and dialogue datasets. The model has undergone extensive pre-training and fine-tuning processes to ensure optimal performance and versatility.
246
+
247
+ Ethical Considerations
248
+ As an AI research scientist, I am committed to upholding ethical guidelines and responsible AI practices. It is crucial to consider the following ethical considerations when using SquanchNasty:
249
+
250
+ Bias Mitigation: Efforts have been made to reduce biases during training, but it is essential to evaluate and address any potential biases in the model's generated output.
251
+ Fairness and Accountability: Users should be aware that SquanchNasty's responses are based on the data it has been trained on, and it may reflect the biases and viewpoints present in the training data.
252
+ User Responsibility: Users should exercise caution and accountability when utilizing SquanchNasty's generated content, ensuring it aligns with ethical standards.
253
+ Content Moderation: It is recommended to implement content moderation mechanisms to ensure that the generated text adheres to community guidelines and legal frameworks.
254
+ Performance and Limitations
255
+ SquanchNasty exhibits exceptional performance in generating coherent and contextually relevant text. However, it is important to consider the following limitations:
256
+
257
+ Context Sensitivity: The model may not always capture intricate contextual nuances, leading to occasional errors or inconsistent responses.
258
+ Sensitivity to Input: SquanchNasty's output heavily relies on the quality and clarity of the input prompt. Ambiguous or misleading prompts may result in less accurate or unexpected responses.
259
+ Over-Reliance on Training Data: The model's responses are based on patterns and information present in the training data. Therefore, it may struggle with generating text on topics or concepts that are underrepresented or absent in the training data.
260
+ Lack of Real-Time Information: SquanchNasty does not have access to real-time data and may generate responses based on outdated or inaccurate information.
261
+ Conclusion
262
+ SquanchNasty is a remarkable and groundbreaking AI model that offers exceptional text generation capabilities. It has been trained on diverse datasets and exhibits the potential to revolutionize various domains, including creative writing, content generation, coding assistance, and conversational agents. While it showcases impressive performance, it is important to consider ethical guidelines, address biases, and be mindful of its limitations when utilizing SquanchNasty for specific use cases