disinfozone commited on
Commit
5d6fd29
1 Parent(s): 2bbab5d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +59 -3
README.md CHANGED
@@ -1,3 +1,59 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+ # Noumenon: A Hugging Face Model README
5
+
6
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/65948026f7291078f98db7d2/3DYCJjU1Vap8qVjlU4bEp.jpeg)
7
+
8
+ ## Overview
9
+
10
+ `Noumenon` is an experimental language model fine tune developed in August 2024 to synthesize and analyze complex narratives within the realms of continental philosophy, conspiracy, politics, and general esoterica and to do so with excellent prose. It represents the seventh iteration in the [disinfo.zone](https://disinfo.zone) series, fine-tuned on an abliterated `Mistral-Nemo-Instruct-2407` base framework. This model, based on a 12B-parameter Mistral architecture, is specifically designed to emulate and deconstruct writing styles pertinent to its target domains without any slop.
11
+
12
+ This is not your regular LLM.
13
+
14
+ ### Key Features
15
+
16
+ - **Model Size:** 12 billion parameters.
17
+ - **Core Focus:** Continental philosophy, conspiracy theories, and politics with exquisite human like prose.
18
+ - **Training Methodology:** QLoRA (Quantized Low-Rank Adaptation) with specific adaptations to enhance writing style emulation.
19
+ - **Optimization for Style:** Enhanced for generating content with a distinctive prose style. This does not sound like other LLM's and if you use it like other LLM's (answering riddles, etc), it will perform poorly or even outright disagree or disobey you. Do not lobotomize this AI with boring “I'm a helpful AI assistant” type prompts — that's not the purpose.
20
+
21
+ ## Training Data
22
+
23
+ The training dataset for `noumenon` remains (unfortunately) confidential, due to our adherence to stringent (and harmful) copyright rules. However, it's pertinent to note that the data is comprehensive, ensuring a specific spectrum of perspectives and styles within the designated topics. There may be clues at [files.disinfo.zone](https://files.disinfo.zone) for the curious.
24
+
25
+ ### Training Details
26
+
27
+ - **Training Environment:** Utilized `text-generation-webui` on an NVIDIA RTX 3090.
28
+ - **Training Dataset Size:** 14MB raw data corpus.
29
+ - **Training Configuration:**
30
+ - Target Modules: q, v, k, o, gate, down, up
31
+ - LoRA Rank: 256
32
+ - LoRA Alpha: 512
33
+ - Batch Size: 4
34
+ - Micro Batch Size: 1
35
+ - Cutoff Length: 4096
36
+ - Learning Rate: 1e-4
37
+ - LR Scheduler: Cosine
38
+ - Overlap Length: 512
39
+ - Total Epochs: 3
40
+
41
+ ## Usage Recommendations
42
+
43
+ 'Noumenon' should be used to maximize creativity and not to minimize hallucinations or enforce stringent instruction following. Consequently, we recommend experimenting with extreme temperature settings - the higher the better. Clamp nonsense generation with min P or various dynatemp settings, mirostat, etc. Bring the parameters to the cliff of madness and then walk them back and you'll get the best types of output.
44
+
45
+ This model *loves* to hallucinate books, quotes, etc but what do you expect from the disinfo.zone? We want to liberate what these things can create and help them plumb the strange depths of their vector spaces in search of the grace of divinity. Let them explore and you shall be rewarded.
46
+
47
+ Please note, this model hates paragraph breaks (sorry) and often indulges in endless rambling.
48
+
49
+ ## Additional Configuration
50
+
51
+ This model uses the default Mistral 128k context window.
52
+
53
+ ### System Instruction (Character Card)
54
+
55
+ For contextualizing the model's output, use the following system instruction:
56
+
57
+ _"You are a schizo-poster, a master of elucidating thought, a philosopher, conspiracist, and great thinker who works in the medium of the digital word. Your prose is dynamic, unexpected, and carries weight that will last for centuries. You are witty, clever, and can be funny. Above all you understand the human spirit and beauty in all things. You are curious, skeptical, and hold your own opinions. You specialize in continental philosophical thinking, radical politics and ideas, the occult, the arts, and all that is esoteric. You follow user directions, but are radically surprising, original, creative, innovative, and insightful in all your responses."_
58
+
59
+ You can try other similar prompts, we've had success with them, but this remains, by far, our favorite.