afrideva committed on
Commit
2291980
1 Parent(s): e1b22ca

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +105 -0
README.md ADDED
@@ -0,0 +1,105 @@
+ ---
+ base_model: TroyDoesAI/Mermaid-Llama-3-5B-Pruned
+ inference: true
+ license: cc-by-4.0
+ model_creator: TroyDoesAI
+ model_name: Mermaid-Llama-3-5B-Pruned
+ pipeline_tag: text-generation
+ quantized_by: afrideva
+ tags:
+ - gguf
+ - ggml
+ - quantized
+ ---
+
+ # Mermaid-Llama-3-5B-Pruned-GGUF
+
+ Quantized GGUF model files for [Mermaid-Llama-3-5B-Pruned](https://huggingface.co/TroyDoesAI/Mermaid-Llama-3-5B-Pruned) from [TroyDoesAI](https://huggingface.co/TroyDoesAI).
+
+ ## Original Model Card:
+
+ # Mermaid-Llama-3-5B
+
+ Introducing Mermaid-Llama-3-5B, a language model designed for understanding Python code and generating story flow maps as Mermaid diagrams.
+
+ ![MermaidLlama GIF](Mermaid_ShowCase/MermaidLlama.webp)
+
+ ## Key Features
+
+ 1. **Code Understanding:**
+    - Masters Python intricacies with finesse.
+    - Generates clear and accurate Mermaid Diagram Flow Charts.
+    - Ideal for developers seeking visual representations of their code logic.
+
+ 2. **Storytelling Capabilities:**
+    - Converts narrative inputs into captivating Mermaid Diagrams.
+    - Maps character interactions, plot developments, and narrative arcs.
+
+ 3. **Unmatched Performance:**
+    - Surpasses GPT-4 in generating well-organized Mermaid Diagrams.
+
+ 4. **Training Insights:**
+    - Trained on a diverse dataset, including 800 unique, hand-curated Mermaid Graph examples utilizing 478 complete Python programs.
+    - Exhibits emergent properties in story-to-flow map translations and step-by-step instruction flow maps.
+
+ ## Collaboration
+
+ Interested in enhancing Mermaid's capabilities? Contact troydoesai@gmail.com for collaboration opportunities.
+
+ ## Example Use Cases
+ - **Retrieval-Augmented Generation (RAG):** Use Mermaid-Llama-3-8B to create condensed knowledge graphs in the form of flow diagrams. The graphs are stored in a vector database for fast retrieval of contextually relevant information. When a query arrives, the system retrieves the most pertinent knowledge graph and appends it to the model's context, which lets Mermaid-Llama-3-8B give more accurate and nuanced responses. This approach suits applications that need deep, context-aware interactions, such as sophisticated Q&A systems, dynamic data analysis, and complex decision-making tasks.
+ - **Code Documentation:** Automatic visual flow charts from Python code.
+ - **Storyboarding:** Visually appealing diagrams for storytelling.
+ - **Project Planning:** Visual project flow maps for effective team communication.
+ - **Learning Python:** Helps students visually understand Python code structures.
+ - **Game Design:** Visualizing game storylines for coherent narrative structure.
+
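The RAG use case above can be sketched roughly as follows. This is only an illustration under stated assumptions: `GraphStore` is a hypothetical in-memory stand-in for a vector database, `embed` is a toy bag-of-words embedding rather than a real embedding model, and the prompt layout in `build_prompt` is invented for the example and is not the author's prompt template.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: lowercase bag-of-words counts (stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9_]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class GraphStore:
    """Hypothetical in-memory stand-in for a vector database of Mermaid graphs."""

    def __init__(self) -> None:
        self._items = []  # list of (embedding, mermaid_source) pairs

    def add(self, mermaid_source: str) -> None:
        self._items.append((embed(mermaid_source), mermaid_source))

    def retrieve(self, query: str) -> str:
        """Return the stored graph most similar to the query (store must be non-empty)."""
        q = embed(query)
        return max(self._items, key=lambda item: cosine(q, item[0]))[1]


def build_prompt(store: GraphStore, query: str) -> str:
    """Append the retrieved graph as context ahead of the question (assumed layout)."""
    graph = store.retrieve(query)
    return f"Context (Mermaid knowledge graph):\n{graph}\n\nQuestion: {query}\nAnswer:"


store = GraphStore()
store.add("graph TD\n  login[User login] --> check{Password valid?}\n  check -->|yes| home[Open dashboard]")
store.add("graph TD\n  order[Place order] --> pay[Charge card]\n  pay --> ship[Ship parcel]")

prompt = build_prompt(store, "What happens after the user login password check?")
```

The resulting `prompt` string would then be passed to the model for completion. In practice the toy embedding would be swapped for a real embedding model and a proper vector database.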
+ ## Proof of Concept
+
+ Stay tuned for the release of the VSCode Extension that displays the Live Flow Map every time a user stops typing for more than 10 seconds.
+
+ ## Training Specifications
+
+ - **LoRA Rank:** 2048
+ - **LoRA Alpha:** 4096
+ - **Batch Size:** 1
+ - **Micro Batch Size:** 1
+ - **Cutoff Length:** 4096
+ - **Save every n steps:** 1000
+ - **Epochs:** 3
+ - **Learning Rate:** 1e-6
+ - **LR Scheduler:** Cosine
+
+ **Target Modules:**
+ - Enable q_proj
+ - Enable v_proj
+ - Enable k_proj
+ - Enable o_proj
+ - Enable gate_proj
+ - Enable down_proj
+ - Enable up_proj
+
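For readers who want these settings in machine-readable form, here is a minimal sketch that collects the values listed above into a plain Python dataclass. The class name `LoraSettings` and its field names are hypothetical and not tied to any training library. The `scaling` property assumes the standard LoRA formulation, where the low-rank update is scaled by alpha / rank. The `gradient_accumulation_steps` property assumes the common convention of dividing batch size by micro batch size. The card itself does not state either of these.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraSettings:
    """Plain container for the values in the card; field names are illustrative."""

    rank: int = 2048
    alpha: int = 4096
    batch_size: int = 1
    micro_batch_size: int = 1
    cutoff_length: int = 4096
    save_every_n_steps: int = 1000
    epochs: int = 3
    learning_rate: float = 1e-6
    lr_scheduler: str = "cosine"
    target_modules: tuple = (
        "q_proj",
        "v_proj",
        "k_proj",
        "o_proj",
        "gate_proj",
        "down_proj",
        "up_proj",
    )

    @property
    def scaling(self) -> float:
        """LoRA scaling factor alpha / rank (assumes the standard formulation)."""
        return self.alpha / self.rank

    @property
    def gradient_accumulation_steps(self) -> int:
        """Assumed convention: batch size divided by micro batch size."""
        return self.batch_size // self.micro_batch_size


settings = LoraSettings()
```

Under those assumptions, the listed rank and alpha give a scaling factor of 2.0, and a batch size equal to the micro batch size means no gradient accumulation.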
+ ## Getting Started
+
+ Start by downloading one of my models.
+
+ ![0 TroyDoesAI GIF](Mermaid_ShowCase/0_TroyDoesAI.gif)
+
+ Load the model.
+
+ ![1 Load Model in 4-bit Show Example Use GIF](Mermaid_ShowCase/1_LoadModel_in_4bit_Show_Example_Use.gif)
+
+ Use my prompt template to generate a Mermaid code block, which can be viewed in the Mermaid Live Editor or using the Mermaid CLI tool.
+
+ ![2 Loaded Model in Full Precision 16-bit Show Inference and Mermaid Live Editor GIF](Mermaid_ShowCase/2_Loaded_Model_in_Full_Precision_16bit_Show_Inference_and_Mermaid_Live_editor.gif)
+
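Once the model has produced its completion, the Mermaid code block has to be pulled out of the raw text before it can be viewed. The sketch below shows one way to do that, assuming the model wraps the diagram in a fenced block labelled `mermaid`; the card does not specify the exact output format. The helper names `extract_mermaid` and `save_diagram` are hypothetical.

```python
import re
from pathlib import Path
from typing import Optional

# Built programmatically so this example nests cleanly inside a Markdown fence.
FENCE = "`" * 3

# Matches a fenced block labelled "mermaid" and captures its body.
MERMAID_BLOCK = re.compile(FENCE + r"mermaid\s*\n(.*?)" + FENCE, re.DOTALL)


def extract_mermaid(model_output: str) -> Optional[str]:
    """Return the body of the first mermaid fenced block, or None if absent."""
    match = MERMAID_BLOCK.search(model_output)
    return match.group(1).strip() if match else None


def save_diagram(model_output: str, path: str = "diagram.mmd") -> Optional[Path]:
    """Write the extracted diagram source to a .mmd file; return its path."""
    source = extract_mermaid(model_output)
    if source is None:
        return None
    target = Path(path)
    target.write_text(source + "\n", encoding="utf-8")
    return target


# Illustrative completion text, not real model output.
sample_output = (
    "Here is the flow:\n"
    + FENCE + "mermaid\n"
    + "graph TD\n    A[Start] --> B[End]\n"
    + FENCE + "\n"
)

diagram = extract_mermaid(sample_output)
```

The extracted text can be pasted into the Mermaid Live Editor. Alternatively, a file written by `save_diagram` can be rendered with the Mermaid CLI, for example `mmdc -i diagram.mmd -o diagram.svg`.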
+ Here we open the VLLM GUI program while Mermaid-Llama-8B is still loaded in VRAM, to compare the flow diagram with the actual program and to show the lightweight capabilities of small models on consumer hardware.
+
+ ![3 Open The Program VLLM Program With Full Precision Mermaid-Llama-8B Running to Evaluate Flow Map GIF](Mermaid_ShowCase/3_Open_The_Program_VLLM_Program_With_Full_Precision_Mermaid-Llama-8B-Running_to_evaluate_flow_map.gif)
+
+ ## More on my VLLM Class and inference GUI: https://github.com/Troys-Code/VLLM
+
+ ![Python RtdBsaz8gy GIF](Mermaid_ShowCase/python_RtdBsaz8gy.gif)
+ ---
+
+ Note: Treat this model as an auto-complete model. Do not try talking to it in chat; you will get garbage, because those layers have been pruned and replaced. That is all you will hear of my secret sauce for training on small (< 1000 entry) datasets.