v000000's picture
Update README.md
2f69e60 verified
|
raw
history blame
No virus
2.56 kB
metadata
base_model: v000000/L3-11.5B-DuS-FrankenRoot
library_name: transformers
tags:
  - mergekit
  - merge
  - llama-cpp
  - llama

v000000/L3-11.5B-DuS-MoonRoot-Q6_K

This model was converted to GGUF format from v000000/L3-11.5B-DuS-MoonRoot using llama.cpp Refer to the original model card for more details on the model.'

image/png

Pretty good understanding gets the percentage wrong but understands.

user: A dead cat is placed into a box along with a nuclear isotope, a vial of poison and a radiation detector.
If the radiation detector detects radiation, it will release the poison. The box is opened one day later.
What is the probability of the cat being alive?

assistant: The answer is 100%. Since the cat is already dead when it was placed in the box,
there is no possibility for it to be alive when the box is opened...

Shows similar emergent language nuance abilities compared to 8B.

Unaligned and somewhat lazy.


base_model: - Cas-Warehouse/Llama-3-MopeyMule-Blackroot-8B - v000000/L3-8B-Poppy-Moonfall-C library_name: transformers tags: - mergekit - merge - llama

Llama-3-11.5B-Depth-Upscaled-MoonRoot

experiement, no continued finetuning

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the passthrough merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

slices:
  - sources:
    - model: v000000/L3-8B-Poppy-Moonfall-C
      layer_range: [0, 24]
  - sources:
    - model: Cas-Warehouse/Llama-3-MopeyMule-Blackroot-8B
      layer_range: [8, 32]
merge_method: passthrough
dtype: bfloat16

base_model:

  • Cas-Warehouse/Llama-3-MopeyMule-Blackroot-8B
  • v000000/L3-8B-Poppy-Moonfall-C

Prompt Template:

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>

{input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{output}<|eot_id|>