skratos115 commited on
Commit
cfebe63
1 Parent(s): 130bd0c

Initial commit of qwen2-7b-OpenDevin-f16

Browse files
Files changed (3) hide show
  1. .gitattributes +1 -0
  2. README.md +81 -0
  3. qwen2.7b.OD..f16.gguf +3 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ qwen2.7b.OD..f16.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,81 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - text-generation
5
+ - qwen2
6
+ - instruct
7
+ - unsloth
8
+ - OpenDevin
9
+ datasets:
10
+ - xingyaoww/opendevin-code-act
11
+ ---
12
+
13
+ ## Qwen2.7b.OpenDevin
14
+ brought to you by skratos115 (HF) / Kingatlas115 (GH) in colaboration with the official Opendevin Team ~xingyaoww
15
+
16
+ # Qwen2-7B-Instruct with OpenDevin Tool Calling
17
+
18
+ ## Overview
19
+
20
+ This project involves the fine-tuning of the `Qwen2-7B-Instruct` model using the [opendevin-code-act dataset](https://huggingface.co/datasets/xingyaoww/opendevin-code-act) with the help of Unsloth. The primary goal is to develop a more powerful LLM capable of effectively using the CodeAct framework for tool calling. This is still in early development and should not be used in production. We are working on building a bigger dataset for tool paths/ trajectories and could you all the help we can by using the feedback integration to help us build better trajectories and release to the public via MIT license for OSS model training.
21
+ read more here:https://x.com/gneubig/status/1802740786242420896 and http://www.linkedin.com/feed/update/urn:li:activity:7208507606728929280/
22
+
23
+ ## Model Details
24
+
25
+ - **Model Name**: Qwen2-7B-Instruct
26
+ - **Dataset**: [opendevin-code-act](https://huggingface.co/datasets/xingyaoww/opendevin-code-act)
27
+ - **Training Platform**: Unsloth
28
+
29
+ provided full merged files
30
+ or
31
+ Quantized f16, q4_k_m, Q5_k_m, and Q8_0 gguf files.
32
+ I used the qwen2.7b.OD.q4_k_m.gguf for my testing and got it to write me a simple script. more testing to come.
33
+
34
+ ## Running the Model
35
+
36
+ You can run this model using `vLLM` or `ollama`. The following instructions are for using `ollama`.
37
+
38
+ ### Prerequisites
39
+
40
+ - Docker
41
+ - Hugging Face `transformers` library (version >= 4.37.0 is recommended)
42
+
43
+ ### Running with Ollama
44
+
45
+ 1. **Install Docker**: Ensure you have Docker installed on your machine.
46
+
47
+ 2. **Pull the Latest Hugging Face Transformers**:
48
+
49
+ pip install transformers>=4.37.0
50
+
51
+
52
+ 3. **Set Up Your Workspace**:
53
+
54
+ WORKSPACE_BASE=$(pwd)/workspace
55
+
56
+
57
+ 4. **Run the Docker Command**:
58
+ docker run -it \
59
+ --pull=always \
60
+ -e SANDBOX_USER_ID=$(id -u) \
61
+ -e PERSIST_SANDBOX="true" \
62
+ -e LLM_API_KEY="ollama" \
63
+ -e LLM_BASE_URL="http://[yourIPhere or 0.0.0.0]:11434" \
64
+ -e SSH_PASSWORD="make something up here" \
65
+ -e WORKSPACE_MOUNT_PATH=$WORKSPACE_BASE \
66
+ -v $WORKSPACE_BASE:/opt/workspace_base \
67
+ -v /var/run/docker.sock:/var/run/docker.sock \
68
+ -p 3000:3000 \
69
+ --add-host host.docker.internal:host-gateway \
70
+ --name opendevin-app-$(date +%Y%m%d%H%M%S) \
71
+ ghcr.io/opendevin/opendevin:main
72
+
73
+ Replace `[yourIPhere or 0.0.0.0]` with your actual IP address or use `0.0.0.0` for localhost.
74
+
75
+ ## Early Development
76
+
77
+ This project is in its early stages, and we are continuously working to improve the model and its capabilities. Contributions and feedback are welcome.
78
+
79
+ ## License
80
+
81
+ This project is licensed under the [MIT License](LICENSE).
qwen2.7b.OD..f16.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0e8b31006bf5dccc50d884fec9fce868a499b92ac1956c8ee4c77924b4178fd5
3
+ size 15237850656