Update README.md
#1
by
sminkim
- opened
README.md
CHANGED
@@ -1,200 +1,73 @@
|
|
1 |
-
---
|
2 |
-
library_name: transformers
|
3 |
-
tags:
|
4 |
-
- unsloth
|
5 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
6 |
|
7 |
# Model Card for Model ID
|
8 |
|
9 |
-
|
10 |
-
|
11 |
-
|
12 |
|
13 |
## Model Details
|
|
|
14 |
|
15 |
-
|
16 |
-
|
17 |
-
<!-- Provide a longer summary of what this model is. -->
|
18 |
-
|
19 |
-
This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
|
20 |
|
21 |
-
|
22 |
-
- **Funded by [optional]:** [More Information Needed]
|
23 |
-
- **Shared by [optional]:** [More Information Needed]
|
24 |
-
- **Model type:** [More Information Needed]
|
25 |
-
- **Language(s) (NLP):** [More Information Needed]
|
26 |
-
- **License:** [More Information Needed]
|
27 |
-
- **Finetuned from model [optional]:** [More Information Needed]
|
28 |
|
29 |
-
|
|
|
|
|
|
|
|
|
|
|
30 |
|
31 |
-
|
|
|
|
|
|
|
32 |
|
33 |
-
|
34 |
-
- **
|
35 |
-
- **Demo [optional]:** [More Information Needed]
|
36 |
|
37 |
## Uses
|
38 |
|
39 |
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
|
|
|
|
|
|
40 |
|
41 |
-
|
42 |
-
|
43 |
-
|
44 |
-
|
45 |
-
[More Information Needed]
|
46 |
-
|
47 |
-
### Downstream Use [optional]
|
48 |
-
|
49 |
-
<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
|
50 |
-
|
51 |
-
[More Information Needed]
|
52 |
|
53 |
-
|
54 |
-
|
55 |
-
|
56 |
-
|
57 |
-
[More Information Needed]
|
58 |
|
59 |
## Bias, Risks, and Limitations
|
60 |
|
61 |
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
62 |
-
|
63 |
-
|
64 |
-
|
65 |
-
|
66 |
-
|
67 |
-
|
68 |
-
|
69 |
-
|
70 |
-
|
71 |
-
|
72 |
-
|
73 |
-
|
74 |
-
|
75 |
-
|
76 |
-
|
77 |
-
## Training Details
|
78 |
-
|
79 |
-
### Training Data
|
80 |
-
|
81 |
-
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
82 |
-
|
83 |
-
[More Information Needed]
|
84 |
-
|
85 |
-
### Training Procedure
|
86 |
-
|
87 |
-
<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
|
88 |
-
|
89 |
-
#### Preprocessing [optional]
|
90 |
-
|
91 |
-
[More Information Needed]
|
92 |
-
|
93 |
-
|
94 |
-
#### Training Hyperparameters
|
95 |
-
|
96 |
-
- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
|
97 |
-
|
98 |
-
#### Speeds, Sizes, Times [optional]
|
99 |
-
|
100 |
-
<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
|
101 |
-
|
102 |
-
[More Information Needed]
|
103 |
-
|
104 |
-
## Evaluation
|
105 |
-
|
106 |
-
<!-- This section describes the evaluation protocols and provides the results. -->
|
107 |
-
|
108 |
-
### Testing Data, Factors & Metrics
|
109 |
-
|
110 |
-
#### Testing Data
|
111 |
-
|
112 |
-
<!-- This should link to a Dataset Card if possible. -->
|
113 |
-
|
114 |
-
[More Information Needed]
|
115 |
-
|
116 |
-
#### Factors
|
117 |
-
|
118 |
-
<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
|
119 |
-
|
120 |
-
[More Information Needed]
|
121 |
-
|
122 |
-
#### Metrics
|
123 |
-
|
124 |
-
<!-- These are the evaluation metrics being used, ideally with a description of why. -->
|
125 |
-
|
126 |
-
[More Information Needed]
|
127 |
-
|
128 |
-
### Results
|
129 |
-
|
130 |
-
[More Information Needed]
|
131 |
-
|
132 |
-
#### Summary
|
133 |
-
|
134 |
-
|
135 |
-
|
136 |
-
## Model Examination [optional]
|
137 |
-
|
138 |
-
<!-- Relevant interpretability work for the model goes here -->
|
139 |
-
|
140 |
-
[More Information Needed]
|
141 |
-
|
142 |
-
## Environmental Impact
|
143 |
-
|
144 |
-
<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
|
145 |
-
|
146 |
-
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
|
147 |
-
|
148 |
-
- **Hardware Type:** [More Information Needed]
|
149 |
-
- **Hours used:** [More Information Needed]
|
150 |
-
- **Cloud Provider:** [More Information Needed]
|
151 |
-
- **Compute Region:** [More Information Needed]
|
152 |
-
- **Carbon Emitted:** [More Information Needed]
|
153 |
-
|
154 |
-
## Technical Specifications [optional]
|
155 |
-
|
156 |
-
### Model Architecture and Objective
|
157 |
-
|
158 |
-
[More Information Needed]
|
159 |
-
|
160 |
-
### Compute Infrastructure
|
161 |
-
|
162 |
-
[More Information Needed]
|
163 |
-
|
164 |
-
#### Hardware
|
165 |
-
|
166 |
-
[More Information Needed]
|
167 |
-
|
168 |
-
#### Software
|
169 |
-
|
170 |
-
[More Information Needed]
|
171 |
-
|
172 |
-
## Citation [optional]
|
173 |
-
|
174 |
-
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
175 |
-
|
176 |
-
**BibTeX:**
|
177 |
-
|
178 |
-
[More Information Needed]
|
179 |
-
|
180 |
-
**APA:**
|
181 |
-
|
182 |
-
[More Information Needed]
|
183 |
-
|
184 |
-
## Glossary [optional]
|
185 |
-
|
186 |
-
<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
|
187 |
-
|
188 |
-
[More Information Needed]
|
189 |
-
|
190 |
-
## More Information [optional]
|
191 |
-
|
192 |
-
[More Information Needed]
|
193 |
-
|
194 |
-
## Model Card Authors [optional]
|
195 |
-
|
196 |
-
[More Information Needed]
|
197 |
-
|
198 |
-
## Model Card Contact
|
199 |
-
|
200 |
-
[More Information Needed]
|
|
|
1 |
+
---
|
2 |
+
library_name: transformers
|
3 |
+
tags:
|
4 |
+
- unsloth
|
5 |
+
license: gemma
|
6 |
+
datasets:
|
7 |
+
- SoyunKim/startrup
|
8 |
+
language:
|
9 |
+
- ko
|
10 |
+
base_model:
|
11 |
+
- google/gemma-2-2b
|
12 |
+
---
|
13 |
|
14 |
# Model Card for Model ID
|
15 |
|
16 |
+
**StartBridge** is a machine learning model that helps startups find relevant 2024 government startup support programs by providing quick, personalized answers about funding and opportunities.
|
|
|
|
|
17 |
|
18 |
## Model Details
|
19 |
+
Introducing **StartBridge**, your go-to guide for navigating 2024 government support programs for startups! Designed with fine-tuning machine learning, **StartBridge** bridges the gap between you and the vast world of public funding, making it easier than ever to find the right opportunities.
|
20 |
|
21 |
+
Whether you’re an entrepreneur searching for startup grants or a business in need of early-stage funding, **StartBridge** swiftly delivers clear, relevant, and personalized answers to your questions. With **StartBridge**, no more sifting through countless websites or documents—simply ask, and get connected to the right support in seconds. It's not just a tool, it's your personal bridge to growth and innovation!
|
|
|
|
|
|
|
|
|
22 |
|
23 |
+
### Model Description
|
|
|
|
|
|
|
|
|
|
|
|
|
24 |
|
25 |
+
- **Developed by:** Soyun Kim, Su-min Kim
|
26 |
+
- **Activity with:** MLB 2024, Gemma Sprint
|
27 |
+
- **Model type:** Causal Language Model (AutoModelForCausalLM)
|
28 |
+
- **Language(s) (NLP):** Korean
|
29 |
+
- **License:** Gemma Term of Use
|
30 |
+
- **Finetuned from model:** google/gemma2-2b
|
31 |
|
32 |
+
**Model Sources**
|
33 |
+
- **Reference #1:** https://github.com/unslothai/unsloth.git
|
34 |
+
- **Reference #2:** https://www.sktenterprise.com/bizInsight/blogDetail/dev/9480
|
35 |
+
- **Reference #3:** https://unfinishedgod.netlify.app/2024/06/15/llm-unsloth-gguf/
|
36 |
|
37 |
+
**Data Resource**
|
38 |
+
- **Resource**: https://huggingface.co/datasets/SoyunKim/startrup
|
|
|
39 |
|
40 |
## Uses
|
41 |
|
42 |
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
43 |
+
1. **Intended Use**:
|
44 |
+
- **StartBridge** is designed to help users find relevant government support programs specifically targeted towards startups. It serves as a query-based system where users can ask questions about available programs, eligibility criteria, deadlines, application processes, and any other relevant details.
|
45 |
+
- The model can provide information by matching user queries with existing data about various support programs, simplifying the process for startups seeking government assistance.
|
46 |
|
47 |
+
2. **Foreseeable Users**:
|
48 |
+
- **Startups and Entrepreneurs**: The primary users of the model will be founders, entrepreneurs, or teams looking to access government grants, subsidies, tax reliefs, or any other form of financial or administrative support.
|
49 |
+
- **Business Advisors and Consultants**: Professionals who assist startups in navigating the complex landscape of government support programs may use the model to provide quick and accurate answers to their clients.
|
50 |
+
- **Incubators and Accelerators**: Organizations that provide mentorship and resources to early-stage startups may also use the model to assist their cohorts in understanding available government programs.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
51 |
|
52 |
+
3. **Those Affected by the Model**:
|
53 |
+
- **Entrepreneurs**: With better access to information, entrepreneurs may have a more equitable chance of benefiting from the support programs, potentially influencing startup growth rates and success rates.
|
54 |
+
- **Policymakers**: Insights from the types of questions users ask can help policymakers refine or promote certain programs, ensuring that they meet the needs of startups more effectively.
|
|
|
|
|
55 |
|
56 |
## Bias, Risks, and Limitations
|
57 |
|
58 |
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
59 |
+
1. **Bias**:
|
60 |
+
- **StartBridge** has been primarily trained on data from **2024 government startup support programs** focusing specifically on **commercialization support** (사업화지원). This creates a potential bias, as the model's responses will be most accurate for questions related to this area.
|
61 |
+
- Programs supporting other aspects of startups, such as funding for research and development (R&D), international expansion, or human resources, may not be covered as thoroughly, leading to gaps in the information provided.
|
62 |
+
- The list of programs currently covered is as follows:
|
63 |
+
민관공동창업자발굴육성사업, 예비창업패키지, 초기창업패키지, 창업도약패키지, 초격차 스타트업 1000+ 프로젝트, 아기유니콘 육성사업, 재도전성공패키지, 창업중심대학, 생애최초 청년창업 지원사업, 공공기술 창업사업화 지원사업, 창업성공패키지(청년창업사관학교), 로컬크리에이터 육성사업, K-Global 액셀러레이터 육성사업, 글로벌 ICT 미래 유니콘 육성, 데이터 활용 사업화 지원사업(DATA-Stars), 에코스타트업 지원사업, 대한민국 물산업 혁신창업 대전, 물드림 사업화지원, 예술기업 성장 지원, 스포츠산업 창업 지원, 스포츠산업 창업중기(액셀러레이팅) 지원, 콘텐츠 아이디어 사업화 지원, 콘텐츠 초기창업 육성 지원, 콘텐츠 창업도약 프로그램, 선도기업 연계 동반성장 지원(콘텐츠 오픈이노베이션), 전통문화 청년창업 육성지원사업, 관광벤처사업 공모전, 관광 액셀러레이팅 프로그램, 신사업창업사관학교, 농식품 벤처육성지원, 농식품 기술창업 액셀러레이터 육성지원, 농식품 기술평가지원, 농식품 판로지원, 유망 창업기업 투자유치 지원사업
|
64 |
+
|
65 |
+
2. **Risks**:
|
66 |
+
- **Outdated Information**: Since the model is trained on a specific dataset from 2024, there is a risk that it may provide outdated or incomplete information if not updated regularly, especially as new government programs or changes to existing programs emerge.
|
67 |
+
- **Over-reliance on the Model**: Users may rely solely on the model without verifying the information through official channels, potentially leading to missed opportunities or misunderstandings in program eligibility and application requirements.
|
68 |
+
- **Limited Context**: The model might lack the ability to fully understand complex or nuanced queries that require knowledge beyond the provided dataset, which could result in incorrect or incomplete responses.
|
69 |
+
|
70 |
+
3. **Limitations**:
|
71 |
+
- **Data Scope**: As of now, **StartBridge** is trained only on data related to **commercialization support programs** in 2024. This limits the model’s effectiveness when users ask about other areas of startup support such as tax incentives, infrastructure support, or legal assistance.
|
72 |
+
- **Language and Terminology Variations**: If users use different terms or phrases that aren't covered in the dataset, the model may struggle to interpret the question correctly.
|
73 |
+
- **Future Updates**: The model is expected to be updated continuously as more data on new government programs and support areas become available. While this will expand its scope, users must be aware that the current version is limited in coverage.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|