davidberenstein1957 HF staff committed on
Commit
080f560
1 Parent(s): d6f9651

docs: Update readme

Files changed (1)
  1. README.md +4 -6
README.md CHANGED
@@ -21,24 +21,20 @@ hf_oauth_scopes:
 Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 <div class="header-container">
-<div class="logo-container">
-<a href="https://github.com/argilla-io/distilabel" target="_blank" rel="noopener noreferrer">
-<img src="https://distilabel.argilla.io/latest/assets/distilabel-black.svg" alt="Distilabel Logo" style="width: 150px; height: auto;">
-</a>
-</div>
 <div class="title-container">
 <h1 style="margin: 0; font-size: 2em;">🧬 Synthetic Data Generator</h1>
 <p style="margin: 10px 0 0 0; color: #666; font-size: 1.1em;">Build datasets using natural language</p>
 </div>
 </div>
 <br>
+
 This repository contains the code for the [free Synthetic Data Generator app](https://huggingface.co/spaces/argilla/synthetic-data-generator), which is hosted on the Hugging Face Hub.
 
 ## How it works?
 
 ![Synthetic Data Generator](https://huggingface.co/spaces/argilla/synthetic-data-generator/resolve/main/assets/flow.png)
 
-Distilabel Synthetic Data Generator is an experimental tool that allows you to easily create high-quality datasets for training and fine-tuning language models. It leverages the power of distilabel and advanced language models to generate synthetic data tailored to your specific needs.
+Distilabel Synthetic Data Generator is a tool that allows you to easily create high-quality datasets for training and fine-tuning language models. It leverages the power of distilabel and advanced language models to generate synthetic data tailored to your specific needs.
 
 This tool simplifies the process of creating custom datasets, enabling you to:
 
@@ -59,6 +55,8 @@ pip install -r requirements.txt
 python app.py
 ```
 
+Note that you do need to have an `HF_TOKEN` that can make calls to the free serverless Hugging Face Inference Endpoints. You can get one [here](https://huggingface.co/settings/tokens/new?ownUserPermissions=repo.content.read&ownUserPermissions=repo.write&globalPermissions=inference.serverless.write&tokenType=fineGrained).
+
 ## Do you need more control?
 
 Each pipeline is based on a distilabel component, so you can easily run it locally or with other LLMs.
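
The note added in this commit says an `HF_TOKEN` is required before running `python app.py`. A minimal sketch of a pre-flight check follows. It assumes the token is supplied through an environment variable named `HF_TOKEN`, which the README implies but does not spell out. The helper `require_hf_token` is hypothetical and is not part of the repository.

```python
import os


def require_hf_token(env=None):
    """Return the token found under HF_TOKEN, or raise with a hint if it is missing.

    `env` defaults to os.environ; passing a plain dict makes the check easy to test.
    Assumption: the app reads the token from the HF_TOKEN environment variable.
    """
    env = os.environ if env is None else env
    token = env.get("HF_TOKEN", "").strip()
    if not token:
        raise RuntimeError(
            "HF_TOKEN is not set. Create a token that can call the serverless "
            "Inference API and export it before running `python app.py`."
        )
    return token


if __name__ == "__main__":
    try:
        require_hf_token()
        print("HF_TOKEN found")
    except RuntimeError as err:
        print(err)
```

This only checks that a value is present. It does not verify that the token carries the permissions listed in the token-creation link above.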