{% extends "page.html" %} {% block stylesheet %} {% endblock %} {% block site %}
This Space gives you an easy way to start generating synthetic datasets, using Spaces compute to host open LLMs. It comes with a ready-to-go environment and a series of notebooks that walk through examples of synthetic dataset generation.
Currently this Space has notebooks covering the following topics:
A set of notebooks covering the steps for creating a synthetic dataset for fine-tuning a sentence similarity model. These notebooks cover:
To use this Space, duplicate it with the duplicate button. Enable persistent storage so that you can save your work. To start, you may want to use a smaller GPU such as the T4, then switch to a larger GPU when you want to generate data with bigger models. Reminder: you can preview the notebooks in the Space without running them. You can find the notebooks in the `notebooks` folder here.
{% trans %}No login available, you shouldn't be seeing this page.{% endtrans %}
This template was created by camenduru and nateraw, with contributions from osanseviero and azzr.