{% extends "page.html" %} {% block stylesheet %} {% endblock %} {% block site %}
This Space gives you an easy way to start generating synthetic datasets, using Spaces compute to host open LLMs. It comes with a ready-to-use environment and a series of notebooks with examples of synthetic dataset generation.
Currently this Space has notebooks covering the following topics:
A set of notebooks covering the steps for creating a synthetic dataset for fine-tuning a sentence similarity model. These notebooks cover:
To use this Space, duplicate it with the duplicate button. Enable persistent storage so you can save your work. To start, you may want a smaller GPU such as the T4, then switch to a larger GPU when you want to generate data with larger models.
{% trans %}No login available, you shouldn't be seeing this page.{% endtrans %}
This template was created by camenduru and nateraw, with contributions from osanseviero and azzr.