Open-source fine-tuned models, datasets, synthetic data pipeline code, and distributed GPU code to democratize access to AI/ML.
Raccoon Research creates data and training pipelines for multimodal generative models.
Discord - GitHub - Hugging Face
Simian is a synthetic data generator that creates image and video data paired with accurate, matching captions. It is designed to help researchers and developers generate high-quality training datasets for computer vision and NLP tasks. With Simian, users can quickly build large datasets for training and improving their models.
Learn more: Simian
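The general idea behind synthetic caption pairs can be illustrated with a minimal sketch. This is not Simian's code or API: the attribute lists and the functions `sample_scene`, `caption_for`, and `make_pairs` are hypothetical names chosen for illustration, and the rendering step is left out entirely. The sketch assumes that the generator samples the scene parameters itself, so the caption can be written from those same parameters and always agrees with the data.

```python
import random

# Hypothetical scene attributes; a real generator would have a far richer vocabulary.
OBJECTS = ["cube", "sphere", "cone"]
COLORS = ["red", "green", "blue"]
POSITIONS = ["left", "center", "right"]


def sample_scene(rng):
    """Sample the parameters that a renderer would turn into an image or video."""
    return {
        "object": rng.choice(OBJECTS),
        "color": rng.choice(COLORS),
        "position": rng.choice(POSITIONS),
    }


def caption_for(scene):
    """Write the caption from the same parameters that define the scene."""
    return f"A {scene['color']} {scene['object']} at the {scene['position']} of the frame."


def make_pairs(n, seed=0):
    """Return n scene/caption pairs; a fixed seed makes the dataset reproducible."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(n):
        scene = sample_scene(rng)
        pairs.append({"scene": scene, "caption": caption_for(scene)})
    return pairs


if __name__ == "__main__":
    for pair in make_pairs(3, seed=42):
        print(pair["caption"])
```

Because each caption is derived from the scene description rather than predicted from pixels, the pairing needs no manual labeling, which is one reason synthetic pipelines can scale to large datasets.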
We are proud to be sponsored by DeepAI. Their support enables us to continue our work in developing cutting-edge data and training pipelines.