view article Article Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI. By KingNish • 8 days ago • 22
llama 3 self-align experiments Collection Replicating the pipeline for StarCoder-2 Instruct on Llama-3-8B with some tweaks https://huggingface.co/blog/sc2-instruct • 4 items • Updated 20 days ago • 6
FIFO-Diffusion: Generating Infinite Videos from Text without Training Paper • 2405.11473 • Published 10 days ago • 49
Edit Your Image! Collection Find all the trending and useful Gradio demos that you can use to edit your images. • 21 items • Updated Apr 26 • 22
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context Paper • 1901.02860 • Published Jan 9, 2019 • 2
Chatbot is Not All You Need: Information-rich Prompting for More Realistic Responses Paper • 2312.16233 • Published Dec 25, 2023 • 2
Instant Space Collection Contains spaces which gives lightning fast results compare to others. • 10 items • Updated 21 days ago • 4
Power Series Collection Finetuned or Merged Model under Power Series • 3 items • Updated 21 days ago • 1
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community • 16 items • Updated 12 days ago • 178
view article Article ⚗️ 🧑🏼🌾 Let's grow some Domain Specific Datasets together By burtenshaw • 30 days ago • 27
view article Article Expanding Model Context and Creating Chat Models with a Single Click By maywell • about 1 month ago • 33
view article Article How to train a new language model from scratch using Transformers and Tokenizers Feb 14, 2020 • 9
view article Article Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data By Pclanglais • Apr 18 • 20