view post Post Diaries of Open Source. Part 1.What a week! Here are some of the exciting Open Source releases of the week!1. BigCode releases The Stack v2 and StarCoder 2Resources in https://huggingface.co/posts/loubnabnl/596860170283496Blog https://huggingface.co/blog/starcoder2Collection: bigcode/starcoder2-65de6da6e87db3383572be1a2. Playground v2.5, a very powerful new text-to-image modelModel: playgroundai/playground-v2.5-1024px-aestheticDemo: playgroundai/playground-v2.5Blog: https://playground.com/blog/playground-v2-53.Evo: DNA foundation models Blog: https://arcinstitute.org/news/blog/evoModels: togethercomputer/evo-1-131k-base4. OpenHermesPreferences: a dataset of ~1 million AI Preferences argilla/OpenHermesPreferences5. SpeechBrain 1.0: a toolkit with hundreds of recipes and pretrained models for audio-related tasks, such as speech recognition, diarization, and enhancement. New major release!HF repos: speechbrain Website: https://speechbrain.github.io/6. Tower: a suite of Llama-based multilingual translation models Unbabel/tower-659eaedfe36e6dd29eb1805c7. AllenAI releases OLMo-7B-Instruct allenai/olmo-suite-65aeaae8fe5b6b2122b467788. DIBT - An crowdsourced effort to human-rate prompts. Its 10k prompts dataset is released ttps://huggingface.co/datasets/DIBT/10k_prompts_ranked9. ChatMusician: A Llama 2 fine-tuned model for music generation m-a-p/ChatMusician10. Bonito, an model that converts data into synthetic instruction datasetsGitHub: https://github.com/BatsResearch/bonitoModel: BatsResearch/bonito-v1Paper: Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation (2402.18334) 3 replies · ❤️ 13 13 👍 2 2 + Reply