
Tansu Turkoglu

tansutt

AI & ML interests

None yet

Recent Activity

liked a model 4 months ago
RunDiffusion/Juggernaut-XL-v9
updated a dataset 4 months ago
tansutt/MedQA-USMLE-4-options-hf
updated a model 4 months ago
tansutt/medqa-usmle-mistral-7B-Instruct-v0.3

Organizations

MLX Community

tansutt's activity

reacted to mrfakename's post with 👀 11 months ago
Mistral AI recently released a new Mixtral model. It is another Mixture-of-Experts model with 8 experts of 22B parameters each. Running it requires over 200GB of VRAM in float16, or over 70GB in int4; even so, individuals have successfully fine-tuned it on Apple Silicon laptops using the MLX framework. It features a 64K context window, twice the 32K of their previous models.
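As a rough back-of-the-envelope check on those VRAM figures (a sketch only; the ~141B total parameter count is an assumption, since the eight 22B experts share attention layers rather than summing to 176B):

```python
# Rough estimate of memory needed just to hold the model weights.
# Assumes ~141B total parameters for Mixtral 8x22B (an assumption,
# not an official figure) and ignores activation/KV-cache overhead.
def weight_memory_gb(params_billion: float, bytes_per_param: float) -> float:
    """Return approximate weight memory in GB."""
    return params_billion * bytes_per_param

print(weight_memory_gb(141, 2.0))  # float16: 2 bytes/param -> 282.0 GB
print(weight_memory_gb(141, 0.5))  # int4: 0.5 bytes/param -> 70.5 GB
```

Both estimates line up with the "over 200GB in float16" and "over 70GB in int4" figures in the post; real usage is higher once activations and the KV cache are included.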

The model was released via torrent, a distribution method Mistral has often used recently. The license has not been confirmed yet, but a moderator on their Discord server suggested yesterday that it is Apache 2.0.

Sources:
https://twitter.com/_philschmid/status/1778051363554934874
https://twitter.com/reach_vb/status/1777946948617605384