Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
KingNish 
posted an update 15 days ago
Post
3648
Introducing OpenGPT-4o
KingNish/OpenGPT-4o

Features:
1️⃣ Inputs possible are Text ✏️, Text + Image 📝🖼️, Audio 🎧, WebCam📸
and outputs possible are Image 🖼️, Image + Text 🖼️📝, Text 📝, Audio 🎧
2️⃣ Flat 100% FREE 💸 and Super-fast ⚡.
3️⃣ Publicly Available before GPT 4o.

Future Features:
1️⃣ Chat with PDF (Both voice and text)
2️⃣ Video generation.
3️⃣ Sequential Image Generation.
4️⃣ Better UI and customization.

Note: It's not possible to reach level of complexity of GPT 4o because OpenAI has been developing GPT-4o from six months with a team of over 450+ experienced members, Whereas I am only One. Moreover, they haven't released it fully publicly, So, it remains a test model.

this is working quite well!

I tried with the OAI example and it worked nicely! https://huggingface.co/spaces/KingNish/GPT-4o/discussions/1

This is amazing!

Out of curiosity did you use dev mode while building it?

·

Yes, but how you know

I tried it

·

any suggestions

This comment has been hidden

what model did you use to build it ?
And is it possible to make a blog on how did you make it ?

·

Super Chat Model - Idefics 2
Image Generation Model - Pollination Ai Api
Speech to Text - Nemo (API)
Voice Chat (Base Model) - Mixtral 8x7b (Inference API)
Text to Speech - Edge tts (API)
Live Chat (base model) - uform gen2 dpo

is it possible to make a blog on how did you make it ?

Okay, after the video chat is completed.

can we use this model?

·

yes

@KingNish
Which one is better?

Model Names: gpt-4-turbo-preview, gpt-4-vision-preview, gpt-3.5-turbo-16k
Searchable Models: Creative, Balanced, Precise

Image creation will be available soon in NiansuhAI.
Model Name: DALL-E 3


·

@Niansuh I am not able to check this.
image.png