Nishith Jain
AI & ML interests
Articles
Organizations
KingNish's activity
how to access??
any Sample Space Please.
Thanks! 🤗
1. Dedicated Image and Video Engine
2. Model Choices for Voice Chat
3. Better and Faster Voice Chat
4. Various Bug fixes
Test and give feedback of New features:
KingNish/OpenGPT-4o
Future Updates:
1. Web Search (Suggested by @GPT007 and @Saionton )
2. Live Chat with Voice Chat
3. Model Choices (Suggested by @NotAiLOL )
4. Multilingual Chats.
Suggest more features that should be added. 🤗
Thanks!
Start with Learning basic Python
Then Learn from Other spaces how they work.
Always stay Curious.
46C h yha Indore me.
Garmi ka aanand le rhe, Pak me garmi kesi par rhi h
Me from Pakistan
Hello, Neighbour
you are from Germany
No, India
Amazing, Its Fast and provides various customizations.
@Niansuh
I am not able to check this.
@Saionton
Created new dedicated image generation module and 1st model there is DallE. its working super fine.
Thanks for suggestion.
Currently not, in future may be.
Lots of restrictions by Microsoft.
But some people gonna remove restrictions 🤣.
1. Phi 3 Medium (4k and 128k): A 14b Instruct tuned models that outperformed big models like Command R+ (104b), GPT 3.5 Pro, Gemini Pro, and is highly competitive with top models such as Mixtral 8x22b, Llama3 70B, and GPT 4.
microsoft/Phi-3-medium-4k-instruct
DEMO: Walmart-the-bag/Phi-3-Medium
2. Phi 3 Mini Vision 128k: A 4.5 billion-parameter, instruction-tuned vision model that has outperformed models such as Llava3 and Claude 3, and is providing stiff competition to Gemini 1Pro Vision.
microsoft/Phi-3-vision-128k-instruct
3. Phi3 Small (8k and 128k): Better than Llama3 8b, Mixtral 8x7b and GPT 3.5 turbo.
microsoft/Phi-3-small-128k-instruct
Why not use bigger computer vision model?i think we already reached enough improvement in language models.we need to focus on text to image and image to text models
Because bigger model requires bigger spaces and also slow down output.
Can you suggest some tools??
but what about updating them or making them private.
yes
Cool, fast, and with excellent image quality.
Demo Link: https://huggingface.co/spaces/KingNish/SDXL-Flash
Currently, I use the Pollination API, which is weak in generating text in images.
But in next update, I'm definitely going to add another powerful image generator.
Well, its speed depends on how many people are using it simultaneously, but let's see if there is a method to increase its speed from my side.
𝗥𝗲𝗮𝗱 𝗙𝘂𝗹𝗹 𝐀𝐫𝐭𝐢𝐜𝐥𝐞: https://huggingface.co/blog/KingNish/decoding-gpt-4o
𝐒𝐮𝐦𝐦𝐚𝐫𝐲 𝐨𝐟 𝐀𝐫𝐭𝐢𝐜𝐥𝐞- 📝
# 𝐌𝐞𝐜𝐡𝐚𝐧𝐢𝐜𝐬 𝐨𝐟 𝐆𝐏𝐓-𝟒’𝐨’: GPT-4’o’ operates through three main components 🛠️
𝟏. 𝐒𝐮𝐩𝐞𝐫𝐂𝐡𝐚𝐭: Integrates image generation, QnA (image, document and video) for diverse interactions.
𝟐. 𝐕𝐨𝐢𝐜𝐞 𝐂𝐡𝐚𝐭: Merges TTS and STT for real-time, human-like audio responses, focusing on human interaction.
𝟑. 𝐕𝐢𝐝𝐞𝐨 𝐂𝐡𝐚𝐭: Utilizes Zero Shot Image Classification to enhance user interaction with visual information.
# 𝐌𝐞𝐭𝐡𝐨𝐝𝐬 𝐭𝐨 𝐂𝐫𝐞𝐚𝐭𝐞 𝐒𝐢𝐦𝐢𝐥𝐚𝐫 𝐀𝐈 🧠
𝟏. 𝐌𝐮𝐥𝐭𝐢𝐌𝐨𝐝𝐚𝐥𝐢𝐟𝐢𝐜𝐚𝐭𝐢𝐨𝐧: Combines multiple models for a powerful, multifunctional AI.
𝟐. 𝐃𝐮𝐜𝐭 𝐓𝐚𝐩𝐞 𝐌𝐞𝐭𝐡𝐨𝐝: Uses different models or APIs for specific tasks without additional training.
The article provides an in-depth exploration of GPT-4’o’, its functionalities, and methods to create similar AI models. It emphasizes the model’s language support and its innovative approach to human-AI interaction. 💡🌐
(𝙉𝙊𝙏𝙀: 𝙎𝙪𝙢𝙢𝙖𝙧𝙮 𝙞𝙨 𝘼𝙄 𝙜𝙚𝙣𝙚𝙧𝙖𝙩𝙚𝙙) ✅
Resolved the issue in live chat; it's now functioning properly.
okk, in next update
@awacke1
Sanskrit could potentially have billions of words because of its flexibility.
For instance, consider the word "water" in Sanskrit; it has three distinct words for each tense—past, present, and future. These then branch into eight 'vibhaktis' or cases, each with a specific use in conversation, resulting in 24 variations of just one word.
Additionally, there are 280 synonyms for "water," leading to approximately 6720 words for a single concept. (Source: https://qr.ae/psiHhb )
This immense flexibility allows for the creation of new words by adhering to certain rules.
These leads to creation of billions of words.
For instance, consider the various names of people around the world; each person's name can be expressed in 24 different ways, depending on the tense and context.
This leads to more than billions of words in sanskrit.
Hope you understands.
https://huggingface.co/spaces/KingNish/paligemma-video-chat
try this same thing
1. Live Chat (also known as video chat) (very powerful and fast, it can even identify famous places and persons)
2. Powerful Image Generation
Test and give feedback of New features:
KingNish/OpenGPT-4o
Future Updates
1. PDF Chat
2. Human like speech (Using Parler tts expresso)
3. Multilingual support for voice chat
Suggest more features that should be added. 🤗
Edit: Live Chat is now very powerful (than prev)
Super Chat Model - Idefics 2
Image Generation Model - Pollination Ai Api
Speech to Text - Nemo (API)
Voice Chat (Base Model) - Mixtral 8x7b (Inference API)
Text to Speech - Edge tts (API)
Live Chat (base model) - uform gen2 dpo
is it possible to make a blog on how did you make it ?
Okay, after the video chat is completed.
But what 's the use of this AI.
This implies that OpenAI provides a less robust model to free subscribers, as it appears to have weaker reasoning and mathematical capabilities.
okk, thanks
Thank you for improving me.
Today, I gained access to GPT-4o, so I thought to test it. However, I encountered several problems, such as When I requested image generation, it did not create any images but only provided links, which are also incorrect. 😥 [Image 1]
Subsequently, I considered that my prompt might be incorrect, I attempted once more with a prompt from OpenAI's examples, but it also did not work. 😥 [Image 2]
Then, I tested its logical reasoning skills, which it failed. I presented a question that an 8b model solved with ease, but GPT-4o could not. 😥 [Image 3]
I also attempted to generate an image from another image, but this too was unsuccessful. [image 4]
Nonetheless, it excels in tasks such as image classification and voice chat.
If you've experienced similar issues, please share them here.
any suggestions
Yes, but how you know
🤣 add this also.
KingNish/OpenGPT-4o
Features:
1️⃣ Inputs possible are Text ✏️, Text + Image 📝🖼️, Audio 🎧, WebCam📸
and outputs possible are Image 🖼️, Image + Text 🖼️📝, Text 📝, Audio 🎧
2️⃣ Flat 100% FREE 💸 and Super-fast ⚡.
3️⃣ Publicly Available before GPT 4o.
Future Features:
1️⃣ Chat with PDF (Both voice and text)
2️⃣ Video generation.
3️⃣ Sequential Image Generation.
4️⃣ Better UI and customization.
Note: It's not possible to reach level of complexity of GPT 4o because OpenAI has been developing GPT-4o from six months with a team of over 450+ experienced members, Whereas I am only One. Moreover, they haven't released it fully publicly, So, it remains a test model.
@singhsidhukuldeep Please correct the link of blog to - https://openai.com/index/hello-gpt-4o/
Hope so.
[ { "from": "human", "value": "Welcome, to HF" }]
Check it out: KingNish/JARVIS
It's done, check it out - https://huggingface.co/spaces/KingNish/IllusionDiffusionVideo
Why are you not continuing it??
Best of Luck bro
It's done, check it out - https://huggingface.co/spaces/KingNish/IllusionDiffusionVideo
https://huggingface.co/spaces/KingNish/IllusionDiffusionVideo
It can Create high quality ULTRA HD illusion video.
If you find any bugs, please let me know😊
Hope to see this on Assistant of the WEEK page.
Would you like to see Illusion Diffusion in Video format. AP123/IllusionDiffusion
Let me Know.
Super fast! this is awesome!!
Thanks❤
@SvCy Welcome bro❤
JARVIS responds to all your questions in audio format.
Must TRY -> KingNish/JARVIS
Jarvis is currently equipped to accept text input and provide audio output.
In the future, it may also support audio input.
DEMO Video:
How do you get access? activity and such?
cc @KingNish
Go to Community blogpost section on HF official discord.
and simply provide them with your Hugging Face ID.
https://discord.com/channels/879548962464493619/1158330196148097045/1234374442390519813
Power of OPEN SOURCE 💪💪
+ llama 3 beated commandR+
Love to see on Android 🥰😍
Best of Luck😊
Can you give more info about bot like his capabilities and also keep us updated on its future update.