Nishith Jain
AI & ML interests
Articles
Organizations
KingNish's activity
1. Chat with Google Agent - This includes three AI models that allow you to converse with an AI, which provides answers by searching Google.
Demo Link: poscye/google-go
2. HelpingAI 9B - A model that surpassed all top AIs with the highest EQ benchmark score of 89.23. It specializes in understanding human emotions and responding in human style.
Demo Link: Abhaykoul/HelpingAI-9B
Model Link: OEvortex/HelpingAI-9B
Blog Link: https://huggingface.co/blog/KingNish/helpingai-9b
My partner has dealt with the licenses. Actually, I'm not sure what he intends to do next, but for now, it is an open-source project.
Don't act like kid, First decide with your partner that project is opensource for lifetime or not.
This is a opensource project
Then why you confusing peoples with custom license. Choose any better license. like Apache 2.0 or any poplar.
@alan45x try helping AI from here https://huggingface.co/spaces/Abhaykoul/HelpingAI-9B
Yes, you can use them but...
with limitations like
You can't use DallE 😥,
You can't make Custom GPTs
And chat limit also😥.
But...
We already have an open-source alternative like Hugging Chat, where you can create your custom assistant, generate, edit images, without any chat limit.
Try both of them from here:
https://chatgpt.com/gpts
https://huggingface.co/chat
and don't forget to Give your review here 👇:
Thanks for reporting the issue.
You encountered the issue because you entered text before image.
But I solved the issue by adding dropdown menu to select task.
Well, AI automatically determines the task you want but if it hallucinates just select correct task type.
I wasn't aware of it before. I've tried it now, and it's better than the standard pix2pix; the outputs are even more realistic.
Thank you for the suggestion.🤗
KingNish/Image-Gen-Pro
It is Expert in Text to Image generation, Sequential Image generation or Image Editing.
Examples:
how to access??
any Sample Space Please.
Thanks! 🤗
1. Dedicated Image and Video Engine
2. Model Choices for Voice Chat
3. Better and Faster Voice Chat
4. Various Bug fixes
Test and give feedback of New features:
KingNish/OpenGPT-4o
Future Updates:
1. Web Search (Suggested by @GPT007 and @Saionton )
2. Live Chat with Voice Chat
3. Model Choices (Suggested by @NotAiLOL )
4. Multilingual Chats.
Suggest more features that should be added. 🤗
Thanks!
Start with Learning basic Python
Then Learn from Other spaces how they work.
Always stay Curious.
46C h yha Indore me.
Garmi ka aanand le rhe, Pak me garmi kesi par rhi h
Me from Pakistan
Hello, Neighbour
you are from Germany
No, India
Amazing, Its Fast and provides various customizations.
@Niansuh
I am not able to check this.
@Saionton
Created new dedicated image generation module and 1st model there is DallE. its working super fine.
Thanks for suggestion.
Currently not, in future may be.
Lots of restrictions by Microsoft.
But some people gonna remove restrictions 🤣.
1. Phi 3 Medium (4k and 128k): A 14b Instruct tuned models that outperformed big models like Command R+ (104b), GPT 3.5 Pro, Gemini Pro, and is highly competitive with top models such as Mixtral 8x22b, Llama3 70B, and GPT 4.
microsoft/Phi-3-medium-4k-instruct
DEMO: Walmart-the-bag/Phi-3-Medium
2. Phi 3 Mini Vision 128k: A 4.5 billion-parameter, instruction-tuned vision model that has outperformed models such as Llava3 and Claude 3, and is providing stiff competition to Gemini 1Pro Vision.
microsoft/Phi-3-vision-128k-instruct
3. Phi3 Small (8k and 128k): Better than Llama3 8b, Mixtral 8x7b and GPT 3.5 turbo.
microsoft/Phi-3-small-128k-instruct
Why not use bigger computer vision model?i think we already reached enough improvement in language models.we need to focus on text to image and image to text models
Because bigger model requires bigger spaces and also slow down output.
Can you suggest some tools??
but what about updating them or making them private.
yes
Cool, fast, and with excellent image quality.
Demo Link: https://huggingface.co/spaces/KingNish/SDXL-Flash
Currently, I use the Pollination API, which is weak in generating text in images.
But in next update, I'm definitely going to add another powerful image generator.
Well, its speed depends on how many people are using it simultaneously, but let's see if there is a method to increase its speed from my side.
𝗥𝗲𝗮𝗱 𝗙𝘂𝗹𝗹 𝐀𝐫𝐭𝐢𝐜𝐥𝐞: https://huggingface.co/blog/KingNish/decoding-gpt-4o
𝐒𝐮𝐦𝐦𝐚𝐫𝐲 𝐨𝐟 𝐀𝐫𝐭𝐢𝐜𝐥𝐞- 📝
# 𝐌𝐞𝐜𝐡𝐚𝐧𝐢𝐜𝐬 𝐨𝐟 𝐆𝐏𝐓-𝟒’𝐨’: GPT-4’o’ operates through three main components 🛠️
𝟏. 𝐒𝐮𝐩𝐞𝐫𝐂𝐡𝐚𝐭: Integrates image generation, QnA (image, document and video) for diverse interactions.
𝟐. 𝐕𝐨𝐢𝐜𝐞 𝐂𝐡𝐚𝐭: Merges TTS and STT for real-time, human-like audio responses, focusing on human interaction.
𝟑. 𝐕𝐢𝐝𝐞𝐨 𝐂𝐡𝐚𝐭: Utilizes Zero Shot Image Classification to enhance user interaction with visual information.
# 𝐌𝐞𝐭𝐡𝐨𝐝𝐬 𝐭𝐨 𝐂𝐫𝐞𝐚𝐭𝐞 𝐒𝐢𝐦𝐢𝐥𝐚𝐫 𝐀𝐈 🧠
𝟏. 𝐌𝐮𝐥𝐭𝐢𝐌𝐨𝐝𝐚𝐥𝐢𝐟𝐢𝐜𝐚𝐭𝐢𝐨𝐧: Combines multiple models for a powerful, multifunctional AI.
𝟐. 𝐃𝐮𝐜𝐭 𝐓𝐚𝐩𝐞 𝐌𝐞𝐭𝐡𝐨𝐝: Uses different models or APIs for specific tasks without additional training.
The article provides an in-depth exploration of GPT-4’o’, its functionalities, and methods to create similar AI models. It emphasizes the model’s language support and its innovative approach to human-AI interaction. 💡🌐
(𝙉𝙊𝙏𝙀: 𝙎𝙪𝙢𝙢𝙖𝙧𝙮 𝙞𝙨 𝘼𝙄 𝙜𝙚𝙣𝙚𝙧𝙖𝙩𝙚𝙙) ✅
Resolved the issue in live chat; it's now functioning properly.
okk, in next update
@awacke1
Sanskrit could potentially have billions of words because of its flexibility.
For instance, consider the word "water" in Sanskrit; it has three distinct words for each tense—past, present, and future. These then branch into eight 'vibhaktis' or cases, each with a specific use in conversation, resulting in 24 variations of just one word.
Additionally, there are 280 synonyms for "water," leading to approximately 6720 words for a single concept. (Source: https://qr.ae/psiHhb )
This immense flexibility allows for the creation of new words by adhering to certain rules.
These leads to creation of billions of words.
For instance, consider the various names of people around the world; each person's name can be expressed in 24 different ways, depending on the tense and context.
This leads to more than billions of words in sanskrit.
Hope you understands.
https://huggingface.co/spaces/KingNish/paligemma-video-chat
try this same thing
1. Live Chat (also known as video chat) (very powerful and fast, it can even identify famous places and persons)
2. Powerful Image Generation
Test and give feedback of New features:
KingNish/OpenGPT-4o
Future Updates
1. PDF Chat
2. Human like speech (Using Parler tts expresso)
3. Multilingual support for voice chat
Suggest more features that should be added. 🤗
Edit: Live Chat is now very powerful (than prev)
Super Chat Model - Idefics 2
Image Generation Model - Pollination Ai Api
Speech to Text - Nemo (API)
Voice Chat (Base Model) - Mixtral 8x7b (Inference API)
Text to Speech - Edge tts (API)
Live Chat (base model) - uform gen2 dpo
is it possible to make a blog on how did you make it ?
Okay, after the video chat is completed.
But what 's the use of this AI.
This implies that OpenAI provides a less robust model to free subscribers, as it appears to have weaker reasoning and mathematical capabilities.
okk, thanks
Thank you for improving me.
Today, I gained access to GPT-4o, so I thought to test it. However, I encountered several problems, such as When I requested image generation, it did not create any images but only provided links, which are also incorrect. 😥 [Image 1]
Subsequently, I considered that my prompt might be incorrect, I attempted once more with a prompt from OpenAI's examples, but it also did not work. 😥 [Image 2]
Then, I tested its logical reasoning skills, which it failed. I presented a question that an 8b model solved with ease, but GPT-4o could not. 😥 [Image 3]
I also attempted to generate an image from another image, but this too was unsuccessful. [image 4]
Nonetheless, it excels in tasks such as image classification and voice chat.
If you've experienced similar issues, please share them here.
any suggestions
Yes, but how you know
🤣 add this also.
KingNish/OpenGPT-4o
Features:
1️⃣ Inputs possible are Text ✏️, Text + Image 📝🖼️, Audio 🎧, WebCam📸
and outputs possible are Image 🖼️, Image + Text 🖼️📝, Text 📝, Audio 🎧
2️⃣ Flat 100% FREE 💸 and Super-fast ⚡.
3️⃣ Publicly Available before GPT 4o.
Future Features:
1️⃣ Chat with PDF (Both voice and text)
2️⃣ Video generation.
3️⃣ Sequential Image Generation.
4️⃣ Better UI and customization.
Note: It's not possible to reach level of complexity of GPT 4o because OpenAI has been developing GPT-4o from six months with a team of over 450+ experienced members, Whereas I am only One. Moreover, they haven't released it fully publicly, So, it remains a test model.
@singhsidhukuldeep Please correct the link of blog to - https://openai.com/index/hello-gpt-4o/
Hope so.
[ { "from": "human", "value": "Welcome, to HF" }]
Check it out: KingNish/JARVIS
It's done, check it out - https://huggingface.co/spaces/KingNish/IllusionDiffusionVideo
Why are you not continuing it??
Best of Luck bro
It's done, check it out - https://huggingface.co/spaces/KingNish/IllusionDiffusionVideo
https://huggingface.co/spaces/KingNish/IllusionDiffusionVideo
It can Create high quality ULTRA HD illusion video.
If you find any bugs, please let me know😊
Hope to see this on Assistant of the WEEK page.
Would you like to see Illusion Diffusion in Video format. AP123/IllusionDiffusion
Let me Know.