Jawad Mansoor
supercharge19
AI & ML interests
NLP for text and voice (even videos)
RL with multimodaliy models (agent is able to learn human speech as well as can see and make decisions based on what it "sees")
Recent Activity
new activity
5 days ago
reducto/RolmOCR:Recomended GPU size
liked
a model
19 days ago
Banafo/Kroko-ASR
new activity
about 2 months ago
microsoft/Phi-4-multimodal-instruct:How to use it with LM Studio?
Organizations
models
None public yet
datasets
None public yet