Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 3 items • Updated about 1 hour ago • 51
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse Paper • 2503.16365 • Published 7 days ago • 34
JARVIS-VLA-v1 Collection Vision-Language-Action Models in Minecraft. • 4 items • Updated 5 days ago • 9
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published 13 days ago • 75
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM • 15 days ago • 348
Quality Estimation Collection SOTA Machine Translation Quality Estimation models • 5 items • Updated Jan 10, 2024 • 5
OLMoE (January 2025) Collection Improved OLMoE for iOS app. Read more: https://allenai.org/blog/olmoe-app • 10 items • Updated 14 days ago • 11
Step-Audio Collection Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated Feb 17 • 30
Moshi: a speech-text foundation model for real-time dialogue Paper • 2410.00037 • Published Sep 17, 2024 • 4
Hibiki fr-en Collection Hibiki is a model for streaming speech translation, which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated Feb 6 • 52
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 214
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated Feb 20 • 251