Travis King

travisking

AI & ML interests

have you heard of generative AI?

Recent Activity

upvoted an article about 14 hours ago
AI Agents Are Here. What Now?
liked a dataset about 14 hours ago
FreedomIntelligence/medical-o1-verifiable-problem
liked a model about 20 hours ago
UsefulSensors/moonshine-base

Organizations

None yet

travisking's activity

upvoted an article about 14 hours ago
Article

AI Agents Are Here. What Now?

• 48
reacted to merve's post with ❤️ about 21 hours ago
Post
1025
Everything that happened this week in open AI, a recap 🤠 merve/jan-17-releases-678a673a9de4a4675f215bf5

👀 Multimodal
- MiniCPM-o 2.6 is a new sota any-to-any model by OpenBMB (vision, speech and text!)
- VideoChat-Flash-Qwen2.5-2B is a new video multimodal model by OpenGVLab; the series comes in sizes 2B & 7B and resolutions 224 & 448
- ByteDance released a larger SA2VA that comes with 26B parameters
- Dataset: VRC-Bench is a new diverse benchmark for multimodal LLM reasoning performance

💬 LLMs
- MiniMax-Text-01 is a new huge language model (456B total, 45.9B active params) by MiniMaxAI with a context length of 4M tokens 🤯
- Dataset: Sky-T1-data-17k is a diverse dataset used to train Sky-T1-32B
- kyutai released Helium-1-Preview-2B, a new small multilingual LM
- Wayfarer-12B is a new LLM able to write D&D 🧙🏻‍♂️
- ReaderLM-v2 is a new HTML parsing model by Jina AI

- Dria released Dria-Agent-a-3B, a new agentic coding model (Pythonic function calling) based on Qwen2.5 Coder
- Unsloth released Phi-4, faster and memory efficient Llama 3.3

šŸ–¼ļø Vision
- MatchAnything is a new foundation model for matching
- FitDit is a high-fidelity VTON model based on DiT architecture

šŸ—£ļø Audio
- OuteTTS-0.3-1B is a new multilingual text-to-speech model with voice cloning and emotion control capabilities

📖 Retrieval
- lightblue released LB-reranker-0.5B-v1.0, a new reranker based on Qwen2.5 that can handle 95+ languages
- cde-small-v2 is a new sota small retrieval model by @jxm
upvoted an article 2 days ago
Article

Train 400x faster Static Embedding Models with Sentence Transformers

• 103
New activity in ByteDance/Sa2VA-4B 9 days ago

license for 4b

#2 opened 9 days ago by
travisking
New activity in bullerwins/DeepSeek-V3-GGUF 11 days ago

quantization request

4
#1 opened 16 days ago by
KT313