Börje Karlsson

tellarin

https://tellarin.com/borje/

AI & ML interests

Machine Learning Systems, Mobile Sensing, Knowledge Mining, Digital Entertainment

Recent Activity

authored a paper 19 days ago

Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills

tellarin's activity

upvoted a paper 17 days ago

Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control

Paper • 2503.14492 • Published 18 days ago • 17

upvoted a paper 18 days ago

GenDec: A robust generative Question-decomposition method for Multi-hop reasoning

Paper • 2402.11166 • Published Feb 17, 2024 • 1

upvoted 5 papers 19 days ago

MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research

Paper • 2503.13399 • Published 19 days ago • 20

Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills

Paper • 2503.12533 • Published 20 days ago • 61

upvoted a collection 24 days ago

Cosmos

Collection

The collection of Cosmos models • 31 items • Updated 2 days ago • 279

upvoted 2 papers 24 days ago

Pixel-Level Reasoning Segmentation via Multi-turn Conversations

Paper • 2502.09447 • Published Feb 13 • 1

"Principal Components" Enable A New Language of Images

Paper • 2503.08685 • Published 25 days ago • 11

upvoted a paper 25 days ago

Video Action Differencing

Paper • 2503.07860 • Published 26 days ago • 31

upvoted a collection 25 days ago

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark S

Collection

SEACrowd is a community movement project aimed at centralizing and standardizing AI resources for Southeast Asian languages, cultures, and/or regions. • 3 items • Updated Jun 18, 2024 • 8

upvoted a paper 25 days ago

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published 26 days ago • 95

upvoted a collection 26 days ago

SEA-VL: Multicultural VL Dataset for Southeast Asia

Collection

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia • 3 items • Updated 24 days ago • 16

upvoted 6 papers 26 days ago

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Paper • 2411.19799 • Published Nov 29, 2024 • 14

Multi-Level Knowledge Distillation for Out-of-Distribution Detection in Text

Paper • 2211.11300 • Published Nov 21, 2022 • 1

PE3R: Perception-Efficient 3D Reconstruction

Paper • 2503.07507 • Published 26 days ago • 10

FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data

Paper • 2501.17144 • Published Jan 28 • 6

Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning

Paper • 2503.07002 • Published 27 days ago • 38

TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Paper • 2503.05638 • Published 29 days ago • 18