13 71 19

Börje Karlsson

tellarin

https://tellarin.com/borje/

AI & ML interests

Machine Learning Systems, Mobile Sensing, Knowledge Mining, Digital Entertainment

Recent Activity

upvoted a paper about 1 month ago

From Elements to Design: A Layered Approach for Automatic Graphic Design Composition

new activity about 2 months ago

MAmmoTH-VL/MAmmoTH-VL-Instruct-12M:Missing License in dataset and code repositories

liked a dataset about 2 months ago

MAmmoTH-VL/MAmmoTH-VL-Instruct-12M

View all activity

Organizations

tellarin's activity

upvoted a paper about 1 month ago

From Elements to Design: A Layered Approach for Automatic Graphic Design Composition

Paper • 2412.19712 • Published Dec 27, 2024 • 15

upvoted 4 papers 3 months ago

DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation

Paper • 2411.04999 • Published Nov 7, 2024 • 17

Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos

Paper • 2410.16259 • Published Oct 21, 2024 • 5

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published Oct 14, 2024 • 38

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

Paper • 2410.13232 • Published Oct 17, 2024 • 41

upvoted 5 papers 4 months ago

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Paper • 2410.02073 • Published Oct 2, 2024 • 40

WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents

Paper • 2410.07484 • Published Oct 9, 2024 • 48

Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition

Paper • 2410.05603 • Published Oct 8, 2024 • 11

Agent S: An Open Agentic Framework that Uses Computers Like a Human

Paper • 2410.08164 • Published Oct 10, 2024 • 24

Intriguing Properties of Large Language and Vision Models

Paper • 2410.04751 • Published Oct 7, 2024 • 16

upvoted a collection 4 months ago

Foundation AI Papers

Collection

Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15, 2024 • 29

upvoted 3 papers 4 months ago

MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents

Paper • 2410.03450 • Published Oct 4, 2024 • 36

CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction

Paper • 2410.01273 • Published Oct 2, 2024 • 10

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24, 2024 • 26

upvoted 3 papers 5 months ago

Open Language Data Initiative: Advancing Low-Resource Machine Translation for Karakalpak

Paper • 2409.04269 • Published Sep 6, 2024 • 9

Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments

Paper • 2409.05865 • Published Sep 9, 2024 • 14

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Paper • 2408.13257 • Published Aug 23, 2024 • 26

upvoted 2 papers 7 months ago

Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model

Paper • 2406.15275 • Published Jun 21, 2024 • 12

Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models

Paper • 2406.14035 • Published Jun 20, 2024 • 13

upvoted a paper 8 months ago

Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study

Paper • 2403.03186 • Published Mar 5, 2024 • 5