Thinking Machines Lab, led by a group of prominent former OpenAI researchers, is betting that fine-tuning cutting-edge models ...
Thanks to everyone who attended our AI Agenda Live event in New York yesterday! It was incredible to get to meet so many ...
Abstract: Navigating in a crowded social environment without collisions or freezing is a crucial and challenging task. Recent studies have demonstrated considerable success using Deep Reinforcement ...
RULER (Relative Universal LLM-Elicited Rewards) eliminates the need for hand-crafted reward functions by using an LLM-as-judge to automatically score agent trajectories. Simply define your task in the ...
Abstract: We present the design, implementation, and evaluation of Ceilbot, a ceiling-mounted robot for efficient and accurate RFID localization. Unlike previous robotic RFID localization systems, ...