Skip to main content

Showing 1–8 of 8 results for author: Lo, Y L

  1. arXiv:2407.07875  [pdf, other

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Generative Image as Action Models

    Authors: Mohit Shridhar, Yat Long Lo, Stephen James

    Abstract: Image-generation diffusion models have been fine-tuned to unlock new capabilities such as image-editing and novel view synthesis. Can we similarly unlock image-generation models for visuomotor control? We present GENIMA, a behavior-cloning agent that fine-tunes Stable Diffusion to 'draw joint-actions' as targets on RGB images. These images are fed into a controller that maps the visual targets int… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Project website, code, checkpoints: https://genima-robot.github.io/

  2. arXiv:2307.14267  [pdf, other

    cs.CY cs.AI

    Improving International Climate Policy via Mutually Conditional Binding Commitments

    Authors: Jobst Heitzig, Jörg Oechssler, Christoph Pröschel, Niranjana Ragavan, Yat Long Lo

    Abstract: The Paris Agreement, considered a significant milestone in climate negotiations, has faced challenges in effectively addressing climate change due to the unconditional nature of most Nationally Determined Contributions (NDCs). This has resulted in a prevalence of free-riding behavior among major polluters and a lack of concrete conditionality in NDCs. To address this issue, we propose the implemen… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: Presented at AI For Global Climate Cooperation Competition, 2023 (arXiv:cs/2307.06951)

    Report number: AI4GCC/2023/track2/3

  3. arXiv:2307.01403  [pdf, other

    cs.AI cs.LG

    Learning Multi-Agent Communication with Contrastive Learning

    Authors: Yat Long Lo, Biswa Sengupta, Jakob Foerster, Michael Noukhovitch

    Abstract: Communication is a powerful tool for coordination in multi-agent RL. But inducing an effective, common language is a difficult challenge, particularly in the decentralized setting. In this work, we introduce an alternative perspective where communicative messages sent between agents are considered as different incomplete views of the environment state. By examining the relationship between message… ▽ More

    Submitted 1 February, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: The 12th International Conference on Learning Representations (ICLR)

  4. arXiv:2303.10733  [pdf, other

    cs.AI cs.MA

    Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning

    Authors: Yat Long Lo, Christian Schroeder de Witt, Samuel Sokota, Jakob Nicolaus Foerster, Shimon Whiteson

    Abstract: By enabling agents to communicate, recent cooperative multi-agent reinforcement learning (MARL) methods have demonstrated better task performance and more coordinated behavior. Most existing approaches facilitate inter-agent communication by allowing agents to send messages to each other through free communication channels, i.e., cheap talk channels. Current methods require these channels to be co… ▽ More

    Submitted 19 March, 2023; originally announced March 2023.

    Comments: The 11th International Conference on Learning Representations (ICLR)

  5. arXiv:2203.03344  [pdf, other

    cs.AI

    Learning to Ground Decentralized Multi-Agent Communication with Contrastive Learning

    Authors: Yat Long Lo, Biswa Sengupta

    Abstract: For communication to happen successfully, a common language is required between agents to understand information communicated by one another. Inducing the emergence of a common language has been a difficult challenge to multi-agent learning systems. In this work, we introduce an alternative perspective to the communicative messages sent between agents, considering them as different incomplete view… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Journal ref: EmeCom at ICLR 2022

  6. arXiv:2003.07417  [pdf, other

    cs.LG cs.AI cs.NE

    Improving Performance in Reinforcement Learning by Breaking Generalization in Neural Networks

    Authors: Sina Ghiassian, Banafsheh Rafiee, Yat Long Lo, Adam White

    Abstract: Reinforcement learning systems require good representations to work well. For decades practical success in reinforcement learning was limited to small domains. Deep reinforcement learning systems, on the other hand, are scalable, not dependent on domain specific prior knowledge and have been successfully used to play Atari, in 3D navigation from pixels, and to control high degree of freedom robots… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: 10 pages; Accepted to AAMAS 2020

  7. arXiv:1910.13213  [pdf, other

    cs.AI cs.LG

    Overcoming Catastrophic Interference in Online Reinforcement Learning with Dynamic Self-Organizing Maps

    Authors: Yat Long Lo, Sina Ghiassian

    Abstract: Using neural networks in the reinforcement learning (RL) framework has achieved notable successes. Yet, neural networks tend to forget what they learned in the past, especially when they learn online and fully incrementally, a setting in which the weights are updated after each sample is received and the sample is then discarded. Under this setting, an update can lead to overly global generalizati… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: 9 Pages, 7 Figures, NeurIPS Workshop on Biological and Artificial Reinforcement Learning, 2019

    Journal ref: Biological and Artificial RL Workshop at NeurIPS 2019

  8. Finding by Counting: A Probabilistic Packet Count Model for Indoor Localization in BLE Environments

    Authors: Subham De, Shreyans Chowdhary, Aniket Shirke, Yat Long Lo, Robin Kravets, Hari Sundaram

    Abstract: We propose a probabilistic packet reception model for Bluetooth Low Energy (BLE) packets in indoor spaces and we validate the model by using it for indoor localization. We expect indoor localization to play an important role in indoor public spaces in the future. We model the probability of reception of a packet as a generalized quadratic function of distance, beacon power and advertising frequenc… ▽ More

    Submitted 27 August, 2017; originally announced August 2017.

    Comments: 8 pages, 6 figures, to be published in WiNTECH 2017