Showing 1–47 of 47 results for author: Francis, J

  1. arXiv:2407.06939  [pdf, other]

    cs.RO cs.CV

    Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge

    Authors: Sriram Yenamandra, Arun Ramachandran, Mukul Khanna, Karmesh Yadav, Jay Vakil, Andrew Melnik, Michael Büttner, Leon Harz, Lyon Brown, Gora Chand Nandi, Arjun PS, Gaurav Kumar Yadav, Rahul Kala, Robert Haschke, Yang Luo, Jinxin Zhu, Yansen Han, Bingyi Lu, Xuan Gu, Qinyuan Liu, Yaping Zhao, Qiting Ye, Chenxiao Dou, Yansong Chua, Volodymyr Kuzma, et al. (20 additional authors not shown)

    Abstract: In order to develop robots that can effectively serve as versatile and capable home assistants, it is crucial for them to reliably perceive and interact with a wide variety of objects across diverse environments. To this end, we proposed Open Vocabulary Mobile Manipulation as a key benchmark task for robotics: finding any object in a novel environment and placing it on any receptacle surface withi…

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2407.05910  [pdf, other]

    cs.CV cs.AI cs.RO

    Enhancing Vision-Language Models with Scene Graphs for Traffic Accident Understanding

    Authors: Aaron Lohner, Francesco Compagno, Jonathan Francis, Alessandro Oltramari

    Abstract: Recognizing a traffic accident is an essential part of any autonomous driving or road monitoring system. An accident can appear in a wide variety of forms, and understanding what type of accident is taking place may be useful to prevent it from reoccurring. The task of being able to classify a traffic scene as a specific type of accident is the focus of this work. We approach the problem by likeni…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted: 1st Workshop on Semantic Reasoning and Goal Understanding in Robotics, at the Robotics Science and Systems Conference (SemRob @ RSS 2024)

  3. arXiv:2404.10626  [pdf, other]

    cs.CV stat.AP

    Exploring selective image matching methods for zero-shot and few-sample unsupervised domain adaptation of urban canopy prediction

    Authors: John Francis, Stephen Law

    Abstract: We explore simple methods for adapting a trained multi-task UNet which predicts canopy cover and height to a new geographic setting using remotely sensed data without the need of training a domain-adaptive classifier and extensive fine-tuning. Extending previous research, we followed a selective alignment process to identify similar images in the two geographical domains and then tested an array o…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: ICLR 2024 Machine Learning for Remote Sensing (ML4RS) Workshop

  4. arXiv:2404.02359  [pdf, ps, other]

    cs.LG

    Attribution Regularization for Multimodal Paradigms

    Authors: Sahiti Yerramilli, Jayant Sravan Tamarapalli, Jonathan Francis, Eric Nyberg

    Abstract: Multimodal machine learning has gained significant attention in recent years due to its potential for integrating information from multiple modalities to enhance learning and decision-making processes. However, it is commonly observed that unimodal models outperform multimodal models, despite the latter having access to richer information. Additionally, the influence of a single modality often dom…

    Submitted 2 April, 2024; originally announced April 2024.

  5. arXiv:2404.02353  [pdf, other]

    cs.CV cs.AI cs.LG

    Semantic Augmentation in Images using Language

    Authors: Sahiti Yerramilli, Jayant Sravan Tamarapalli, Tanmay Girish Kulkarni, Jonathan Francis, Eric Nyberg

    Abstract: Deep Learning models are incredibly data-hungry and require very large labeled datasets for supervised learning. As a consequence, these models often suffer from overfitting, limiting their ability to generalize to real-world examples. Recent advancements in diffusion models have enabled the generation of photorealistic images based on textual inputs. Leveraging the substantial datasets used to tr…

    Submitted 2 April, 2024; originally announced April 2024.

  6. arXiv:2403.14712  [pdf, other]

    cs.CY cs.AI

    AI for bureaucratic productivity: Measuring the potential of AI to help automate 143 million UK government transactions

    Authors: Vincent J. Straub, Youmna Hashem, Jonathan Bright, Satyam Bhagwanani, Deborah Morgan, John Francis, Saba Esnaashari, Helen Margetts

    Abstract: There is currently considerable excitement within government about the potential of artificial intelligence to improve public service productivity through the automation of complex but repetitive bureaucratic tasks, freeing up the time of skilled staff. Here, we explore the size of this opportunity, by mapping out the scale of citizen-facing bureaucratic decision-making procedures within UK centra…

    Submitted 18 March, 2024; originally announced March 2024.

  7. arXiv:2403.13208  [pdf, other]

    cs.RO

    CaDRE: Controllable and Diverse Generation of Safety-Critical Driving Scenarios using Real-World Trajectories

    Authors: Peide Huang, Wenhao Ding, Jonathan Francis, Bingqing Chen, Ding Zhao

    Abstract: Simulation is an indispensable tool in the development and testing of autonomous vehicles (AVs), offering an efficient and safe alternative to road testing by allowing the exploration of a wide range of scenarios. Despite its advantages, a significant challenge within simulation-based testing is the generation of safety-critical scenarios, which are essential to ensure that AVs can handle rare but…

    Submitted 19 March, 2024; originally announced March 2024.

  8. arXiv:2401.12295  [pdf, other]

    cs.CL

    Cheap Learning: Maximising Performance of Language Models for Social Data Science Using Minimal Data

    Authors: Leonardo Castro-Gonzalez, Yi-Ling Chung, Hannah Rose Kirk, John Francis, Angus R. Williams, Pica Johansson, Jonathan Bright

    Abstract: The field of machine learning has recently made significant progress in reducing the requirements for labelled training data when building new models. These 'cheaper' learning techniques hold significant potential for the social sciences, where development of large labelled training datasets is often a significant practical impediment to the use of machine learning for analytical tasks. In this ar…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 39 pages, 10 figures, 6 tables

    ACM Class: I.2.7; J.4

  9. arXiv:2401.01291  [pdf, other]

    cs.CY

    Generative AI is already widespread in the public sector

    Authors: Jonathan Bright, Florence E. Enock, Saba Esnaashari, John Francis, Youmna Hashem, Deborah Morgan

    Abstract: Generative AI has the potential to transform how public services are delivered by enhancing productivity and reducing time spent on bureaucracy. Furthermore, unlike other types of artificial intelligence, it is a technology that has quickly become widely available for bottom-up adoption: essentially anyone can decide to make use of it in their day to day work. But to what extent is generative AI a…

    Submitted 2 January, 2024; originally announced January 2024.

  10. arXiv:2312.08782  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis

    Authors: Yafei Hu, Quanting Xie, Vidhi Jain, Jonathan Francis, Jay Patrikar, Nikhil Keetha, Seungchan Kim, Yaqi Xie, Tianyi Zhang, Shibo Zhao, Yu Quan Chong, Chen Wang, Katia Sycara, Matthew Johnson-Roberson, Dhruv Batra, Xiaolong Wang, Sebastian Scherer, Zsolt Kira, Fei Xia, Yonatan Bisk

    Abstract: Building general-purpose robots that can operate seamlessly, in any environment, with any object, and utilizing various skills to complete diverse tasks has been a long-standing goal in Artificial Intelligence. Unfortunately, however, most existing robotic systems have been constrained - having been designed for specific tasks, trained on specific datasets, and deployed within specific environment…

    Submitted 15 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

  11. arXiv:2310.06475  [pdf]

    cs.CY

    Approaches to the Algorithmic Allocation of Public Resources: A Cross-disciplinary Review

    Authors: Saba Esnaashari, Jonathan Bright, John Francis, Youmna Hashem, Vincent Straub, Deborah Morgan

    Abstract: Allocation of scarce resources is a recurring challenge for the public sector: something that emerges in areas as diverse as healthcare, disaster recovery, and social welfare. The complexity of these policy domains and the need for meeting multiple and sometimes conflicting criteria has led to increased focus on the use of algorithms in this type of decision. However, little engagement between res…

    Submitted 10 October, 2023; originally announced October 2023.

  12. arXiv:2309.08889  [pdf, other]

    cs.RO

    SafeShift: Safety-Informed Distribution Shifts for Robust Trajectory Prediction in Autonomous Driving

    Authors: Benjamin Stoler, Ingrid Navarro, Meghdeep Jana, Soonmin Hwang, Jonathan Francis, Jean Oh

    Abstract: As autonomous driving technology matures, the safety and robustness of its key components, including trajectory prediction, are vital. Though real-world datasets, such as Waymo Open Motion, provide realistic recorded scenarios for model development, they often lack truly safety-critical situations. Rather than utilizing unrealistic simulation or dangerous real-world testing, we instead propose a framew…

    Submitted 2 February, 2024; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: 10 pages, 5 figures, 5 tables

  13. arXiv:2309.08508  [pdf, other]

    cs.RO

    MOSAIC: Learning Unified Multi-Sensory Object Property Representations for Robot Learning via Interactive Perception

    Authors: Gyan Tatiya, Jonathan Francis, Ho-Hsiang Wu, Yonatan Bisk, Jivko Sinapov

    Abstract: A holistic understanding of object properties across diverse sensory modalities (e.g., visual, audio, and haptic) is essential for tasks ranging from object categorization to complex manipulation. Drawing inspiration from cognitive science studies that emphasize the significance of multi-sensory integration in human perception, we introduce MOSAIC (Multimodal Object property learning with Self-Att…

    Submitted 22 February, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted to the 2024 IEEE International Conference on Robotics and Automation (ICRA), May 13 to 17, 2024; Yokohama, Japan

  14. arXiv:2307.09636  [pdf, other]

    cs.CV cs.AI

    Traffic-Domain Video Question Answering with Automatic Captioning

    Authors: Ehsan Qasemi, Jonathan M. Francis, Alessandro Oltramari

    Abstract: Video Question Answering (VidQA) exhibits remarkable potential in facilitating advanced machine reasoning capabilities within the domains of Intelligent Traffic Monitoring and Intelligent Transportation Systems. Nevertheless, the integration of urban traffic scene knowledge into VidQA systems has received limited attention in previous research endeavors. In this work, we present a novel approach t…

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: Accepted in ITSC2023

  15. arXiv:2306.15864  [pdf, other]

    cs.RO

    What Went Wrong? Closing the Sim-to-Real Gap via Differentiable Causal Discovery

    Authors: Peide Huang, Xilun Zhang, Ziang Cao, Shiqi Liu, Mengdi Xu, Wenhao Ding, Jonathan Francis, Bingqing Chen, Ding Zhao

    Abstract: Training control policies in simulation is more appealing than on real robots directly, as it allows for exploring diverse states in an efficient manner. Yet, robot simulators inevitably exhibit disparities from the real-world dynamics, yielding inaccuracies that manifest as the dynamical simulation-to-reality (sim-to-real) gap. Existing literature has proposed to close this gap by activel…

    Submitted 19 October, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

  16. arXiv:2306.02520  [pdf, other]

    cs.CL cs.AI cs.LG

    A Study of Situational Reasoning for Traffic Understanding

    Authors: Jiarui Zhang, Filip Ilievski, Kaixin Ma, Aravinda Kollaa, Jonathan Francis, Alessandro Oltramari

    Abstract: Intelligent Traffic Monitoring (ITMo) technologies hold the potential for improving road safety/security and for enabling smart city infrastructure. Understanding traffic situations requires a complex fusion of perceptual information with domain-specific and causal commonsense knowledge. Whereas prior work has provided benchmarks and methods for traffic monitoring, it remains unclear whether model…

    Submitted 15 July, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

    Comments: 11 pages, 6 figures, 5 tables, camera ready version of SIGKDD 2023

  17. arXiv:2305.05091  [pdf, other]

    cs.CL cs.AI cs.HC

    Knowledge-enhanced Agents for Interactive Text Games

    Authors: Prateek Chhikara, Jiarui Zhang, Filip Ilievski, Jonathan Francis, Kaixin Ma

    Abstract: Communication via natural language is a key aspect of machine intelligence, and it requires computational models to learn and reason about world concepts, with varying levels of supervision. Significant progress has been made on fully-supervised non-interactive tasks, such as question-answering and procedural text understanding. Yet, various sequential interactive tasks, as in text-based games, ha…

    Submitted 16 December, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Published at K-CAP '23

  18. arXiv:2305.00131  [pdf, other]

    cs.CV

    Regularizing Self-training for Unsupervised Domain Adaptation via Structural Constraints

    Authors: Rajshekhar Das, Jonathan Francis, Sanket Vaibhav Mehta, Jean Oh, Emma Strubell, Jose Moura

    Abstract: Self-training based on pseudo-labels has emerged as a dominant approach for addressing conditional distribution shifts in unsupervised domain adaptation (UDA) for semantic segmentation problems. A notable drawback, however, is that this family of approaches is susceptible to erroneous pseudo labels that arise from confirmation biases in the source domain and that manifest as nuisance factors in th…

    Submitted 28 April, 2023; originally announced May 2023.

  19. arXiv:2304.02738  [pdf, other]

    cs.RO cs.AI cs.CL cs.CV cs.HC

    Core Challenges in Embodied Vision-Language Planning

    Authors: Jonathan Francis, Nariaki Kitamura, Felix Labelle, Xiaopeng Lu, Ingrid Navarro, Jean Oh

    Abstract: Recent advances in the areas of Multimodal Machine Learning and Artificial Intelligence (AI) have led to the development of challenging tasks at the intersection of Computer Vision, Natural Language Processing, and Robotics. Whereas many approaches and previous survey pursuits have characterised one or two of these dimensions, there has not been a holistic analysis at the center of all three. More…

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Extended Abstract accepted to the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023); special journal track for authors of published JAIR 2022 and AIJ 2022 papers. 6 pages, 2 figures. arXiv admin note: substantial text overlap with arXiv:2106.13948

  20. arXiv:2303.14007  [pdf]

    cs.CY cs.AI cs.HC

    'Team-in-the-loop': Ostrom's IAD framework 'rules in use' to map and measure contextual impacts of AI

    Authors: Deborah Morgan, Youmna Hashem, John Francis, Saba Esnaashari, Vincent J. Straub, Jonathan Bright

    Abstract: This article explores how the 'rules in use' from Ostrom's Institutional Analysis and Development Framework (IAD) can be developed as a context analysis approach for AI. AI risk assessment frameworks increasingly highlight the need to understand existing contexts. However, these approaches do not frequently connect with established institutional analysis scholarship. We outline a novel direction i…

    Submitted 30 June, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: 19 pages

  21. arXiv:2303.10106  [pdf]

    cs.CY cs.AI

    A multidomain relational framework to guide institutional AI research and adoption

    Authors: Vincent J. Straub, Deborah Morgan, Youmna Hashem, John Francis, Saba Esnaashari, Jonathan Bright

    Abstract: Calls for new metrics, technical standards and governance mechanisms to guide the adoption of Artificial Intelligence (AI) in institutions and public administration are now commonplace. Yet, most research and policy efforts aimed at understanding the implications of adopting AI tend to prioritize only a handful of ideas; they do not fully connect all the different perspectives and topics that are…

    Submitted 17 July, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: 23 pages, 1 figure

  22. arXiv:2303.04023  [pdf, other]

    cs.RO

    Cross-Tool and Cross-Behavior Perceptual Knowledge Transfer for Grounded Object Recognition

    Authors: Gyan Tatiya, Jonathan Francis, Jivko Sinapov

    Abstract: Humans learn about objects via interaction and using multiple perceptions, such as vision, sound, and touch. While vision can provide information about an object's appearance, non-visual sensors, such as audio and haptics, can provide information about its intrinsic properties, such as weight, temperature, hardness, and the object's sound. Using tools to interact with objects can reveal additional…

    Submitted 15 September, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: Under review for 2024 IEEE International Conference on Robotics and Automation (ICRA), May 13 to 17, 2024, Yokohama, Japan

  23. arXiv:2212.14874  [pdf, other]

    cs.LG stat.ML

    Comparative Analysis of Clustering Techniques for Personalized Food Kit Distribution

    Authors: Jude Francis, Rowan K Baby, Jacob Abraham, Ajmal P. S

    Abstract: The Government of Kerala had increased the frequency of supply of free food kits owing to the pandemic; however, these items were static and not indicative of the personal preferences of the consumers. This paper conducts a comparative analysis of various clustering techniques on a scaled-down version of a real-world dataset obtained through a conjoint analysis-based survey. Clustering carried out…

    Submitted 30 December, 2022; originally announced December 2022.

    Comments: 7 pages, 13 figures

  24. arXiv:2212.11345  [pdf, other]

    cs.RO cs.AI cs.CV

    Knowledge-driven Scene Priors for Semantic Audio-Visual Embodied Navigation

    Authors: Gyan Tatiya, Jonathan Francis, Luca Bondi, Ingrid Navarro, Eric Nyberg, Jivko Sinapov, Jean Oh

    Abstract: Generalisation to unseen contexts remains a challenge for embodied navigation agents. In the context of semantic audio-visual navigation (SAVi) tasks, the notion of generalisation should include both generalising to unseen indoor visual scenes as well as generalising to unheard sounding objects. However, previous SAVi task definitions do not include evaluation conditions on truly novel sounding ob…

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: 19 pages, 8 figures, 9 tables

  25. arXiv:2212.08729  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Distribution-aware Goal Prediction and Conformant Model-based Planning for Safe Autonomous Driving

    Authors: Jonathan Francis, Bingqing Chen, Weiran Yao, Eric Nyberg, Jean Oh

    Abstract: The feasibility of collecting a large amount of expert demonstrations has inspired growing research interests in learning-to-drive settings, where models learn by imitating the driving behaviour from experts. However, exclusively relying on imitation can limit agents' generalisability to novel scenarios that are outside the support of the training data. In this paper, we address this challenge by…

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: Accepted: 1st Workshop on Safe Learning for Autonomous Driving, at the International Conference on Machine Learning (ICML 2022); Best Paper Award

  26. arXiv:2212.07798  [pdf, other]

    cs.CL cs.AI

    Utilizing Background Knowledge for Robust Reasoning over Traffic Situations

    Authors: Jiarui Zhang, Filip Ilievski, Aravinda Kollaa, Jonathan Francis, Kaixin Ma, Alessandro Oltramari

    Abstract: Understanding novel situations in the traffic domain requires an intricate combination of domain-specific and causal commonsense knowledge. Prior work has provided sufficient perception-based modalities for traffic monitoring; in this paper, we focus on a complementary research aspect of Intelligent Transportation: traffic understanding. We scope our study to text-based methods and datasets given…

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: Camera ready version of AAAI 2023 workshop on Knowledge-Augmented Methods for Natural Language Processing

  27. arXiv:2212.05061  [pdf]

    eess.IV cs.CV

    Estimating Chicago's tree cover and canopy height using multi-spectral satellite imagery

    Authors: John Francis, Stephen Law

    Abstract: Information on urban tree canopies is fundamental to mitigating climate change [1] as well as improving quality of life [2]. Urban tree planting initiatives face a lack of up-to-date data about the horizontal and vertical dimensions of the tree canopy in cities. We present a pipeline that utilizes LiDAR data as ground-truth and then trains a multi-task machine learning model to generate reliable e…

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: 4 pages, 4 figures, Submitted to Tackling Climate Change with Machine Learning: workshop at NeurIPS 2022

  28. Transferring Implicit Knowledge of Non-Visual Object Properties Across Heterogeneous Robot Morphologies

    Authors: Gyan Tatiya, Jonathan Francis, Jivko Sinapov

    Abstract: Humans leverage multiple sensor modalities when interacting with objects and discovering their intrinsic properties. Using the visual modality alone is insufficient for deriving intuition behind object properties (e.g., which of two boxes is heavier), making it essential to consider non-visual modalities as well, such as the tactile and auditory. Whereas robots may leverage various modalities to o…

    Submitted 6 July, 2023; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: In proceedings of the IEEE International Conference on Robotics and Automation (ICRA), May 29 - June 2, 2023, ExCeL London, UK

  29. arXiv:2208.12848  [pdf, other]

    cs.CL

    Coalescing Global and Local Information for Procedural Text Understanding

    Authors: Kaixin Ma, Filip Ilievski, Jonathan Francis, Eric Nyberg, Alessandro Oltramari

    Abstract: Procedural text understanding is a challenging language reasoning task that requires models to track entity states across the development of a narrative. A complete procedural understanding solution should combine three core aspects: local and global views of the inputs, and global view of outputs. Prior methods considered a subset of these aspects, resulting in either low precision or low recall.…

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: COLING 2022

  30. arXiv:2205.10661  [pdf, other]

    cs.CL cs.AI

    An Empirical Investigation of Commonsense Self-Supervision with Knowledge Graphs

    Authors: Jiarui Zhang, Filip Ilievski, Kaixin Ma, Jonathan Francis, Alessandro Oltramari

    Abstract: Self-supervision based on the information extracted from large knowledge graphs has been shown to improve the generalization of language models, in zero-shot evaluation on various downstream language reasoning tasks. Since these improvements are reported in aggregate, however, little is known about (i) how to select the appropriate knowledge for solid performance across tasks, (ii) how to combine…

    Submitted 21 May, 2022; originally announced May 2022.

  31. arXiv:2205.02953  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Learn-to-Race Challenge 2022: Benchmarking Safe Learning and Cross-domain Generalisation in Autonomous Racing

    Authors: Jonathan Francis, Bingqing Chen, Siddha Ganju, Sidharth Kathpal, Jyotish Poonganam, Ayush Shivani, Vrushank Vyas, Sahika Genc, Ivan Zhukov, Max Kumskoy, Anirudh Koul, Jean Oh, Eric Nyberg

    Abstract: We present the results of our autonomous racing virtual challenge, based on the newly-released Learn-to-Race (L2R) simulation framework, which seeks to encourage interdisciplinary research in autonomous driving and to help advance the state of the art on a realistic benchmark. Analogous to racing being used to test cutting-edge vehicles, we envision autonomous racing to serve as a particularly cha…

    Submitted 10 May, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: 20 pages, 4 figures, 2 tables

  32. arXiv:2202.13541  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    Pattern Based Multivariable Regression using Deep Learning (PBMR-DP)

    Authors: Jiztom Kavalakkatt Francis, Chandan Kumar, Jansel Herrera-Gerena, Kundan Kumar, Matthew J Darr

    Abstract: We propose a deep learning methodology for multivariate regression that is based on pattern recognition that triggers fast learning over sensor data. We used a conversion of sensors-to-image which enables us to take advantage of Computer Vision architectures and training processes. In addition to this data preparation methodology, we explore the use of state-of-the-art architectures to generate re…

    Submitted 9 March, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

    Comments: 7 pages, 5 figures, 3 tables

  33. Generalizable Neuro-symbolic Systems for Commonsense Question Answering

    Authors: Alessandro Oltramari, Jonathan Francis, Filip Ilievski, Kaixin Ma, Roshanak Mirzaee

    Abstract: This chapter illustrates how suitable neuro-symbolic models for language understanding can enable domain generalizability and robustness in downstream tasks. Different methods for integrating neural language models and knowledge graphs are discussed. The situations in which this combination is most appropriate are characterized, including quantitative evaluation and qualitative error analysis on a…

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: In Pascal Hitzler, Md Kamruzzaman Sarker (eds.), Neuro-Symbolic Artificial Intelligence: The State of the Art. Frontiers in Artificial Intelligence and Applications Vol. 342, IOS Press, Amsterdam, 2022. arXiv admin note: text overlap with arXiv:2003.04707

  34. arXiv:2110.07699  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Safe Autonomous Racing via Approximate Reachability on Ego-vision

    Authors: Bingqing Chen, Jonathan Francis, Jean Oh, Eric Nyberg, Sylvia L. Herbert

    Abstract: Racing demands each vehicle to drive at its physical limits, when any safety infraction could lead to catastrophic failure. In this work, we study the problem of safe reinforcement learning (RL) for autonomous racing, using the vehicle's ego-camera view and speed as input. Given the nature of the task, autonomous agents need to be able to 1) identify and avoid unsafe scenarios under the complex ve…

    Submitted 30 November, 2021; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: 17 pages, 15 figures, 3 tables

  35. arXiv:2109.02837  [pdf, other]

    cs.CL

    Exploring Strategies for Generalizable Commonsense Reasoning with Pre-trained Models

    Authors: Kaixin Ma, Filip Ilievski, Jonathan Francis, Satoru Ozaki, Eric Nyberg, Alessandro Oltramari

    Abstract: Commonsense reasoning benchmarks have been largely solved by fine-tuning language models. The downside is that fine-tuning may cause models to overfit to task-specific data and thereby forget their knowledge gained during pre-training. Recent works only propose lightweight model updates as models may already possess useful knowledge from past experience, but a challenge remains in understanding wh…

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  36. arXiv:2106.13948  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV cs.RO

    Core Challenges in Embodied Vision-Language Planning

    Authors: Jonathan Francis, Nariaki Kitamura, Felix Labelle, Xiaopeng Lu, Ingrid Navarro, Jean Oh

    Abstract: Recent advances in the areas of multimodal machine learning and artificial intelligence (AI) have led to the development of challenging tasks at the intersection of Computer Vision, Natural Language Processing, and Embodied AI. Whereas many approaches and previous survey pursuits have characterised one or two of these dimensions, there has not been a holistic analysis at the center of all three. M…

    Submitted 24 May, 2022; v1 submitted 26 June, 2021; originally announced June 2021.

    Comments: Journal of Artificial Intelligence Research 74 (2022) 459-515

  37. arXiv:2103.11575  [pdf, other]

    cs.RO cs.CV cs.LG

    Learn-to-Race: A Multimodal Control Environment for Autonomous Racing

    Authors: James Herman, Jonathan Francis, Siddha Ganju, Bingqing Chen, Anirudh Koul, Abhinav Gupta, Alexey Skabelkin, Ivan Zhukov, Max Kumskoy, Eric Nyberg

    Abstract: Existing research on autonomous driving primarily focuses on urban driving, which is insufficient for characterising the complex driving behaviour underlying high-speed racing. At the same time, existing racing simulation frameworks struggle in capturing realism, with respect to visual rendering, vehicular dynamics, and task objectives, inhibiting the transfer of learning agents to real-world cont…

    Submitted 18 August, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: Accepted to the International Conference on Computer Vision (ICCV 2021); equal contribution - JH and JF; 15 pages, 4 figures

    Journal ref: International Conference on Computer Vision (ICCV), 2021

  38. arXiv:2103.05708  [pdf, other]

    quant-ph cs.LG

    Machine Learning the period finding algorithm

    Authors: John George Francis, Anil Shaji

    Abstract: We use differentiable programming and gradient descent to find unitary matrices that can be used in the period finding algorithm to extract period information from the state of a quantum computer post application of the oracle. The standard procedure is to use the inverse quantum Fourier transform. Our findings suggest that this is not the only unitary matrix appropriate for the period findin…

    Submitted 9 March, 2021; originally announced March 2021.

    Comments: 10 pages, 10 figures

  39. arXiv:2012.10813  [pdf, other]

    cs.CL

    Lexically-constrained Text Generation through Commonsense Knowledge Extraction and Injection

    Authors: Yikang Li, Pulkit Goel, Varsha Kuppur Rajendra, Har Simrat Singh, Jonathan Francis, Kaixin Ma, Eric Nyberg, Alessandro Oltramari

    Abstract: Conditional text generation has been a challenging task that is yet to see human-level performance from state-of-the-art models. In this work, we specifically focus on the Commongen benchmark, wherein the aim is to generate a plausible sentence for a given set of input concepts. Despite advances in other tasks, large pre-trained language models that are fine-tuned on this dataset often produce sen…

    Submitted 19 December, 2020; originally announced December 2020.

    Comments: AAAI-CSKG 2021

  40. arXiv:2011.14910  [pdf, other]

    cs.CV

    Trajformer: Trajectory Prediction with Local Self-Attentive Contexts for Autonomous Driving

    Authors: Manoj Bhat, Jonathan Francis, Jean Oh

    Abstract: Effective feature-extraction is critical to models' contextual understanding, particularly for applications to robotics and autonomous driving, such as multimodal trajectory prediction. However, state-of-the-art generative methods face limitations in representing the scene context, leading to predictions of inadmissible futures. We alleviate these limitations through the use of self-attention, whi…

    Submitted 30 November, 2020; originally announced November 2020.

    Comments: Accepted: Machine Learning for Autonomous Driving @ NeurIPS 2020

  41. arXiv:2011.03863  [pdf, other]

    cs.CL cs.AI

    Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering

    Authors: Kaixin Ma, Filip Ilievski, Jonathan Francis, Yonatan Bisk, Eric Nyberg, Alessandro Oltramari

    Abstract: Recent developments in pre-trained neural language modeling have led to leaps in accuracy on commonsense question-answering benchmarks. However, there is increasing concern that models overfit to specific tasks, without learning to utilize external knowledge or perform general semantic reasoning. In contrast, zero-shot evaluations have shown promise as a more robust measure of a model's general re…

    Submitted 14 December, 2020; v1 submitted 7 November, 2020; originally announced November 2020.

    Comments: AAAI 2021

  42. Data-driven Thermal Model Inference with ARMAX, in Smart Environments, based on Normalized Mutual Information

    Authors: Zhanhong Jiang, Jonathan Francis, Anit Kumar Sahu, Sirajum Munir, Charles Shelton, Anthony Rowe, Mario Bergés

    Abstract: Understanding the models that characterize the thermal dynamics in a smart building is important for the comfort of its occupants and for its energy optimization. A significant amount of research has attempted to utilize thermodynamics (physical) models for smart building control, but these approaches remain challenging due to the stochastic nature of the intermittent environmental disturbances. T…

    Submitted 10 June, 2020; originally announced June 2020.

    Journal ref: American Control Conference (2018) 4634-4639

  43. arXiv:2004.09673  [pdf, other]

    q-bio.QM cs.LG eess.IV

    Neural Network Segmentation of Cell Ultrastructure Using Incomplete Annotation

    Authors: John Paul Francis, Hongzhi Wang, Kate White, Tanveer Syeda-Mahmood, Raymond Stevens

    Abstract: The pancreatic beta cell is an important target in diabetes research. For scalable modeling of beta cell ultrastructure, we investigate automatic segmentation of whole cell imaging data acquired through soft X-ray tomography. During the course of the study, both complete and partial ultrastructure annotations were produced manually for different subsets of the data. To more effectively use existin…

    Submitted 20 April, 2020; originally announced April 2020.

  44. arXiv:2003.04707  [pdf, other]

    cs.AI cs.CL cs.SC

    Neuro-symbolic Architectures for Context Understanding

    Authors: Alessandro Oltramari, Jonathan Francis, Cory Henson, Kaixin Ma, Ruwan Wickramarachchi

    Abstract: Computational context understanding refers to an agent's ability to fuse disparate sources of information for decision-making and is, therefore, generally regarded as a prerequisite for sophisticated machine reasoning capabilities, such as in artificial intelligence (AI). Data-driven and knowledge-driven methods are two classical techniques in the pursuit of such machine sense-making capability. H…

    Submitted 9 March, 2020; originally announced March 2020.

    Comments: In: Ilaria Tiddi, Freddy Lecue, Pascal Hitzler (eds.), Knowledge Graphs for eXplainable AI -- Foundations, Applications and Challenges. Studies on the Semantic Web, IOS Press, Amsterdam, 2020. arXiv admin note: text overlap with arXiv:1910.14087

  45. arXiv:2003.03212  [pdf, other]

    cs.CV

    Diverse and Admissible Trajectory Forecasting through Multimodal Context Understanding

    Authors: Seong Hyeon Park, Gyubok Lee, Manoj Bhat, Jimin Seo, Minseok Kang, Jonathan Francis, Ashwin R. Jadhav, Paul Pu Liang, Louis-Philippe Morency

    Abstract: Multi-agent trajectory forecasting in autonomous driving requires an agent to accurately anticipate the behaviors of the surrounding vehicles and pedestrians, for safe and reliable decision-making. Due to partial observability in these dynamical scenes, directly obtaining the posterior distribution over future agent trajectories remains a challenging problem. In realistic embodied environments, ea…

    Submitted 31 August, 2020; v1 submitted 6 March, 2020; originally announced March 2020.

    Comments: ECCV 2020

  46. arXiv:1910.14087  [pdf, other]

    cs.CL

    Towards Generalizable Neuro-Symbolic Systems for Commonsense Question Answering

    Authors: Kaixin Ma, Jonathan Francis, Quanyang Lu, Eric Nyberg, Alessandro Oltramari

    Abstract: Non-extractive commonsense QA remains a challenging AI task, as it requires systems to reason about, synthesize, and gather disparate pieces of information, in order to generate responses to queries. Recent approaches on such tasks show increased performance, only when models are either pre-trained with additional information or when domain-specific heuristics are used, without any special conside…

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: EMNLP-COIN 2019

  47. Resistive Threshold Logic

    Authors: A. P. James, L. R. V. J. Francis, D. Kumar

    Abstract: We report a resistance-based threshold logic family useful for mimicking brain-like large variable logic functions in VLSI. A universal Boolean logic cell based on an analog resistive divider and threshold logic circuit is presented. The resistive divider is implemented using memristors and provides output voltage as a summation of weighted product of input voltages. The output of resistive divide…

    Submitted 1 August, 2013; originally announced August 2013.

    Comments: Memristors, Brain inspired logic circuits. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2013