
Showing 1–29 of 29 results for author: Yim, M

  1. arXiv:2407.07885  [pdf, other]

    cs.RO cs.LG

    Learning In-Hand Translation Using Tactile Skin With Shear and Normal Force Sensing

    Authors: Jessica Yin, Haozhi Qi, Jitendra Malik, James Pikul, Mark Yim, Tess Hellebrekers

    Abstract: Recent progress in reinforcement learning (RL) and tactile sensing has significantly advanced dexterous manipulation. However, these methods often utilize simplified tactile signals due to the gap between tactile simulation and the real world. We introduce a sensor model for tactile skin that enables zero-shot sim-to-real transfer of ternary shear and binary normal forces. Using this model, we dev…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Website: https://jessicayin.github.io/tactile-skin-rl/

  2. arXiv:2405.14144  [pdf, other]

    cs.RO eess.SY

    A Single Motor Nano Aerial Vehicle with Novel Peer-to-Peer Communication and Sensing Mechanism

    Authors: Jingxian Wang, Andrew G. Curtis, Mark Yim, Michael Rubenstein

    Abstract: Communication and position sensing are among the most important capabilities for swarm robots to interact with their peers and perform tasks collaboratively. However, the hardware required to facilitate communication and position sensing is often too complicated, expensive, and bulky to be carried on swarm robots. Here we present Maneuverable Piccolissimo 3 (MP3), a minimalist, single motor drone…

    Submitted 3 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2405.00260  [pdf, other]

    cs.CV

    CREPE: Coordinate-Aware End-to-End Document Parser

    Authors: Yamato Okamoto, Youngmin Baek, Geewook Kim, Ryota Nakao, DongHyun Kim, Moon Bin Yim, Seunghyun Park, Bado Lee

    Abstract: In this study, we formulate an OCR-free sequence generation model for visual document understanding (VDU). Our model not only parses text from document images but also extracts the spatial coordinates of the text based on the multi-head architecture. Named as Coordinate-aware End-to-end Document Parser (CREPE), our method uniquely integrates these capabilities by introducing a special token for OC…

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: Accepted at the International Conference on Document Analysis and Recognition (ICDAR 2024) main conference

  4. arXiv:2404.19205  [pdf, other]

    cs.CV cs.AI

    TableVQA-Bench: A Visual Question Answering Benchmark on Multiple Table Domains

    Authors: Yoonsik Kim, Moonbin Yim, Ka Yeon Song

    Abstract: In this paper, we establish a benchmark for table visual question answering, referred to as the TableVQA-Bench, derived from pre-existing table question-answering (QA) and table structure recognition datasets. It is important to note that existing datasets have not incorporated images or QA pairs, which are two crucial components of TableVQA. As such, the primary objective of this paper is to obta…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Technical Report

  5. arXiv:2404.02265  [pdf, other]

    cs.RO

    Continuous Sculpting: Persistent Swarm Shape Formation Adaptable to Local Environmental Changes

    Authors: Andrew G. Curtis, Mark Yim, Michael Rubenstein

    Abstract: Despite their growing popularity, swarms of robots remain limited by the operating time of each individual. We present algorithms which allow a human to sculpt a swarm of robots into a shape that persists in space perpetually, independent of onboard energy constraints such as batteries. Robots generate a path through a shape such that robots cycle in and out of the shape. Robots inside the shape r…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 20 pages, 17 figures

  6. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  7. arXiv:2211.16611  [pdf, other]

    cs.RO

    Holonomic Control of Arbitrary Configurations of Docked Modboats

    Authors: Zhijie Qiao, Gedaliah Knizhnik, Mark Yim

    Abstract: The Modboat is a low-cost, underactuated, modular robot capable of surface swimming, docking to other modules, and undocking from them using only a single motor and two passive flippers. Undocking is achieved by causing intentional self-collision between the tails of neighboring modules in certain configurations; this becomes a challenge, however, when collective swimming as one connected componen…

    Submitted 29 November, 2022; originally announced November 2022.

  8. arXiv:2211.07480  [pdf, other]

    cs.RO

    Electroadhesive Clutches for Programmable Shape Morphing of Soft Actuators

    Authors: Gregory M. Campbell, Jessica Yin, Yuyang Song, Umesh Gandhi, Mark Yim, James Pikul

    Abstract: Soft robotic actuators are safe and adaptable devices with inherent compliance, which makes them attractive for manipulating delicate and complex objects. Researchers have integrated stiff materials into soft actuators to increase their force capacity and direct their deformation. However, these embedded materials have largely been pre-prescribed and static, which constrains the actuators to a pre…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: This work was presented at IEEE International Conference on Intelligent Robots and Systems (IROS) 2022

  9. arXiv:2211.03256  [pdf, other]

    cs.CV cs.AI cs.LG

    On Web-based Visual Corpus Construction for Visual Document Understanding

    Authors: Donghyun Kim, Teakgyu Hong, Moonbin Yim, Yoonsik Kim, Geewook Kim

    Abstract: In recent years, research on visual document understanding (VDU) has grown significantly, with a particular emphasis on the development of self-supervised learning methods. However, one of the significant challenges faced in this field is the limited availability of publicly accessible visual corpora or extensive collections of images with detailed text annotations, particularly for non-Latin or r…

    Submitted 2 May, 2023; v1 submitted 6 November, 2022; originally announced November 2022.

    Comments: Accepted at ICDAR2023

  10. arXiv:2209.04000  [pdf, other]

    cs.RO

    Collective Control for Arbitrary Configurations of Docked Modboats

    Authors: Gedaliah Knizhnik, Mark Yim

    Abstract: The Modboat is a low-cost, underactuated, modular robot capable of surface swimming, docking to other modules, and undocking from them using only a single motor and two passive flippers. Undocking is achieved by causing intentional self-collision between the tails of neighboring modules in certain configurations; this becomes a challenge, however, when collective swimming as one connected componen…

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: 11 pages. Submitted for consideration in the IEEE Transactions on Robotics (T-RO)

  11. arXiv:2204.08586  [pdf, other]

    cs.RO

    Multimodal Proximity and Visuotactile Sensing With a Selectively Transmissive Soft Membrane

    Authors: Jessica Yin, Gregory M. Campbell, James Pikul, Mark Yim

    Abstract: The most common sensing modalities found in a robot perception system are vision and touch, which together can provide global and highly localized data for manipulation. However, these sensing modalities often fail to adequately capture the behavior of target objects during the critical moments as they transition out of static, controlled contact with an end-effector to dynamic and uncontrolled mo…

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: Accepted to IEEE International Conference on Soft Robotics (RoboSoft) 2022

  12. Amplitude Control for Parallel Lattices of Docked Modboats

    Authors: Gedaliah Knizhnik, Mark Yim

    Abstract: The Modboat is a low-cost, underactuated, modular robot capable of surface swimming. It is able to swim individually, dock to other Modboats, and undock from them using only a single motor and two passive flippers. Undocking without additional actuation is achieved by causing intentional self-collision between the tails of neighboring modules; this becomes a challenge when group swimming as one co…

    Submitted 21 July, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: 7 pages. Accepted to the 2022 International Conference on Robotics and Automation (ICRA)

  13. A Low-Cost, Highly Customizable Solution for Position Estimation in Modular Robots

    Authors: Chao Liu, Tarik Tosun, Mark Yim

    Abstract: Accurate position sensing is important for state estimation and control in robotics. Reliable and accurate position sensors are usually expensive and difficult to customize. Incorporating them into systems that have very tight volume constraints such as modular robots is particularly difficult. PaintPots are low-cost, reliable, and highly customizable position sensors, but their performance is hi…

    Submitted 10 January, 2022; originally announced January 2022.

    Comments: 10 pages, 28 figures

    Journal ref: ASME. J. Mechanisms Robotics. December 2021; 13(6): 061004

  14. arXiv:2111.15664  [pdf, other]

    cs.LG cs.AI

    OCR-free Document Understanding Transformer

    Authors: Geewook Kim, Teakgyu Hong, Moonbin Yim, Jeongyeon Nam, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park

    Abstract: Understanding document images (e.g., invoices) is a core but challenging task since it requires complex functions such as reading text and a holistic understanding of the document. Current Visual Document Understanding (VDU) methods outsource the task of reading text to off-the-shelf Optical Character Recognition (OCR) engines and focus on the understanding task with the OCR outputs. Although such…

    Submitted 6 October, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

    Comments: ECCV 2022. (v5) update table 2 and figures; add LayoutLM and update scores with the latest test script at https://github.com/clovaai/donut

  15. arXiv:2109.15278  [pdf, other]

    cs.RO

    Coverage Control in Multi-Robot Systems via Graph Neural Networks

    Authors: Walker Gosrich, Siddharth Mayya, Rebecca Li, James Paulos, Mark Yim, Alejandro Ribeiro, Vijay Kumar

    Abstract: This paper develops a decentralized approach to mobile sensor coverage by a multi-robot system. We consider a scenario where a team of robots with limited sensing range must position itself to effectively detect events of interest in a region characterized by areas of varying importance. Towards this end, we develop a decentralized control policy for the robots -- realized via a Graph Neural Netwo…

    Submitted 30 September, 2021; originally announced September 2021.

  16. arXiv:2109.00662  [pdf, other]

    cs.RO

    Quori: A Community-Informed Design of a Socially Interactive Humanoid Robot

    Authors: Andrew Specian, Ross Mead, Simon Kim, Maja Matarić, Mark Yim

    Abstract: Hardware platforms for socially interactive robotics can be limited by cost or lack of functionality. This paper presents the overall system -- design, hardware, and software -- for Quori, a novel, affordable, socially interactive humanoid robot platform for facilitating non-contact human-robot interaction (HRI) research. The design of the system is motivated by feedback sampled from the HRI resea…

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: 20 pages, 21 figures. Accepted for publication in the IEEE Transactions on Robotics

  17. Motion Planning for Variable Topology Trusses: Reconfiguration and Locomotion

    Authors: Chao Liu, Sencheng Yu, Mark Yim

    Abstract: Truss robots are highly redundant parallel robotic systems that can be applied in a variety of scenarios. The variable topology truss (VTT) is a class of modular truss robots. As self-reconfigurable modular robots, a VTT is composed of many edge modules that can be rearranged into various structures depending on the task. These robots change their shape by not only controlling joint positions as w…

    Submitted 24 September, 2023; v1 submitted 31 July, 2021; originally announced August 2021.

    Comments: 20 pages, 36 figures

    Journal ref: IEEE Transactions on Robotics, vol. 39, no. 3, pp. 2020-2039, June 2023

  18. Thrust Direction Control of an Underactuated Oscillating Swimming Robot

    Authors: Gedaliah Knizhnik, Mark Yim

    Abstract: The Modboat is an autonomous surface robot that turns the oscillation of a single motor into a controlled paddling motion through passive flippers. Inertial control methods developed in prior work can successfully drive the Modboat along trajectories and enable docking to neighboring modules, but have a non-constant cycle time and cannot react to dynamic environments. In this work we present a thr…

    Submitted 3 February, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: 6 pages. Published in and presented at the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems

  19. arXiv:2107.11041  [pdf, other]

    cs.CV

    RewriteNet: Reliable Scene Text Editing with Implicit Decomposition of Text Contents and Styles

    Authors: Junyeop Lee, Yoonsik Kim, Seonghyeon Kim, Moonbin Yim, Seung Shin, Gayoung Lee, Sungrae Park

    Abstract: Scene text editing (STE), which converts a text in a scene image into the desired text while preserving an original style, is a challenging task due to a complex intervention between text and style. In this paper, we propose a novel STE model, referred to as RewriteNet, that decomposes text images into content and style features and re-writes a text in the original image. Specifically, RewriteNet…

    Submitted 2 May, 2022; v1 submitted 23 July, 2021; originally announced July 2021.

    Comments: CVPRW 2022 - AI for Content Creation Workshop

  20. arXiv:2107.09313  [pdf, other]

    cs.CV

    SynthTIGER: Synthetic Text Image GEneratoR Towards Better Text Recognition Models

    Authors: Moonbin Yim, Yoonsik Kim, Han-Cheol Cho, Sungrae Park

    Abstract: For successful scene text recognition (STR) models, synthetic text image generators have alleviated the lack of annotated text images from the real world. Specifically, they generate multiple text images with diverse backgrounds, font styles, and text shapes and enable STR models to learn visual patterns that might not be accessible from manually annotated data. In this paper, we introduce a new s…

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: Accepted at ICDAR 2021, 16 pages, 6 figures

  21. A Quadratic Programming Approach to Manipulation in Real-Time Using Modular Robots

    Authors: Chao Liu, Mark Yim

    Abstract: Motion planning in high-dimensional space is a challenging task. In order to perform dexterous manipulation in an unstructured environment, a robot with many degrees of freedom is usually necessary, which also complicates its motion planning problem. Real-time control brings about more difficulties in which robots have to maintain the stability while moving towards the target. Redundant systems ar…

    Submitted 31 July, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

    Comments: 25 pages, 20 figures

    Journal ref: International Journal of Robotic Computing, Vol. 3, No. 1, (2021) 121-145

  22. SMORES-EP, a Modular Robot with Parallel Self-assembly

    Authors: Chao Liu, Qian Lin, Hyun Kim, Mark Yim

    Abstract: Self-assembly of modular robotic systems enables the construction of complex robotic configurations to adapt to different tasks. This paper presents a framework for SMORES types of modular robots to efficiently self-assemble into tree topologies. These modular robots form kinematic chains that have been shown to be capable of a large variety of manipulation and locomotion tasks, yet they can recon…

    Submitted 2 January, 2023; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: 18 pages, 20 figures. Auton Robot (2022)

    Journal ref: Autonomous Robots volume 47, pages 211-228 (2023)

  23. Docking and Undocking a Modular Underactuated Oscillating Swimming Robot

    Authors: Gedaliah Knizhnik, Mark Yim

    Abstract: We describe a docking mechanism and strategy to allow modular self-assembly for the Modboat: an inexpensive underactuated oscillating swimming robot powered by a single motor. Because propulsion is achieved through oscillation, orientation can be controlled only in the average; this complicates docking, which requires precise position and orientation control. Given these challenges, we present a d…

    Submitted 28 October, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: 6 pages. Submitted to the 2021 IEEE International Conference on Robotics and Automation (ICRA)

  24. arXiv:2101.05201  [pdf, other]

    eess.SP cs.LG stat.ML

    Optimisation of Spectral Wavelets for Persistence-based Graph Classification

    Authors: Ka Man Yim, Jacob Leygonie

    Abstract: A graph's spectral wavelet signature determines a filtration, and consequently an associated set of extended persistence diagrams. We propose a framework that optimises the choice of wavelet for a dataset of graphs, such that their associated persistence diagrams capture features of the graphs that are best suited to a given data science problem. Since the spectral wavelet signature of a graph is…

    Submitted 1 March, 2021; v1 submitted 10 January, 2021; originally announced January 2021.

  25. Design and Experiments with a Low-Cost Single-Motor Modular Aquatic Robot

    Authors: Gedaliah Knizhnik, Mark Yim

    Abstract: We present a novel design for a low-cost robotic boat powered by a single actuator, useful for both modular and swarming applications. The boat uses the conservation of angular momentum and passive flippers to convert the motion of a single motor into an adjustable paddling motion for propulsion and steering. We develop design criteria for modularity and swarming and present a prototype implementi…

    Submitted 5 May, 2020; v1 submitted 5 February, 2020; originally announced February 2020.

    Comments: Accepted to the International Conference on Ubiquitous Robots (UR 2020). 8 pages

  26. arXiv:1812.04190  [pdf, other]

    cs.RO

    Optimal Structure Synthesis for Environment Augmenting Robots

    Authors: Tarik Tosun, Cynthia Sung, Colin McCloskey, Mark Yim

    Abstract: Building structures can allow a robot to surmount large obstacles, expanding the set of areas it can reach. This paper presents a planning algorithm to automatically determine what structures a construction-capable robot must build in order to traverse its entire environment. Given an environment, a set of building blocks, and a robot capable of building structures, we seek an optimal set of struct…

    Submitted 10 December, 2018; originally announced December 2018.

  27. Accomplishing High-Level Tasks with Modular Robots

    Authors: Gangyuan Jing, Tarik Tosun, Mark Yim, Hadas Kress-Gazit

    Abstract: The advantage of modular self-reconfigurable robot systems is their flexibility, but this advantage can only be realized if appropriate configurations (shapes) and behaviors (controlling programs) can be selected for a given task. In this paper, we present an integrated system for addressing high-level tasks with modular robots, and demonstrate that it is capable of accomplishing challenging, mult…

    Submitted 1 May, 2018; v1 submitted 6 December, 2017; originally announced December 2017.

    Comments: Published in Autonomous Robots, 2018. 18 pages

  28. arXiv:1710.01840  [pdf, other]

    cs.RO

    Perception-Informed Autonomous Environment Augmentation With Modular Robots

    Authors: Tarik Tosun, Jonathan Daudelin, Gangyuan Jing, Hadas Kress-Gazit, Mark Campbell, Mark Yim

    Abstract: We present a system enabling a modular robot to autonomously build structures in order to accomplish high-level tasks. Building structures allows the robot to surmount large obstacles, expanding the set of tasks it can perform. This addresses a common weakness of modular robot systems, which often struggle to traverse large obstacles. This paper presents the hardware, perception, and planning to…

    Submitted 1 March, 2018; v1 submitted 4 October, 2017; originally announced October 2017.

    Comments: 2018 IEEE International Conference on Robotics and Automation (ICRA). 7 pages

  29. An Integrated System for Perception-Driven Autonomy with Modular Robots

    Authors: Jonathan Daudelin, Gangyuan Jing, Tarik Tosun, Mark Yim, Hadas Kress-Gazit, Mark Campbell

    Abstract: The theoretical ability of modular robots to reconfigure in response to complex tasks in a priori unknown environments has frequently been cited as an advantage and remains a major motivator for work in the field. We present a modular robot system capable of autonomously completing high-level tasks by reactively reconfiguring to meet the needs of a perceived, a priori unknown environment. The syst…

    Submitted 13 December, 2018; v1 submitted 15 September, 2017; originally announced September 2017.

    Comments: Published article available at: http://robotics.sciencemag.org/cgi/content/full/3/23/eaat4983?ijkey=iBq7yW7Z8vmjE&keytype=ref&siteid=robotics

    Journal ref: Science Robotics 31 Oct 2018. Vol. 3, Issue 23, eaat4983