Skip to main content

Showing 1–31 of 31 results for author: Kim, D H

  1. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  2. arXiv:2403.08187  [pdf, other

    cs.CL cs.SD eess.AS

    Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean children

    Authors: Taekyung Ahn, Yeonjung Hong, Younggon Im, Do Hyung Kim, Dayoung Kang, Joo Won Jeong, Jae Won Kim, Min Jung Kim, Ah-ra Cho, Dae-Hyun Jang, Hosung Nam

    Abstract: This study presents a model of automatic speech recognition (ASR) designed to diagnose pronunciation issues in children with speech sound disorders (SSDs) to replace manual transcriptions in clinical procedures. Since ASR models trained for general purposes primarily predict input speech into real words, employing a well-known high-performance ASR model for evaluating pronunciation in children wit… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 12 pages, 2 figures

    ACM Class: I.2.7

  3. AINeedsPlanner: A Workbook to Support Effective Collaboration Between AI Experts and Clients

    Authors: Dae Hyun Kim, Hyungyu Shin, Shakhnozakhon Yadgarova, Jinho Son, Hariharan Subramonyam, Juho Kim

    Abstract: Clients often partner with AI experts to develop AI applications tailored to their needs. In these partnerships, careful planning and clear communication are critical, as inaccurate or incomplete specifications can result in misaligned model characteristics, expensive reworks, and potential friction between collaborators. Unfortunately, given the complexity of requirements ranging from functionali… ▽ More

    Submitted 26 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: To appear in DIS 2024

  4. Natural Language Dataset Generation Framework for Visualizations Powered by Large Language Models

    Authors: Hyung-Kwon Ko, Hyeon Jeon, Gwanmo Park, Dae Hyun Kim, Nam Wook Kim, Juho Kim, Jinwook Seo

    Abstract: We introduce VL2NL, a Large Language Model (LLM) framework that generates rich and diverse NL datasets using only Vega-Lite specifications as input, thereby streamlining the development of Natural Language Interfaces (NLIs) for data visualization. To synthesize relevant chart semantics accurately and enhance syntactic diversity in each NL dataset, we leverage 1) a guided discovery incorporated int… ▽ More

    Submitted 21 January, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 22 pages, 5 figures

    Journal ref: In Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11-16, 2024, Honolulu, HI, USA

  5. arXiv:2308.11568  [pdf, other

    cs.CV

    SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation Modulation

    Authors: Guhnoo Yun, Juhan Yoo, Kijung Kim, Jeongho Lee, Dong Hwan Kim

    Abstract: Recent studies show that self-attentions behave like low-pass filters (as opposed to convolutions) and enhancing their high-pass filtering capability improves model performance. Contrary to this idea, we investigate existing convolution-based models with spectral analysis and observe that improving the low-pass filtering in convolution operations also leads to performance improvement. To account f… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted paper at ICCV 2023

  6. arXiv:2308.07593  [pdf, other

    cs.CV cs.MM eess.AS eess.IV

    AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model

    Authors: Jeong Hun Yeo, Minsu Kim, Jeongsoo Choi, Dae Hoe Kim, Yong Man Ro

    Abstract: Visual Speech Recognition (VSR) is the task of predicting spoken words from silent lip movements. VSR is regarded as a challenging task because of the insufficient information on lip movements. In this paper, we propose an Audio Knowledge empowered Visual Speech Recognition framework (AKVSR) to complement the insufficient speech information of visual modality by using audio modality. Different fro… ▽ More

    Submitted 11 January, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Accepted by IEEE Transactions on Multimedia

  7. EmphasisChecker: A Tool for Guiding Chart and Caption Emphasis

    Authors: Dae Hyun Kim, Seulgi Choi, Juho Kim, Vidya Setlur, Maneesh Agrawala

    Abstract: Recent work has shown that when both the chart and caption emphasize the same aspects of the data, readers tend to remember the doubly-emphasized features as takeaways; when there is a mismatch, readers rely on the chart to form takeaways and can miss information in the caption text. Through a survey of 280 chart-caption pairs in real-world sources (e.g., news media, poll reports, government repor… ▽ More

    Submitted 20 January, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: IEEE VIS 2023

    Journal ref: IEEE Transactions on Visualization and Computer Graphics, vol. 30, no. 1, pp. 120-130, 2024

  8. arXiv:2209.00862  [pdf, other

    cs.CR cs.AI

    Spatio-Temporal Attack Course-of-Action (COA) Search Learning for Scalable and Time-Varying Networks

    Authors: Haemin Lee, Seok Bin Son, Won Joon Yun, Joongheon Kim, Soyi Jung, Dong Hwa Kim

    Abstract: One of the key topics in network security research is the autonomous COA (Couse-of-Action) attack search method. Traditional COA attack search methods that passively search for attacks can be difficult, especially as the network gets bigger. To address these issues, new autonomous COA techniques are being developed, and among them, an intelligent spatial algorithm is designed in this paper for eff… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

  9. arXiv:2201.08605  [pdf, other

    cs.NI

    Seamless and Energy Efficient Maritime Coverage in Coordinated 6G Space-Air-Sea Non-Terrestrial Networks

    Authors: Sheikh Salman Hassan, Do Hyeon Kim, Yan Kyaw Tun, Nguyen H. Tran, Walid Saad, Choong Seon Hong

    Abstract: Non-terrestrial networks (NTNs), which integrate space and aerial networks with terrestrial systems, are a key area in the emerging sixth-generation (6G) wireless networks. As part of 6G, NTNs must provide pervasive connectivity to a wide range of devices, including smartphones, vehicles, sensors, robots, and maritime users. However, due to the high mobility and deployment of NTNs, managing the sp… ▽ More

    Submitted 21 January, 2022; originally announced January 2022.

  10. arXiv:2112.01049  [pdf, other

    cs.LG cs.AI

    Bayesian Optimization over Permutation Spaces

    Authors: Aryan Deshwal, Syrine Belakaria, Janardhan Rao Doppa, Dae Hyun Kim

    Abstract: Optimizing expensive to evaluate black-box functions over an input space consisting of all permutations of d objects is an important problem with many real-world applications. For example, placement of functional blocks in hardware design to optimize performance via simulations. The overall goal is to minimize the number of function evaluations to find high-performing permutations. The key challen… ▽ More

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: Accepted at AAAI 2022

  11. arXiv:2109.07995  [pdf, other

    cs.RO

    Towards Defensive Autonomous Driving: Collecting and Probing Driving Demonstrations of Mixed Qualities

    Authors: Jeongwoo Oh, Gunmin Lee, Jeongeun Park, Wooseok Oh, Jaeseok Heo, Hojun Chung, Do Hyung Kim, Byungkyu Park, Chang-Gun Lee, Sungjoon Choi, Songhwai Oh

    Abstract: Designing or learning an autonomous driving policy is undoubtedly a challenging task as the policy has to maintain its safety in all corner cases. In order to secure safety in autonomous driving, the ability to detect hazardous situations, which can be seen as an out-of-distribution (OOD) detection problem, becomes crucial. However, most conventional datasets only provide expert driving demonstrat… ▽ More

    Submitted 18 September, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: 6 pages, 6 figures, 3 tables

  12. Ruin Theory for User Association and Energy Optimization in Multi-access Edge Computing

    Authors: Do Hyeon Kim, Aunas Manzoor, Madyan Alsenwi, Yan Kyaw Tun, Walid Saad, Choong Seon Hong

    Abstract: In this correspondence, a novel framework is proposed for analyzing data offloading in a multi-access edge computing system. Specifically, a two-phase algorithm, is proposed, including two key phases: 1) user association phase and 2) task offloading phase. In the first phase, a ruin theory-based approach is developed to obtain the users association considering the users' transmission reliability a… ▽ More

    Submitted 21 April, 2023; v1 submitted 2 July, 2021; originally announced July 2021.

    Comments: Accepted Article By IEEE Transactions on Vehicular Technology, DOI: https://doi.org/10.1109/TVT.2023.3269427 (In Press)

  13. Towards Understanding How Readers Integrate Charts and Captions: A Case Study with Line Charts

    Authors: Dae Hyun Kim, Vidya Setlur, Maneesh Agrawala

    Abstract: Charts often contain visually prominent features that draw attention to aspects of the data and include text captions that emphasize aspects of the data. Through a crowdsourced study, we explore how readers gather takeaways when considering charts and captions together. We first ask participants to mark visually prominent regions in a set of line charts. We then generate text captions based on the… ▽ More

    Submitted 20 January, 2021; originally announced January 2021.

    Comments: To appear at CHI 2021

  14. HeM3D: Heterogeneous Manycore Architecture Based on Monolithic 3D Vertical Integration

    Authors: Aqeeb Iqbal Arka, Biresh Kumar Joardar, Ryan Gary Kim, Dae Hyun Kim, Janardhan Rao Doppa, Partha Pratim Pande

    Abstract: Heterogeneous manycore architectures are the key to efficiently execute compute- and data-intensive applications. Through silicon via (TSV)-based 3D manycore system is a promising solution in this direction as it enables integration of disparate computing cores on a single system. However, the achievable performance of conventional through-silicon-via (TSV)-based 3D systems is ultimately bottlenec… ▽ More

    Submitted 7 December, 2020; v1 submitted 30 November, 2020; originally announced December 2020.

    Comments: This work has been accepted in ACM Transactions on Design Automation of Electronic Systems

    ACM Class: C.2

  15. arXiv:2008.10148  [pdf, other

    cs.HC cs.AI cs.LG eess.SY

    Drive Safe: Cognitive-Behavioral Mining for Intelligent Transportation Cyber-Physical System

    Authors: Md. Shirajum Munir, Sarder Fakhrul Abedin, Ki Tae Kim, Do Hyeon Kim, Md. Golam Rabiul Alam, Choong Seon Hong

    Abstract: This paper presents a cognitive behavioral-based driver mood repairment platform in intelligent transportation cyber-physical systems (IT-CPS) for road safety. In particular, we propose a driving safety platform for distracted drivers, namely \emph{drive safe}, in IT-CPS. The proposed platform recognizes the distracting activities of the drivers as well as their emotions for mood repair. Further,… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

    Comments: Submitted to IEEE Transactions on Intelligent Transportation Systems, Special Issue on Technologies for risk mitigation and support of impaired drivers

  16. arXiv:2008.05772  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    CycleMorph: Cycle Consistent Unsupervised Deformable Image Registration

    Authors: Boah Kim, Dong Hwan Kim, Seong Ho Park, Jieun Kim, June-Goo Lee, Jong Chul Ye

    Abstract: Image registration is a fundamental task in medical image analysis. Recently, deep learning based image registration methods have been extensively investigated due to their excellent performance despite the ultra-fast computational time. However, the existing deep learning methods still have limitation in the preservation of original topology during the deformation with registration vector fields.… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

  17. arXiv:2008.03718  [pdf, other

    cs.CV

    1-Point RANSAC-Based Method for Ground Object Pose Estimation

    Authors: Jeong-Kyun Lee, Young-Ki Baik, Hankyu Cho, Kang Kim, Duck Hoon Kim

    Abstract: Solving Perspective-n-Point (PnP) problems is a traditional way of estimating object poses. Given outlier-contaminated data, a pose of an object is calculated with PnP algorithms of n = {3, 4} in the RANSAC-based scheme. However, the computational complexity considerably increases along with n and the high complexity imposes a severe strain on devices which should estimate multiple object poses in… ▽ More

    Submitted 10 June, 2021; v1 submitted 9 August, 2020; originally announced August 2020.

    Comments: Accepted in the workshop on Autonomous Driving: Perception, Prediction and Planning in conjunction with CVPR 2021

  18. Controlling the Outbreak of COVID-19: A Noncooperative Game Perspective

    Authors: Anupam Kumar Bairagi, Mehedi Masud, Do Hyeon Kim, Md. Shirajum Munir, Abdullah Al Nahid, Sarder Fakhrul Abedin, Kazi Masudul Alam, Sujit Biswas, Sultan S Alshamrani, Zhu Han, Choong Seon Hong

    Abstract: COVID-19 is a global epidemic. Till now, there is no remedy for this epidemic. However, isolation and social distancing are seemed to be effective preventive measures to control this pandemic. Therefore, in this paper, an optimization problem is formulated that accommodates both isolation and social distancing features of the individuals. To promote social distancing, we solve the formulated probl… ▽ More

    Submitted 26 November, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: Accepted article by IEEE Access. DOI: 10.1109/ACCESS.2020.3040821

  19. arXiv:2006.14380  [pdf, other

    cs.CV eess.IV

    Deep Convolutional GANs for Car Image Generation

    Authors: Dong Hui Kim

    Abstract: In this paper, we investigate the application of deep convolutional GANs on car image generation. We improve upon the commonly used DCGAN architecture by implementing Wasserstein loss to decrease mode collapse and introducing dropout at the end of the discrimiantor to introduce stochasticity. Furthermore, we introduce convolutional layers at the end of the generator to improve expressiveness and s… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: 6 pages, 8 figures

  20. arXiv:2005.08630  [pdf, other

    cs.CV cs.LG

    End-to-End Lane Marker Detection via Row-wise Classification

    Authors: Seungwoo Yoo, Heeseok Lee, Heesoo Myeong, Sungrack Yun, Hyoungwoo Park, Janghoon Cho, Duck Hoon Kim

    Abstract: In autonomous driving, detecting reliable and accurate lane marker positions is a crucial yet challenging task. The conventional approaches for the lane marker detection problem perform a pixel-level dense prediction task followed by sophisticated post-processing that is inevitable since lane markers are typically represented by a collection of line segments without thickness. In this paper, we pr… ▽ More

    Submitted 6 May, 2020; originally announced May 2020.

  21. arXiv:1908.11024  [pdf, other

    cs.CV

    Metric-based Regularization and Temporal Ensemble for Multi-task Learning using Heterogeneous Unsupervised Tasks

    Authors: Dae Ha Kim, Seung Hyun Lee, Byung Cheol Song

    Abstract: One of the ways to improve the performance of a target task is to learn the transfer of abundant knowledge of a pre-trained network. However, learning of the pre-trained network requires high computation capability and large-scale labeled dataset. To mitigate the burden of large-scale labeling, learning in un/self-supervised manner can be a solution. In addition, using unsupervised multi-task lear… ▽ More

    Submitted 28 August, 2019; originally announced August 2019.

    Comments: 11 pages. To Appear in the IEEE International Conference on Computer Vision Workshops (ICCVW) 2019

  22. arXiv:1907.03956  [pdf, other

    cs.RO

    Planning for target retrieval using a robotic manipulator in cluttered and occluded environments

    Authors: Changjoo Nam, Jinhwi Lee, Younggil Cho, Jeongho Lee, Dong Hwan Kim, ChangHwan Kim

    Abstract: This paper presents planning algorithms for a robotic manipulator with a fixed base in order to grasp a target object in cluttered environments. We consider a configuration of objects in a confined space with a high density so no collision-free path to the target exists. The robot must relocate some objects to retrieve the target while avoiding collisions. For fast completion of the retrieval task… ▽ More

    Submitted 8 July, 2019; originally announced July 2019.

    Comments: 8 pages, 14 figures

  23. arXiv:1907.01319  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Unsupervised Deformable Image Registration Using Cycle-Consistent CNN

    Authors: Boah Kim, Jieun Kim, June-Goo Lee, Dong Hwan Kim, Seong Ho Park, Jong Chul Ye

    Abstract: Medical image registration is one of the key processing steps for biomedical image analysis such as cancer diagnosis. Recently, deep learning based supervised and unsupervised image registration methods have been extensively studied due to its excellent performance in spite of ultra-fast computational time compared to the classical approaches. In this paper, we present a novel unsupervised medical… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.

    Comments: accepted for MICCAI 2019

  24. arXiv:1906.04293  [pdf

    cs.ET cs.NI

    Inter-Tier Process Variation-Aware Monolithic 3D NoC Architectures

    Authors: Shouvik Musavvir, Anwesha Chatterjee, Ryan Gary Kim, Dae Hyun Kim, Partha Pratim Pande

    Abstract: Monolithic 3D (M3D) technology enables high density integration, performance, and energy-efficiency by sequentially stacking tiers on top of each other. M3D-based network-on-chip (NoC) architectures can exploit these benefits by adopting tier partitioning for intra-router stages. However, conventional fabrication methods are infeasible for M3D-enabled designs due to temperature related issues. Thi… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: Submitted to IEEE TVLSI (Under Review)

  25. arXiv:1810.02069  [pdf, other

    cs.LG stat.ML

    Finding Solutions to Generative Adversarial Privacy

    Authors: Dae Hyun Kim, Taeyoung Kong, Seungbin Jeong

    Abstract: We present heuristics for solving the maximin problem induced by the generative adversarial privacy setting for linear and convolutional neural network (CNN) adversaries. In the linear adversary setting, we present a greedy algorithm for approximating the optimal solution for the privatizer, which performs better as the number of instances increases. We also provide an analysis of the algorithm to… ▽ More

    Submitted 4 October, 2018; originally announced October 2018.

  26. arXiv:1807.06819  [pdf, other

    cs.LG cs.CV stat.ML

    Self-supervised Knowledge Distillation Using Singular Value Decomposition

    Authors: Seung Hyun Lee, Dae Ha Kim, Byung Cheol Song

    Abstract: To solve deep neural network (DNN)'s huge training dataset and its high computation issue, so-called teacher-student (T-S) DNN which transfers the knowledge of T-DNN to S-DNN has been proposed. However, the existing T-S-DNN has limited range of use, and the knowledge of T-DNN is insufficiently transferred to S-DNN. To improve the quality of the transferred knowledge from T-DNN, we propose a new kn… ▽ More

    Submitted 18 July, 2018; originally announced July 2018.

    Comments: accepted to ECCV 2018

  27. arXiv:1805.09956  [pdf, other

    cs.DS

    Improved Approximation for Node-Disjoint Paths in Grids with Sources on the Boundary

    Authors: Julia Chuzhoy, David H. K. Kim, Rachit Nimavat

    Abstract: We study the classical Node-Disjoint Paths (NDP) problem: given an undirected $n$-vertex graph G, together with a set {(s_1,t_1),...,(s_k,t_k)} of pairs of its vertices, called source-destination, or demand pairs, find a maximum-cardinality set of mutually node-disjoint paths that connect the demand pairs. The best current approximation for the problem is achieved by a simple greedy $O(\sqrt{n})$-… ▽ More

    Submitted 24 May, 2018; originally announced May 2018.

    Comments: To appear in the proceedings of ICALP 2018

    ACM Class: F.2.2

  28. arXiv:1711.01980  [pdf, other

    cs.DS

    Almost Polynomial Hardness of Node-Disjoint Paths in Grids

    Authors: Julia Chuzhoy, David H. K. Kim, Rachit Nimavat

    Abstract: In the classical Node-Disjoint Paths (NDP) problem, we are given an $n$-vertex graph $G=(V,E)$, and a collection $M=\{(s_1,t_1),\ldots,(s_k,t_k)\}$ of pairs of its vertices, called source-destination, or demand pairs. The goal is to route as many of the demand pairs as possible, where to route a pair we need to select a path connecting it, so that all selected paths are disjoint in their vertices.… ▽ More

    Submitted 6 November, 2017; originally announced November 2017.

  29. arXiv:1611.05429  [pdf, other

    cs.DS

    New Hardness Results for Routing on Disjoint Paths

    Authors: Julia Chuzhoy, David H. K. Kim, Rachit Nimavat

    Abstract: In the classical Node-Disjoint Paths (NDP) problem, the input consists of an undirected $n$-vertex graph $G$, and a collection $\mathcal{M}=\{(s_1,t_1),\ldots,(s_k,t_k)\}$ of pairs of its vertices, called source-destination, or demand, pairs. The goal is to route the largest possible number of the demand pairs via node-disjoint paths. The best current approximation for the problem is achieved by a… ▽ More

    Submitted 16 November, 2016; originally announced November 2016.

  30. arXiv:1603.05520  [pdf, other

    cs.DS

    Improved Approximation for Node-Disjoint Paths in Planar Graphs

    Authors: Julia Chuzhoy, David H. K. Kim, Shi Li

    Abstract: We study the classical Node-Disjoint Paths (NDP) problem: given an $n$-vertex graph $G$ and a collection $M=\{(s_1,t_1),\ldots,(s_k,t_k)\}$ of pairs of vertices of $G$ called demand pairs, find a maximum-cardinality set of node-disjoint paths connecting the demand pairs. NDP is one of the most basic routing problems, that has been studied extensively. Despite this, there are still wide gaps in our… ▽ More

    Submitted 17 March, 2016; originally announced March 2016.

  31. arXiv:1404.4416  [pdf, ps, other

    cs.CC math.DS

    A characterization of eventually periodicity

    Authors: Teturo Kamae, Dong Han Kim

    Abstract: In this article, we show that the Kamae-Xue complexity function for an infinite sequence classifies eventual periodicity completely. We prove that an infinite binary word $x_1x_2 \cdots $ is eventually periodic if and only if $Σ(x_1x_2\cdots x_n)/n^3$ has a positive limit, where $Σ(x_1x_2\cdots x_n)$ is the sum of the squares of all the numbers of appearance of finite words in… ▽ More

    Submitted 16 April, 2014; originally announced April 2014.

    Comments: 11 pages

    MSC Class: 68R15