Skip to main content

Showing 1–50 of 94 results for author: Lee, P

  1. arXiv:2407.13040  [pdf, other

    cs.CL

    Turkish Delights: a Dataset on Turkish Euphemisms

    Authors: Hasan Can Biyik, Patrick Lee, Anna Feldman

    Abstract: Euphemisms are a form of figurative language relatively understudied in natural language processing. This research extends the current computational work on potentially euphemistic terms (PETs) to Turkish. We introduce the Turkish PET dataset, the first available of its kind in the field. By creating a list of euphemisms in Turkish, collecting example contexts, and annotating them, we provide both… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: In Proceedings of The First SIGTURK workshop co-located with ACL 2024: https://sigturk.github.io/workshop/

  2. The AI-DEC: A Card-based Design Method for User-centered AI Explanations

    Authors: Christine P Lee, Min Kyung Lee, Bilge Mutlu

    Abstract: Increasing evidence suggests that many deployed AI systems do not sufficiently support end-user interaction and information needs. Engaging end-users in the design of these systems can reveal user needs and expectations, yet effective ways of engaging end-users in the AI explanation design remain under-explored. To address this gap, we developed a design method, called AI-DEC, that defines four di… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Journal ref: Designing Interactive Systems Conference, 2024, (DIS '24)

  3. REX: Designing User-centered Repair and Explanations to Address Robot Failures

    Authors: Christine P Lee, Pragathi Praveena, Bilge Mutlu

    Abstract: Robots in real-world environments continuously engage with multiple users and encounter changes that lead to unexpected conflicts in fulfilling user requests. Recent technical advancements (e.g., large-language models (LLMs), program synthesis) offer various methods for automatically generating repair plans that address such conflicts. In this work, we understand how automated repair and explanati… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Journal ref: Designing Interactive Systems Conference, 2024, (DIS '24)

  4. arXiv:2403.14268  [pdf

    eess.AS cs.SD

    Speech-Aware Neural Diarization with Encoder-Decoder Attractor Guided by Attention Constraints

    Authors: PeiYing Lee, HauYun Guo, Berlin Chen

    Abstract: End-to-End Neural Diarization with Encoder-Decoder based Attractor (EEND-EDA) is an end-to-end neural model for automatic speaker segmentation and labeling. It achieves the capability to handle flexible number of speakers by estimating the number of attractors. EEND-EDA, however, struggles to accurately capture local speaker dynamics. This work proposes an auxiliary loss that aims to guide the Tra… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted to The 28th International Conference on Technologies and Applications of Artificial Intelligence (TAAI), in Chinese language

    Report number: TAAI2023-Domestic-131

  5. arXiv:2403.13589  [pdf, other

    cs.CV

    ReGround: Improving Textual and Spatial Grounding at No Cost

    Authors: Phillip Y. Lee, Minhyuk Sung

    Abstract: When an image generation process is guided by both a text prompt and spatial cues, such as a set of bounding boxes, do these elements work in harmony, or does one dominate the other? Our analysis of a pretrained image diffusion model that integrates gated self-attention into the U-Net reveals that spatial grounding often outweighs textual grounding due to the sequential flow from gated self-attent… ▽ More

    Submitted 19 July, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted to ECCV 2024. Project page: https://re-ground.github.io/

  6. arXiv:2403.03230  [pdf, other

    q-bio.NC cs.AI

    Large language models surpass human experts in predicting neuroscience results

    Authors: Xiaoliang Luo, Akilles Rechardt, Guangzhi Sun, Kevin K. Nejad, Felipe Yáñez, Bati Yilmaz, Kangjoo Lee, Alexandra O. Cohen, Valentina Borghesani, Anton Pashkov, Daniele Marinazzo, Jonathan Nicholas, Alessandro Salatiello, Ilia Sucholutsky, Pasquale Minervini, Sepehr Razavi, Roberta Rocca, Elkhan Yusifov, Tereza Okalova, Nianlong Gu, Martin Ferianc, Mikail Khona, Kaustubh R. Patil, Pui-Shee Lee, Rui Mata , et al. (14 additional authors not shown)

    Abstract: Scientific discoveries often hinge on synthesizing decades of research, a task that potentially outstrips human information processing capacities. Large language models (LLMs) offer a solution. LLMs trained on the vast scientific literature could potentially integrate noisy yet interrelated findings to forecast novel results better than human experts. To evaluate this possibility, we created Brain… ▽ More

    Submitted 21 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  7. arXiv:2402.17963  [pdf, other

    cs.DC

    The Design and Implementation of a High-Performance Log-Structured RAID System for ZNS SSDs

    Authors: Jinhong Li, Qiuping Wang, Shujie Han, Patrick P. C. Lee

    Abstract: Zoned Namespace (ZNS) defines a new abstraction for host software to flexibly manage storage in flash-based SSDs as append-only zones. It also provides a Zone Append primitive to further boost the write performance of ZNS SSDs by exploiting intra-zone parallelism. However, making Zone Append effective for reliable and scalable storage, in the form of a RAID array of multiple ZNS SSDs, is non-trivi… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 29 pages

    ACM Class: C.4; C.5.0

  8. arXiv:2402.06124  [pdf, other

    cs.HC

    Teleoscope: Exploring Themes in Large Document Sets By Example

    Authors: Paul Bucci, Leo Foord-Kelcey, Patrick Yung Kang Lee, Alamjeet Singh, Ivan Beschastnikh

    Abstract: Qualitative thematic exploration of data by hand does not scale and researchers create and update a personalized point of view as they explore data. As a result, machine learning (ML) approaches that might help with exploration are challenging to apply. We developed Teleoscope, a web-based system that supports interactive exploration of large corpora (100K-1M) of short documents (1-3 paragraphs).… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 28 pages, 9 figures, pre-print

  9. arXiv:2401.14838  [pdf, other

    cs.CV

    Multi-modality action recognition based on dual feature shift in vehicle cabin monitoring

    Authors: Dan Lin, Philip Hann Yung Lee, Yiming Li, Ruoyu Wang, Kim-Hui Yap, Bingbing Li, You Shing Ngim

    Abstract: Driver Action Recognition (DAR) is crucial in vehicle cabin monitoring systems. In real-world applications, it is common for vehicle cabins to be equipped with cameras featuring different modalities. However, multi-modality fusion strategies for the DAR task within car cabins have rarely been studied. In this paper, we propose a novel yet efficient multi-modality driver action recognition method b… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  10. arXiv:2401.14526  [pdf, other

    cs.CL

    MEDs for PETs: Multilingual Euphemism Disambiguation for Potentially Euphemistic Terms

    Authors: Patrick Lee, Alain Chirino Trujillo, Diana Cuevas Plancarte, Olumide Ebenezer Ojo, Xinyi Liu, Iyanuoluwa Shode, Yuan Zhao, Jing Peng, Anna Feldman

    Abstract: This study investigates the computational processing of euphemisms, a universal linguistic phenomenon, across multiple languages. We train a multilingual transformer model (XLM-RoBERTa) to disambiguate potentially euphemistic terms (PETs) in multilingual and cross-lingual settings. In line with current trends, we demonstrate that zero-shot learning across languages takes place. We also show cases… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  11. Design, Development, and Deployment of Context-Adaptive AI Systems for Enhanced End-User Adoption

    Authors: Christine P Lee

    Abstract: My research centers on the development of context-adaptive AI systems to improve end-user adoption through the integration of technical methods. I deploy these AI systems across various interaction modalities, including user interfaces and embodied agents like robots, to expand their practical applicability. My research unfolds in three key stages: design, development, and deployment. In the desig… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 5 pages

    Journal ref: Extended Abstracts of the CHI Conference on Human Factors in Computing Systems (CHI EA '24), May 11--16, 2024, Honolulu, HI, USA

  12. Understanding Large-Language Model (LLM)-powered Human-Robot Interaction

    Authors: Callie Y. Kim, Christine P. Lee, Bilge Mutlu

    Abstract: Large-language models (LLMs) hold significant promise in improving human-robot interaction, offering advanced conversational skills and versatility in managing diverse, open-ended user requests in various tasks and domains. Despite the potential to transform human-robot interaction, very little is known about the distinctive design requirements for utilizing LLMs in robots, which may differ from t… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: 10 pages, 4 figures. Callie Y. Kim and Christine P. Lee contributed equally to the work. To be published in Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction (HRI '24), March 11--14, 2024, Boulder, CO, USA

  13. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  14. arXiv:2312.00083  [pdf, other

    cs.CV cs.LG

    BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos

    Authors: Pilhyeon Lee, Hyeran Byun

    Abstract: Temporal sentence grounding aims to localize moments relevant to a language description. Recently, DETR-like approaches achieved notable progress by predicting the center and length of a target moment. However, they suffer from the issue of center misalignment raised by the inherent ambiguity of moment centers, leading to inaccurate predictions. To remedy this problem, we propose a novel boundary-… ▽ More

    Submitted 18 July, 2024; v1 submitted 30 November, 2023; originally announced December 2023.

    Comments: Accepted by ECCV 2024

  15. arXiv:2311.13319  [pdf, other

    eess.IV cs.CV cs.LG

    Deep Learning for Vascular Segmentation and Applications in Phase Contrast Tomography Imaging

    Authors: Ekin Yagis, Shahab Aslani, Yashvardhan Jain, Yang Zhou, Shahrokh Rahmani, Joseph Brunet, Alexandre Bellier, Christopher Werlein, Maximilian Ackermann, Danny Jonigk, Paul Tafforeau, Peter D Lee, Claire Walsh

    Abstract: Automated blood vessel segmentation is vital for biomedical imaging, as vessel changes indicate many pathologies. Still, precise segmentation is difficult due to the complexity of vascular structures, anatomical variations across patients, the scarcity of annotated public datasets, and the quality of images. We present a thorough literature review, highlighting the state of machine learning techni… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  16. arXiv:2311.00993  [pdf, other

    cs.LG

    Scalable Probabilistic Forecasting in Retail with Gradient Boosted Trees: A Practitioner's Approach

    Authors: Xueying Long, Quang Bui, Grady Oktavian, Daniel F. Schmidt, Christoph Bergmeir, Rakshitha Godahewa, Seong Per Lee, Kaifeng Zhao, Paul Condylis

    Abstract: The recent M5 competition has advanced the state-of-the-art in retail forecasting. However, we notice important differences between the competition challenge and the challenges we face in a large e-commerce company. The datasets in our scenario are larger (hundreds of thousands of time series), and e-commerce can afford to have a larger assortment than brick-and-mortar retailers, leading to more i… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  17. Development and validation of an interpretable machine learning-based calculator for predicting 5-year weight trajectories after bariatric surgery: a multinational retrospective cohort SOPHIA study

    Authors: Patrick Saux, Pierre Bauvin, Violeta Raverdy, Julien Teigny, Hélène Verkindt, Tomy Soumphonphakdy, Maxence Debert, Anne Jacobs, Daan Jacobs, Valerie Monpellier, Phong Ching Lee, Chin Hong Lim, Johanna C Andersson-Assarsson, Lena Carlsson, Per-Arne Svensson, Florence Galtier, Guelareh Dezfoulian, Mihaela Moldovanu, Severine Andrieux, Julien Couster, Marie Lepage, Erminia Lembo, Ornella Verrastro, Maud Robert, Paulina Salminen , et al. (9 additional authors not shown)

    Abstract: Background Weight loss trajectories after bariatric surgery vary widely between individuals, and predicting weight loss before the operation remains challenging. We aimed to develop a model using machine learning to provide individual preoperative prediction of 5-year weight loss trajectories after surgery. Methods In this multinational retrospective observational study we enrolled adult participa… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: The Lancet Digital Health, 2023

  18. arXiv:2308.10554  [pdf, other

    cs.CV

    Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations

    Authors: Seogkyu Jeon, Bei Liu, Pilhyeon Lee, Kibeom Hong, Jianlong Fu, Hyeran Byun

    Abstract: Training deep generative models usually requires a large amount of data. To alleviate the data collection cost, the task of zero-shot GAN adaptation aims to reuse well-trained generators to synthesize images of an unseen target domain without any further training samples. Due to the data absence, the textual description of the target domain and the vision-language models, e.g., CLIP, are utilized… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023 (poster)

  19. arXiv:2308.03417  [pdf, other

    cs.CR cs.LG

    PURL: Safe and Effective Sanitization of Link Decoration

    Authors: Shaoor Munir, Patrick Lee, Umar Iqbal, Zubair Shafiq, Sandra Siby

    Abstract: While privacy-focused browsers have taken steps to block third-party cookies and mitigate browser fingerprinting, novel tracking techniques that can bypass existing countermeasures continue to emerge. Since trackers need to share information from the client-side to the server-side through link decoration regardless of the tracking technique they employ, a promising orthogonal approach is to detect… ▽ More

    Submitted 6 March, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

  20. arXiv:2308.01887  [pdf, other

    cs.CL

    Athena 2.0: Discourse and User Modeling in Open Domain Dialogue

    Authors: Omkar Patil, Lena Reed, Kevin K. Bowden, Juraj Juraska, Wen Cui, Vrindavan Harrison, Rishi Rajasekaran, Angela Ramirez, Cecilia Li, Eduardo Zamora, Phillip Lee, Jeshwanth Bheemanpally, Rohan Pandey, Adwait Ratnaparkhi, Marilyn Walker

    Abstract: Conversational agents are consistently growing in popularity and many people interact with them every day. While many conversational agents act as personal assistants, they can have many different goals. Some are task-oriented, such as providing customer support for a bank or making a reservation. Others are designed to be empathetic and to form emotional connections with the user. The Alexa Prize… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: Alexa Prize Proceedings, 2021. Socialbot Grand Challenge 4

  21. arXiv:2307.09724  [pdf, other

    cs.CV

    AesPA-Net: Aesthetic Pattern-Aware Style Transfer Networks

    Authors: Kibeom Hong, Seogkyu Jeon, Junsoo Lee, Namhyuk Ahn, Kunhee Kim, Pilhyeon Lee, Daesik Kim, Youngjung Uh, Hyeran Byun

    Abstract: To deliver the artistic expression of the target style, recent studies exploit the attention mechanism owing to its ability to map the local patches of the style image to the corresponding patches of the content image. However, because of the low semantic correspondence between arbitrary content and artworks, the attention module repeatedly abuses specific local patches from the style image, resul… ▽ More

    Submitted 8 August, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV 2023. Code is available at this https://github.com/Kibeom-Hong/AesPA-Net

  22. arXiv:2306.09626  [pdf, other

    cs.CV

    PAtt-Lite: Lightweight Patch and Attention MobileNet for Challenging Facial Expression Recognition

    Authors: Jia Le Ngwe, Kian Ming Lim, Chin Poo Lee, Thian Song Ong

    Abstract: Facial Expression Recognition (FER) is a machine learning problem that deals with recognizing human facial expressions. While existing work has achieved performance improvements in recent years, FER in the wild and under challenging conditions remains a challenge. In this paper, a lightweight patch and attention network based on MobileNetV1, referred to as PAtt-Lite, is proposed to improve FER per… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  23. arXiv:2306.00217  [pdf, other

    cs.CL

    FEED PETs: Further Experimentation and Expansion on the Disambiguation of Potentially Euphemistic Terms

    Authors: Patrick Lee, Iyanuoluwa Shode, Alain Chirino Trujillo, Yuan Zhao, Olumide Ebenezer Ojo, Diana Cuevas Plancarte, Anna Feldman, Jing Peng

    Abstract: Transformers have been shown to work well for the task of English euphemism disambiguation, in which a potentially euphemistic term (PET) is classified as euphemistic or non-euphemistic in a particular context. In this study, we expand on the task in two ways. First, we annotate PETs for vagueness, a linguistic property associated with euphemisms, and find that transformers are generally better at… ▽ More

    Submitted 6 June, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

  24. arXiv:2304.13678  [pdf, other

    cs.CV

    A marker-less human motion analysis system for motion-based biomarker discovery in knee disorders

    Authors: Kai Armstrong, Lei Zhang, Yan Wen, Alexander P. Willmott, Paul Lee, Xujioing Ye

    Abstract: In recent years the NHS has been having increased difficulty seeing all low-risk patients, this includes but not limited to suspected osteoarthritis (OA) patients. To help address the increased waiting lists and shortages of staff, we propose a novel method of automated biomarker identification for diagnosis of knee disorders and the monitoring of treatment progression. The proposed method allows… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 11 pages, 5 figures

  25. arXiv:2303.17285  [pdf, other

    cs.CV

    Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection

    Authors: Pilhyeon Lee, Taeoh Kim, Minho Shim, Dongyoon Wee, Hyeran Byun

    Abstract: Temporal action detection aims to predict the time intervals and the classes of action instances in the video. Despite the promising performance, existing two-stream models exhibit slow inference speed due to their reliance on computationally expensive optical flow. In this paper, we introduce a decomposed cross-modal distillation framework to build a strong RGB-based detector by transferring know… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023

  26. arXiv:2303.12712  [pdf, other

    cs.CL cs.AI

    Sparks of Artificial General Intelligence: Early experiments with GPT-4

    Authors: Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang

    Abstract: Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4, was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an earl… ▽ More

    Submitted 13 April, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  27. arXiv:2301.08448  [pdf, other

    eess.SP cs.AI cs.CV cs.LG

    Source-free Subject Adaptation for EEG-based Visual Recognition

    Authors: Pilhyeon Lee, Seogkyu Jeon, Sunhee Hwang, Minjung Shin, Hyeran Byun

    Abstract: This paper focuses on subject adaptation for EEG-based visual recognition. It aims at building a visual stimuli recognition system customized for the target subject whose EEG samples are limited, by transferring knowledge from abundant data of source subjects. Existing approaches consider the scenario that samples of source subjects are accessible during training. However, it is often infeasible a… ▽ More

    Submitted 20 January, 2023; originally announced January 2023.

    Comments: Accepted by the 11th IEEE International Winter Conference on Brain-Computer Interface (BCI 2023). Code is available at https://github.com/DeepBCI/Deep-BCI

  28. arXiv:2211.13327  [pdf, other

    cs.CL cs.AI

    A Report on the Euphemisms Detection Shared Task

    Authors: Patrick Lee, Anna Feldman, Jing Peng

    Abstract: This paper presents The Shared Task on Euphemism Detection for the Third Workshop on Figurative Language Processing (FigLang 2022) held in conjunction with EMNLP 2022. Participants were invited to investigate the euphemism detection task: given input text, identify whether it contains a euphemism. The input data is a corpus of sentences containing potentially euphemistic terms (PETs) collected fro… ▽ More

    Submitted 3 December, 2022; v1 submitted 23 November, 2022; originally announced November 2022.

  29. arXiv:2210.13576  [pdf, ps, other

    cs.SD eess.AS

    Spectral Clustering-aware Learning of Embeddings for Speaker Diarisation

    Authors: Evonne P. C. Lee, Guangzhi Sun, Chao Zhang, Philip C. Woodland

    Abstract: In speaker diarisation, speaker embedding extraction models often suffer from the mismatch between their training loss functions and the speaker clustering method. In this paper, we propose the method of spectral clustering-aware learning of embeddings (SCALE) to address the mismatch. Specifically, besides an angular prototype cal (AP) loss, SCALE uses a novel affinity matrix loss which directly m… ▽ More

    Submitted 14 March, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: To appear in ICASSP 2023, 5 pages

  30. Exploiting Shape Cues for Weakly Supervised Semantic Segmentation

    Authors: Sungpil Kho, Pilhyeon Lee, Wonyoung Lee, Minsong Ki, Hyeran Byun

    Abstract: Weakly supervised semantic segmentation (WSSS) aims to produce pixel-wise class predictions with only image-level labels for training. To this end, previous methods adopt the common pipeline: they generate pseudo masks from class activation maps (CAMs) and use such masks to supervise segmentation networks. However, it is challenging to derive comprehensive pseudo masks that cover the whole extent… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: Accepted by Pattern Recognition. The first two authors contributed equally

    Journal ref: Pattern Recognition 132 (2022): 108953

  31. Towards Visualization of Time-Series Ecological Momentary Assessment (EMA) Data on Standalone Voice-First Virtual Assistants

    Authors: Yichen Han, Christopher Bo Han, Chen Chen, Peng Wei Lee, Michael Hogarth, Alison A. Moore, Nadir Weibel, Emilia Farcas

    Abstract: Population aging is an increasingly important consideration for health care in the 21th century, and continuing to have access and interact with digital health information is a key challenge for aging populations. Voice-based Intelligent Virtual Assistants (IVAs) are promising to improve the Quality of Life (QoL) of older adults, and coupled with Ecological Momentary Assessments (EMA) they can be… ▽ More

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: 4 pages, The 24th International ACM SIGACCESS Conference on Computers and Accessibility

    ACM Class: K.4.2; K.6.m; J.3

  32. Exploiting Domain Transferability for Collaborative Inter-level Domain Adaptive Object Detection

    Authors: Mirae Do, Seogkyu Jeon, Pilhyeon Lee, Kibeom Hong, Yu-seung Ma, Hyeran Byun

    Abstract: Domain adaptation for object detection (DAOD) has recently drawn much attention owing to its capability of detecting target objects without any annotations. To tackle the problem, previous works focus on aligning features extracted from partial levels (e.g., image-level, instance-level, RPN-level) in a two-stage detector via adversarial training. However, individual levels in the object detection… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted to Expert Systems with Applications. The first three authors contributed equally

    Journal ref: Expert Systems with Applications 205 (2022): 117697

  33. arXiv:2207.05138  [pdf, other

    eess.SY cs.AI eess.SP

    Towards Personalized Healthcare in Cardiac Population: The Development of a Wearable ECG Monitoring System, an ECG Lossy Compression Schema, and a ResNet-Based AF Detector

    Authors: Wei-Ying Yi, Peng-Fei Liu, Sheung-Lai Lo, Ya-Fen Chan, Yu Zhou, Yee Leung, Kam-Sang Woo, Alex Pui-Wai Lee, Jia-Min Chen, Kwong-Sak Leung

    Abstract: Cardiovascular diseases (CVDs) are the number one cause of death worldwide. While there is growing evidence that the atrial fibrillation (AF) has strong associations with various CVDs, this heart arrhythmia is usually diagnosed using electrocardiography (ECG) which is a risk-free, non-intrusive, and cost-efficient tool. Continuously and remotely monitoring the subjects' ECG information unlocks the… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

  34. arXiv:2206.12980  [pdf

    eess.IV cs.CV q-bio.QM

    Detecting Schizophrenia with 3D Structural Brain MRI Using Deep Learning

    Authors: Junhao Zhang, Vishwanatha M. Rao, Ye Tian, Yanting Yang, Nicolas Acosta, Zihan Wan, Pin-Yu Lee, Chloe Zhang, Lawrence S. Kegeles, Scott A. Small, Jia Guo

    Abstract: Schizophrenia is a chronic neuropsychiatric disorder that causes distinct structural alterations within the brain. We hypothesize that deep learning applied to a structural neuroimaging dataset could detect disease-related alteration and improve classification and diagnostic accuracy. We tested this hypothesis using a single, widely available, and conventional T1-weighted MRI scan, from which we e… ▽ More

    Submitted 7 July, 2022; v1 submitted 26 June, 2022; originally announced June 2022.

    Comments: 13 pages, 6 figures

  35. arXiv:2205.14555  [pdf, other

    cs.IT

    Two New Piggybacking Designs with Lower Repair Bandwidth

    Authors: Zhengyi Jiang, Hanxu Hou, Yunghsiang S. Han, Patrick P. C. Lee, Bo Bai, Zhongyi Huang

    Abstract: Piggybacking codes are a special class of MDS array codes that can achieve small repair bandwidth with small sub-packetization by first creating some instances of an $(n,k)$ MDS code, such as a Reed-Solomon (RS) code, and then designing the piggyback function. In this paper, we propose a new piggybacking coding design which designs the piggyback function over some instances of both $(n,k)$ MDS cod… ▽ More

    Submitted 28 May, 2022; originally announced May 2022.

  36. arXiv:2205.11753  [pdf, other

    cs.PF

    Efficient LSM-Tree Key-Value Data Management on Hybrid SSD/HDD Zoned Storage

    Authors: Jinhong Li, Qiuping Wang, Patrick P. C. Lee

    Abstract: Zoned storage devices, such as zoned namespace (ZNS) solid-state drives (SSDs) and host-managed shingled magnetic recording (HM-SMR) hard-disk drives (HDDs), expose interfaces for host-level applications to support fine-grained, high-performance storage management. Combining ZNS SSDs and HM-SMR HDDs into a unified hybrid storage system is a natural direction to scale zoned storage at low cost, yet… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  37. arXiv:2205.10451  [pdf, other

    cs.CL

    Searching for PETs: Using Distributional and Sentiment-Based Methods to Find Potentially Euphemistic Terms

    Authors: Patrick Lee, Martha Gavidia, Anna Feldman, Jing Peng

    Abstract: This paper presents a linguistically driven proof of concept for finding potentially euphemistic terms, or PETs. Acknowledging that PETs tend to be commonly used expressions for a certain range of sensitive topics, we make use of distributional similarities to select and filter phrase candidates from a sentence and rank them using a set of simple sentiment-based metrics. We present the results of… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    Journal ref: Proceedings of UnImplicit: The Second Workshop on Understanding Implicit and Underspecified Language, NAACL 2022, Seattle

  38. arXiv:2205.02728  [pdf, other

    cs.CL

    CATs are Fuzzy PETs: A Corpus and Analysis of Potentially Euphemistic Terms

    Authors: Martha Gavidia, Patrick Lee, Anna Feldman, Jing Peng

    Abstract: Euphemisms have not received much attention in natural language processing, despite being an important element of polite and figurative language. Euphemisms prove to be a difficult topic, not only because they are subject to language change, but also because humans may not agree on what is a euphemism and what is not. Nevertheless, the first step to tackling the issue is to collect and analyze exa… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: Proceedings of LREC 2022

  39. arXiv:2203.16209  [pdf, other

    cs.CV

    Fair Contrastive Learning for Facial Attribute Classification

    Authors: Sungho Park, Jewook Lee, Pilhyeon Lee, Sunhee Hwang, Dohyung Kim, Hyeran Byun

    Abstract: Learning visual representation of high quality is essential for image classification. Recently, a series of contrastive representation learning methods have achieved preeminent success. Particularly, SupCon outperformed the dominant methods based on cross-entropy loss in representation learning. However, we notice that there could be potential ethical risks in supervised contrastive learning. In t… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: CVPR 2022

  40. arXiv:2203.10766  [pdf, other

    cs.DC

    An In-Depth Comparative Analysis of Cloud Block Storage Workloads: Findings and Implications

    Authors: Jinhong Li, Qiuping Wang, Patrick P. C. Lee, Chao Shi

    Abstract: Cloud block storage systems support diverse types of applications in modern cloud services. Characterizing their I/O activities is critical for guiding better system designs and optimizations. In this paper, we present an in-depth comparative analysis of production cloud block storage workloads through the block-level I/O traces of billions of I/O requests collected from two production systems, Al… ▽ More

    Submitted 19 November, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: 30 pages. Accepted by ACM Transactions on Storage

  41. The Unboxing Experience: Exploration and Design of Initial Interactions Between Children and Social Robots

    Authors: Christine P Lee, Bengisu Cagiltay, Bilge Mutlu

    Abstract: Social robots are increasingly introduced into children's lives as educational and social companions, yet little is known about how these products might best be introduced to their environments. The emergence of the "unboxing" phenomenon in media suggests that introduction is key to technology adoption where initial impressions are made. To better understand this phenomenon toward designing a posi… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: To be published in 2022 CHI Conference on Human Factors in Computing Systems (CHI '22)

    Journal ref: CHI Conference on Human Factors in Computing Systems (CHI '22), April 29-May 5, 2022, New Orleans, LA, USA

  42. arXiv:2202.02901  [pdf, other

    eess.SP cs.AI cs.CV

    Inter-subject Contrastive Learning for Subject Adaptive EEG-based Visual Recognition

    Authors: Pilhyeon Lee, Sunhee Hwang, Jewook Lee, Minjung Shin, Seogkyu Jeon, Hyeran Byun

    Abstract: This paper tackles the problem of subject adaptive EEG-based visual recognition. Its goal is to accurately predict the categories of visual stimuli based on EEG signals with only a handful of samples for the target subject during training. The key challenge is how to appropriately transfer the knowledge obtained from abundant data of source subjects to the subject of interest. To this end, we intr… ▽ More

    Submitted 6 February, 2022; originally announced February 2022.

    Comments: Accepted by the 10th IEEE International Winter Conference on Brain-Computer Interface (BCI 2022). Code is available at https://github.com/DeepBCI/Deep-BCI

  43. Improving Across-Dataset Brain Tissue Segmentation Using Transformer

    Authors: Vishwanatha M. Rao, Zihan Wan, Soroush Arabshahi, David J. Ma, Pin-Yu Lee, Ye Tian, Xuzhe Zhang, Andrew F. Laine, Jia Guo

    Abstract: Brain tissue segmentation has demonstrated great utility in quantifying MRI data through Voxel-Based Morphometry and highlighting subtle structural changes associated with various conditions within the brain. However, manual segmentation is highly labor-intensive, and automated approaches have struggled due to properties inherent to MRI acquisition, leaving a great need for an effective segmentati… ▽ More

    Submitted 31 January, 2023; v1 submitted 21 January, 2022; originally announced January 2022.

    ACM Class: I.4.6

  44. arXiv:2112.09771  [pdf, ps, other

    cs.CR cs.IT

    Privacy Leakage over Dependent Attributes in One-Sided Differential Privacy

    Authors: Phillip Lee, Kevin Smith

    Abstract: Providing a provable privacy guarantees while maintaining the utility of data is a challenging task in many real-world applications. Recently, a new framework called One-Sided Differential Privacy (OSDP) was introduced that extends existing differential privacy approaches. OSDP increases the utility of the data by taking advantage of the fact that not all records are sensitive. However, the previo… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

  45. arXiv:2111.02519  [pdf, other

    cs.CL

    Athena 2.0: Contextualized Dialogue Management for an Alexa Prize SocialBot

    Authors: Juraj Juraska, Kevin K. Bowden, Lena Reed, Vrindavan Harrison, Wen Cui, Omkar Patil, Rishi Rajasekaran, Angela Ramirez, Cecilia Li, Eduardo Zamora, Phillip Lee, Jeshwanth Bheemanpally, Rohan Pandey, Adwait Ratnaparkhi, Marilyn Walker

    Abstract: Athena 2.0 is an Alexa Prize SocialBot that has been a finalist in the last two Alexa Prize Grand Challenges. One reason for Athena's success is its novel dialogue management strategy, which allows it to dynamically construct dialogues and responses from component modules, leading to novel conversations with every interaction. Here we describe Athena's system design and performance in the Alexa Pr… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: Accepted to EMNLP 2021 System Demonstrations

  46. arXiv:2110.13470  [pdf, other

    cs.CV cs.AI

    Subject Adaptive EEG-based Visual Recognition

    Authors: Pilhyeon Lee, Sunhee Hwang, Seogkyu Jeon, Hyeran Byun

    Abstract: This paper focuses on EEG-based visual recognition, aiming to predict the visual object class observed by a subject based on his/her EEG signals. One of the main challenges is the large variation between signals from different subjects. It limits recognition systems to work only for the subjects involved in model training, which is undesirable for real-world scenarios where new subjects are freque… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted by ACPR 2021. Code is available at https://github.com/DeepBCI/Deep-BCI

  47. arXiv:2110.04785  [pdf, ps, other

    cs.IT

    A Generalization of Array Codes with Local Properties and Efficient Encoding/Decoding

    Authors: Hanxu Hou, Yunghsiang S. Han, Patrick P. C. Lee, You Wu, Guojun Han, Mario Blaum

    Abstract: A maximum distance separable (MDS) array code is composed of $m\times (k+r)$ arrays such that any $k$ out of $k+r$ columns suffice to retrieve all the information symbols. Expanded-Blaum-Roth (EBR) codes and Expanded-Independent-Parity (EIP) codes are two classes of MDS array codes that can repair any one symbol in a column by locally accessing some other symbols within the column, where the numbe… ▽ More

    Submitted 12 September, 2022; v1 submitted 10 October, 2021; originally announced October 2021.

  48. arXiv:2108.11173  [pdf, other

    cs.NE

    Incorporating Surprisingly Popular Algorithm and Euclidean Distance-based Adaptive Topology into PSO

    Authors: Xuan Wu, Jizong Han, Di Wang, Pengyue Gao, Quanlong Cui, Liang Chen, Yanchun Liang, Han Huang, Heow Pueh Lee, Chunyan Miao, You Zhou, Chunguo Wu

    Abstract: While many Particle Swarm Optimization (PSO) algorithms only use fitness to assess the performance of particles, in this work, we adopt Surprisingly Popular Algorithm (SPA) as a complementary metric in addition to fitness. Consequently, particles that are not widely known also have the opportunity to be selected as the learning exemplars. In addition, we propose a Euclidean distance-based adaptive… ▽ More

    Submitted 12 September, 2023; v1 submitted 25 August, 2021; originally announced August 2021.

  49. arXiv:2108.08596  [pdf, other

    cs.CV

    Feature Stylization and Domain-aware Contrastive Learning for Domain Generalization

    Authors: Seogkyu Jeon, Kibeom Hong, Pilhyeon Lee, Jewook Lee, Hyeran Byun

    Abstract: Domain generalization aims to enhance the model robustness against domain shift without accessing the target domain. Since the available source domains for training are limited, recent approaches focus on generating samples of novel domains. Nevertheless, they either struggle with the optimization problem when synthesizing abundant domains or cause the distortion of class semantics. To these ends,… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

    Comments: Accepted to ACM MM 2021 (oral)

  50. arXiv:2108.05029  [pdf, other

    cs.CV

    Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization

    Authors: Pilhyeon Lee, Hyeran Byun

    Abstract: We tackle the problem of localizing temporal intervals of actions with only a single frame label for each action instance for training. Owing to label sparsity, existing work fails to learn action completeness, resulting in fragmentary action predictions. In this paper, we propose a novel framework, where dense pseudo-labels are generated to provide completeness guidance for the model. Concretely,… ▽ More

    Submitted 11 August, 2021; originally announced August 2021.

    Comments: Accepted by ICCV 2021 (Oral). Code is available at https://github.com/Pilhyeon