Skip to main content

Showing 1–50 of 115 results for author: Qiu, M

  1. arXiv:2407.13863  [pdf, other

    cs.CV

    A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks

    Authors: Yixiang Qiu, Hao Fang, Hongyao Yu, Bin Chen, MeiKang Qiu, Shu-Tao Xia

    Abstract: Model Inversion (MI) attacks aim to reconstruct privacy-sensitive training data from released models by utilizing output information, raising extensive concerns about the security of Deep Neural Networks (DNNs). Recent advances in generative adversarial networks (GANs) have contributed significantly to the improved performance of MI attacks due to their powerful ability to generate realistic image… ▽ More

    Submitted 27 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  2. arXiv:2407.11356  [pdf, other

    cs.CV

    The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation

    Authors: Muyang Qiu, Jian Zhang, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao

    Abstract: Despite the recent success of domain generalization in medical image segmentation, voxel-wise annotation for all source domains remains a huge burden. Semi-supervised domain generalization has been proposed very recently to combat this challenge by leveraging limited labeled data along with abundant unlabeled data collected from multiple medical institutions, depending on precisely harnessing unla… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  3. arXiv:2407.10563  [pdf, other

    cs.CV

    Pathformer3D: A 3D Scanpath Transformer for 360° Images

    Authors: Rong Quan, Yantao Lai, Mengyu Qiu, Dong Liang

    Abstract: Scanpath prediction in 360° images can help realize rapid rendering and better user interaction in Virtual/Augmented Reality applications. However, existing scanpath prediction models for 360° images execute scanpath prediction on 2D equirectangular projection plane, which always result in big computation error owing to the 2D plane's distortion and coordinate discontinuity. In this work, we perfo… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  4. arXiv:2407.09966  [pdf, other

    cs.CV eess.IV

    Optimizing ROI Benefits Vehicle ReID in ITS

    Authors: Mei Qiu, Lauren Ann Christopher, Lingxi Li, Stanley Chien, Yaobin Chen

    Abstract: Vehicle re-identification (ReID) is a computer vision task that matches the same vehicle across different cameras or viewpoints in a surveillance system. This is crucial for Intelligent Transportation Systems (ITS), where the effectiveness is influenced by the regions from which vehicle images are cropped. This study explores whether optimal vehicle detection regions, guided by detection confidenc… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  5. arXiv:2407.07842  [pdf, other

    cs.CV

    Study on Aspect Ratio Variability toward Robustness of Vision Transformer-based Vehicle Re-identification

    Authors: Mei Qiu, Lauren Christopher, Lingxi Li

    Abstract: Vision Transformers (ViTs) have excelled in vehicle re-identification (ReID) tasks. However, non-square aspect ratios of image or video input might significantly affect the re-identification performance. To address this issue, we propose a novel ViT-based ReID framework in this paper, which fuses models trained on a variety of aspect ratios. Our main contributions are threefold: (i) We analyze asp… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  6. arXiv:2407.05610  [pdf, other

    cs.CV

    Described Spatial-Temporal Video Detection

    Authors: Wei Ji, Xiangyan Liu, Yingfei Sun, Jiajun Deng, You Qin, Ammar Nuwanna, Mengyao Qiu, Lina Wei, Roger Zimmermann

    Abstract: Detecting visual content on language expression has become an emerging topic in the community. However, in the video domain, the existing setting, i.e., spatial-temporal video grounding (STVG), is formulated to only detect one pre-existing object in each frame, ignoring the fact that language descriptions can involve none or multiple entities within a video. In this work, we advance the STVG to a… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  7. arXiv:2407.04688  [pdf, other

    cs.CV

    Enhancing Vehicle Re-identification and Matching for Weaving Analysis

    Authors: Mei Qiu, Wei Lin, Stanley Chien, Lauren Christopher, Yaobin Chen, Shu Hu

    Abstract: Vehicle weaving on highways contributes to traffic congestion, raises safety issues, and underscores the need for sophisticated traffic management systems. Current tools are inadequate in offering precise and comprehensive data on lane-specific weaving patterns. This paper introduces an innovative method for collecting non-overlapping video data in weaving zones, enabling the generation of quantit… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  8. Self-consistent Deep Geometric Learning for Heterogeneous Multi-source Spatial Point Data Prediction

    Authors: Dazhou Yu, Xiaoyun Gong, Yun Li, Meikang Qiu, Liang Zhao

    Abstract: Multi-source spatial point data prediction is crucial in fields like environmental monitoring and natural resource management, where integrating data from various sensors is the key to achieving a holistic environmental understanding. Existing models in this area often fall short due to their domain-specific nature and lack a strategy for integrating information from various sources in the absence… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  9. arXiv:2406.09317  [pdf, other

    eess.IV cs.CV

    Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

    Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. To RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources… ▽ More

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  10. arXiv:2406.05704  [pdf, other

    cs.CV

    Hierarchical Features Matter: A Deep Exploration of GAN Priors for Improved Dataset Distillation

    Authors: Xinhao Zhong, Hao Fang, Bin Chen, Xulin Gu, Tao Dai, Meikang Qiu, Shu-Tao Xia

    Abstract: Dataset distillation is an emerging dataset reduction method, which condenses large-scale datasets while maintaining task accuracy. Current methods have integrated parameterization techniques to boost synthetic dataset performance by shifting the optimization space from pixel to another informative feature domain. However, they limit themselves to a fixed optimization space for distillation, negle… ▽ More

    Submitted 12 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  11. arXiv:2405.16919  [pdf, other

    cs.CV cs.AI cs.CL

    VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models

    Authors: Zejun Li, Ruipu Luo, Jiwen Zhang, Minghui Qiu, Zhongyu Wei

    Abstract: While large multi-modal models (LMMs) have exhibited impressive capabilities across diverse tasks, their effectiveness in handling complex tasks has been limited by the prevailing single-step reasoning paradigm. To this end, this paper proposes VoCoT, a multi-step Visually grounded object-centric Chain-of-Thought reasoning framework tailored for inference with LMMs. VoCoT is characterized by two k… ▽ More

    Submitted 28 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  12. arXiv:2405.07547  [pdf, other

    cs.IT eess.SP

    Channel Coding Toward 6G: Technical Overview and Outlook

    Authors: Mohammad Rowshan, Min Qiu, Yixuan Xie, Xinyi Gu, Jinhong Yuan

    Abstract: Channel coding plays a pivotal role in ensuring reliable communication over wireless channels. With the growing need for ultra-reliable communication in emerging wireless use cases, the significance of channel coding has amplified. Furthermore, minimizing decoding latency is crucial for critical-mission applications, while optimizing energy efficiency is paramount for mobile and the Internet of Th… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 102 pages, 87 figures, IEEE Open Journal of the Communications Society (invited paper)

  13. arXiv:2404.15212  [pdf, other

    cs.CV eess.IV

    Real-time Lane-wise Traffic Monitoring in Optimal ROIs

    Authors: Mei Qiu, Wei Lin, Lauren Ann Christopher, Stanley Chien, Yaobin Chen, Shu Hu

    Abstract: In the US, thousands of Pan, Tilt, and Zoom (PTZ) traffic cameras monitor highway conditions. There is a great interest in using these highway cameras to gather valuable road traffic data to support traffic analysis and decision-making for highway safety and efficient traffic management. However, there are too many cameras for a few human traffic operators to effectively monitor, so a fully automa… ▽ More

    Submitted 28 March, 2024; originally announced April 2024.

  14. arXiv:2404.04861  [pdf, other

    cs.CR

    Privacy-Preserving Traceable Functional Encryption for Inner Product

    Authors: Muyao Qiu, Jinguang Han

    Abstract: Functional encryption introduces a new paradigm of public key encryption that decryption only reveals the function value of encrypted data. To curb key leakage issues and trace users in FE-IP, a new primitive called traceable functional encryption for inner product (TFE-IP) has been proposed. However, the privacy protection of user's identities has not been considered in the existing TFE-IP scheme… ▽ More

    Submitted 14 April, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

  15. arXiv:2403.14922  [pdf, other

    cs.LG cs.NI

    CODA: A COst-efficient Test-time Domain Adaptation Mechanism for HAR

    Authors: Minghui Qiu, Yandao Huang, Lin Chen, Lu Wang, Kaishun Wu

    Abstract: In recent years, emerging research on mobile sensing has led to novel scenarios that enhance daily life for humans, but dynamic usage conditions often result in performance degradation when systems are deployed in real-world settings. Existing solutions typically employ one-off adaptation schemes based on neural networks, which struggle to ensure robustness against uncertain drifting conditions in… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  16. arXiv:2402.17613  [pdf, other

    cs.CL

    Neural Automated Writing Evaluation with Corrective Feedback

    Authors: Izia Xiaoxiao Wang, Xihan Wu, Edith Coates, Min Zeng, Jiexin Kuang, Siliang Liu, Mengyang Qiu, Jungyeul Park

    Abstract: The utilization of technology in second language learning and teaching has become ubiquitous. For the assessment of writing specifically, automated writing evaluation (AWE) and grammatical error correction (GEC) have become immensely popular and effective methods for enhancing writing proficiency and delivering instant and individualized feedback to learners. By leveraging the power of natural lan… ▽ More

    Submitted 6 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Supported by the SoTL Seed Program at UBC

  17. arXiv:2402.15931  [pdf, ps, other

    cs.CL

    Frustratingly Simple Prompting-based Text Denoising

    Authors: Jungyeul Park, Mengyang Qiu

    Abstract: This paper introduces a novel perspective on the automated essay scoring (AES) task, challenging the conventional view of the ASAP dataset as a static entity. Employing simple text denoising techniques using prompting, we explore the dynamic potential within the dataset. While acknowledging the previous emphasis on building regression systems, our paper underscores how making minor changes to a da… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: Published as a Tiny Paper at ICLR 2024

  18. arXiv:2402.15930  [pdf, ps, other

    cs.CL

    Evaluating Prompting Strategies for Grammatical Error Correction Based on Language Proficiency

    Authors: Min Zeng, Jiexin Kuang, Mengyang Qiu, Jayoung Song, Jungyeul Park

    Abstract: The writing examples of English language learners may be different from those of native speakers. Given that there is a significant differences in second language (L2) learners' error types by their proficiency levels, this paper attempts to reduce overcorrection by examining the interaction between LLM's performance and L2 language proficiency. Our method focuses on zero-shot and few-shot prompti… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: To appear in LREC-COLING 2024, short paper (preprint)

  19. arXiv:2402.15521  [pdf, other

    cs.AI cs.LG

    HKD-SHO: A hybrid smart home system based on knowledge-based and data-driven services

    Authors: Mingming Qiu, Elie Najm, Rémi Sharrock, Bruno Traverson

    Abstract: A smart home is realized by setting up various services. Several methods have been proposed to create smart home services, which can be divided into knowledge-based and data-driven approaches. However, knowledge-based approaches usually require manual input from the inhabitant, which can be complicated if the physical phenomena of the concerned environment states are complex, and the inhabitant do… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: keywords: Hybrid System, Knowledge Representation, Reinforcement Learning, Services, Smart Home

  20. arXiv:2401.11058  [pdf, ps, other

    cs.IT eess.SP

    Low Complexity Turbo SIC-MMSE Detection for Orthogonal Time Frequency Space Modulation

    Authors: Qi Li, Jinhong Yuan, Min Qiu, Shuangyang Li, Yixuan Xie

    Abstract: Recently, orthogonal time frequency space (OTFS) modulation has garnered considerable attention due to its robustness against doubly-selective wireless channels. In this paper, we propose a low-complexity iterative successive interference cancellation based minimum mean squared error (SIC-MMSE) detection algorithm for zero-padded OTFS (ZP-OTFS) modulation. In the proposed algorithm, signals are de… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 15 pages, 12 figures, accepted by IEEE Transactions on Communications

  21. arXiv:2401.01433  [pdf, other

    cs.IT eess.SP

    Multiple Access Techniques for Intelligent and Multi-Functional 6G: Tutorial, Survey, and Outlook

    Authors: Bruno Clerckx, Yijie Mao, Zhaohui Yang, Mingzhe Chen, Ahmed Alkhateeb, Liang Liu, Min Qiu, Jinhong Yuan, Vincent W. S. Wong, Juan Montojo

    Abstract: Multiple access (MA) is a crucial part of any wireless system and refers to techniques that make use of the resource dimensions to serve multiple users/devices/machines/services, ideally in the most efficient way. Given the needs of multi-functional wireless networks for integrated communications, sensing, localization, computing, coupled with the surge of machine learning / artificial intelligenc… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: submitted for publication in Proceedings of the IEEE

  22. arXiv:2312.17493  [pdf, other

    cs.LG cs.CR

    Differentially Private Low-Rank Adaptation of Large Language Model Using Federated Learning

    Authors: Xiao-Yang Liu, Rongyi Zhu, Daochen Zha, Jiechao Gao, Shan Zhong, Matt White, Meikang Qiu

    Abstract: The surge in interest and application of large language models (LLMs) has sparked a drive to fine-tune these models to suit specific applications, such as finance and medical science. However, concerns regarding data privacy have emerged, especially when multiple stakeholders aim to collaboratively enhance LLMs using sensitive data. In this scenario, federated learning becomes a natural choice, al… ▽ More

    Submitted 2 June, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: 21 pages, 1 figure, 19 tables

  23. arXiv:2311.06761  [pdf, other

    cs.CL

    Learning Knowledge-Enhanced Contextual Language Representations for Domain Natural Language Understanding

    Authors: Ruyao Xu, Taolin Zhang, Chengyu Wang, Zhongjie Duan, Cen Chen, Minghui Qiu, Dawei Cheng, Xiaofeng He, Weining Qian

    Abstract: Knowledge-Enhanced Pre-trained Language Models (KEPLMs) improve the performance of various downstream NLP tasks by injecting knowledge facts from large-scale Knowledge Graphs (KGs). However, existing methods for pre-training KEPLMs with relational triples are difficult to be adapted to close domains due to the lack of sufficient domain graph semantics. In this paper, we propose a Knowledge-enhance… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: emnlp 2023

  24. arXiv:2310.08420  [pdf, other

    cs.CV

    Visual Attention Prompted Prediction and Learning

    Authors: Yifei Zhang, Siyi Gu, Bo Pan, Guangji Bai, Meikang Qiu, Xiaofeng Yang, Liang Zhao

    Abstract: Visual explanation (attention)-guided learning uses not only labels but also explanations to guide model reasoning process. While visual attention-guided learning has shown promising results, it requires a large number of explanation annotations that are time-consuming to prepare. However, in many real-world situations, it is usually desired to prompt the model with visual attention without model… ▽ More

    Submitted 23 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  25. arXiv:2308.15840  [pdf, other

    cs.LG cs.AI physics.soc-ph q-bio.PE

    MSGNN: Multi-scale Spatio-temporal Graph Neural Network for Epidemic Forecasting

    Authors: Mingjie Qiu, Zhiyi Tan, Bing-kun Bao

    Abstract: Infectious disease forecasting has been a key focus and proved to be crucial in controlling epidemic. A recent trend is to develop forecast-ing models based on graph neural networks (GNNs). However, existing GNN-based methods suffer from two key limitations: (1) Current models broaden receptive fields by scaling the depth of GNNs, which is insuffi-cient to preserve the semantics of long-range conn… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: 29 pages

    Report number: DAMI-D-23-00319R2

    Journal ref: Data Min Knowl Disc (2024)

  26. arXiv:2308.13229  [pdf, other

    cs.CV

    ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Tracking

    Authors: Cheng-Che Cheng, Min-Xuan Qiu, Chen-Kuo Chiang, Shang-Hong Lai

    Abstract: Multi-Camera Multi-Object Tracking (MC-MOT) utilizes information from multiple views to better handle problems with occlusion and crowded scenes. Recently, the use of graph-based approaches to solve tracking problems has become very popular. However, many current graph-based methods do not effectively utilize information regarding spatial and temporal consistency. Instead, they rely on single-came… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV2023

  27. arXiv:2308.09012  [pdf, other

    cs.CV

    FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings

    Authors: Yulin Su, Min Yang, Minghui Qiu, Jing Wang, Tao Wang

    Abstract: Logo embedding plays a crucial role in various e-commerce applications by facilitating image retrieval or recognition, such as intellectual property protection and product search. However, current methods treat logo embedding as a purely visual problem, which may limit their performance in real-world scenarios. A notable issue is that the textual knowledge embedded in logo images has not been adeq… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  28. arXiv:2308.08883  [pdf, other

    cs.IT eess.SP

    Coexistence of Heterogeneous Services in the Uplink with Discrete Signaling and Treating Interference as Noise

    Authors: Min Qiu, Yu-Chih Huang, Jinhong Yuan

    Abstract: The problem of enabling the coexistence of heterogeneous services, e.g., different ultra-reliable low-latency communications (URLLC) services and/or enhanced mobile broadband (eMBB) services, in the uplink is studied. Each service has its own error probability and blocklength constraints and the longer transmission block suffers from heterogeneous interference. Due to the latency concern, the deco… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 7 pages, accepted for presentation at IEEE Global Communications Conference (GLOBECOM) 2023

  29. arXiv:2308.04278  [pdf, other

    eess.SP cs.IT

    Achieving Covert Communication With A Probabilistic Jamming Strategy

    Authors: Xun Chen, Fujun Gao, Min Qiu, Jia Zhang, Feng Shu, Shihao Yan

    Abstract: In this work, we consider a covert communication scenario, where a transmitter Alice communicates to a receiver Bob with the aid of a probabilistic and uninformed jammer against an adversary warden's detection. The transmission status and power of the jammer are random and follow some priori probabilities. We first analyze the warden's detection performance as a function of the jammer's transmissi… ▽ More

    Submitted 29 August, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

  30. arXiv:2308.02457  [pdf, other

    cs.AI

    A Survey on Temporal Knowledge Graph Completion: Taxonomy, Progress, and Prospects

    Authors: Jiapu Wang, Boyue Wang, Meikang Qiu, Shirui Pan, Bo Xiong, Heng Liu, Linhao Luo, Tengfei Liu, Yongli Hu, Baocai Yin, Wen Gao

    Abstract: Temporal characteristics are prominently evident in a substantial volume of knowledge, which underscores the pivotal role of Temporal Knowledge Graphs (TKGs) in both academia and industry. However, TKGs often suffer from incompleteness for three main reasons: the continuous emergence of new knowledge, the weakness of the algorithm for extracting structured information from unstructured data, and t… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  31. arXiv:2307.04525  [pdf, other

    eess.IV cs.CV cs.LG

    Cluster-Induced Mask Transformers for Effective Opportunistic Gastric Cancer Screening on Non-contrast CT Scans

    Authors: Mingze Yuan, Yingda Xia, Xin Chen, Jiawen Yao, Junli Wang, Mingyan Qiu, Hexin Dong, Jingren Zhou, Bin Dong, Le Lu, Li Zhang, Zaiyi Liu, Ling Zhang

    Abstract: Gastric cancer is the third leading cause of cancer-related mortality worldwide, but no guideline-recommended screening test exists. Existing methods can be invasive, expensive, and lack sensitivity to identify early-stage gastric cancer. In this study, we explore the feasibility of using a deep learning approach on non-contrast CT scans for gastric cancer detection. We propose a novel cluster-ind… ▽ More

    Submitted 15 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: MICCAI 2023

  32. arXiv:2306.17451  [pdf, ps, other

    cs.IT eess.SP

    Self-Connected Spatially Coupled LDPC Codes with Improved Termination

    Authors: Yihuan Liao, Min Qiu, Jinhong Yuan

    Abstract: This paper investigates the design of self-connected spatially coupled low-density parity-check (SC-LDPC) codes. First, a termination method is proposed to reduce rate loss. Particularly, a single-side open SC-LDPC ensemble is introduced, which halves the rate loss of a conventional terminated SC-LDPC by reducing the number of check nodes. We further propose a self-connection method that allows re… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: 6 pages, 8 figures, accepted for publication in IEEE Communications Letters

  33. arXiv:2306.07207  [pdf, other

    cs.CV cs.AI cs.CL

    Valley: Video Assistant with Large Language model Enhanced abilitY

    Authors: Ruipu Luo, Ziwang Zhao, Min Yang, Junwei Dong, Da Li, Pengcheng Lu, Tao Wang, Linmei Hu, Minghui Qiu, Zhongyu Wei

    Abstract: Large language models (LLMs), with their remarkable conversational capabilities, have demonstrated impressive performance across various applications and have emerged as formidable AI assistants. In view of this, it raises an intuitive question: Can we harness the power of LLMs to build multimodal AI assistants for visual applications? Recently, several multi-modal models have been developed for t… ▽ More

    Submitted 8 October, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

  34. arXiv:2305.02200  [pdf, other

    cs.SI cs.LG

    Deep Graph Representation Learning and Optimization for Influence Maximization

    Authors: Chen Ling, Junji Jiang, Junxiang Wang, My Thai, Lukas Xue, James Song, Meikang Qiu, Liang Zhao

    Abstract: Influence maximization (IM) is formulated as selecting a set of initial users from a social network to maximize the expected number of influenced users. Researchers have made great progress in designing various traditional methods, and their theoretical design and performance gain are close to a limit. In the past few years, learning-based IM methods have emerged to achieve stronger generalization… ▽ More

    Submitted 6 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: In Proceedings of the 40th International Conference on Machine Learning (ICML 2023), Honolulu, Hawaii, USA. PMLR 202, 2023

  35. arXiv:2304.00212  [pdf, other

    cs.CV cs.LG

    Devil is in the Queries: Advancing Mask Transformers for Real-world Medical Image Segmentation and Out-of-Distribution Localization

    Authors: Mingze Yuan, Yingda Xia, Hexin Dong, Zifan Chen, Jiawen Yao, Mingyan Qiu, Ke Yan, Xiaoli Yin, Yu Shi, Xin Chen, Zaiyi Liu, Bin Dong, Jingren Zhou, Le Lu, Ling Zhang, Li Zhang

    Abstract: Real-world medical image segmentation has tremendous long-tailed complexity of objects, among which tail conditions correlate with relatively rare diseases and are clinically significant. A trustworthy medical AI algorithm should demonstrate its effectiveness on tail conditions to avoid clinically dangerous damage in these out-of-distribution (OOD) cases. In this paper, we adopt the concept of obj… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

    Comments: CVPR 2023 Highlight

  36. arXiv:2302.08018  [pdf, other

    cs.SE cs.AI

    Towards Fair Machine Learning Software: Understanding and Addressing Model Bias Through Counterfactual Thinking

    Authors: Zichong Wang, Yang Zhou, Meikang Qiu, Israat Haque, Laura Brown, Yi He, Jianwu Wang, David Lo, Wenbin Zhang

    Abstract: The increasing use of Machine Learning (ML) software can lead to unfair and unethical decisions, thus fairness bugs in software are becoming a growing concern. Addressing these fairness bugs often involves sacrificing ML performance, such as accuracy. To address this issue, we present a novel counterfactual approach that uses counterfactual thinking to tackle the root causes of bias in ML software… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

  37. arXiv:2302.03507  [pdf, other

    cs.CL cs.AI

    Meta-Learning Siamese Network for Few-Shot Text Classification

    Authors: Chengcheng Han, Yuhe Wang, Yingnan Fu, Xiang Li, Minghui Qiu, Ming Gao, Aoying Zhou

    Abstract: Few-shot learning has been used to tackle the problem of label scarcity in text classification, of which meta-learning based methods have shown to be effective, such as the prototypical networks (PROTO). Despite the success of PROTO, there still exist three main problems: (1) ignore the randomness of the sampled support sets when computing prototype vectors; (2) disregard the importance of labeled… ▽ More

    Submitted 16 March, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

  38. arXiv:2301.12291  [pdf, other

    eess.IV cs.CV

    CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans

    Authors: Jieneng Chen, Yingda Xia, Jiawen Yao, Ke Yan, Jianpeng Zhang, Le Lu, Fakai Wang, Bo Zhou, Mingyan Qiu, Qihang Yu, Mingze Yuan, Wei Fang, Yuxing Tang, Minfeng Xu, Jian Zhou, Yuqian Zhao, Qifeng Wang, Xianghua Ye, Xiaoli Yin, Yu Shi, Xin Chen, Jingren Zhou, Alan Yuille, Zaiyi Liu, Ling Zhang

    Abstract: Human readers or radiologists routinely perform full-body multi-organ multi-disease detection and diagnosis in clinical practice, while most medical AI systems are built to focus on single organs with a narrow list of a few diseases. This might severely limit AI's clinical adoption. A certain number of AI models need to be assembled non-trivially to match the diagnostic process of a human reading… ▽ More

    Submitted 6 October, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: ICCV 2023 Camera Ready Version

  39. arXiv:2301.09303  [pdf, other

    cs.IT eess.SP

    Downlink Transmission under Heterogeneous Blocklength Constraints: Discrete Signaling with Single-User Decoding

    Authors: Min Qiu, Yu-Chih Huang, Jinhong Yuan

    Abstract: In this paper, we consider the downlink broadcast channel under heterogenous blocklength constraints, where each user experiences different interference statistics across its received symbols. Different from the homogeneous blocklength case, the strong users with short blocklength transmitted symbol blocks usually cannot wait to receive the entire transmission frame and perform successive interfer… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: 7 pages, 1 figure, accepted for presentation at IEEE ICC 2023. arXiv admin note: substantial text overlap with arXiv:2212.01736

  40. arXiv:2212.10013  [pdf, other

    cs.AI cs.CL

    DocAsRef: An Empirical Study on Repurposing Reference-Based Summary Quality Metrics Reference-Freely

    Authors: Forrest Sheng Bao, Ruixuan Tu, Ge Luo, Yinfei Yang, Hebi Li, Minghui Qiu, Youbiao He, Cen Chen

    Abstract: Automated summary quality assessment falls into two categories: reference-based and reference-free. Reference-based metrics, historically deemed more accurate due to the additional information provided by human-written references, are limited by their reliance on human input. In this paper, we hypothesize that the comparison methodologies used by some reference-based metrics to evaluate a system s… ▽ More

    Submitted 26 November, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted into Findings of EMNLP 2023

  41. arXiv:2212.01736  [pdf, other

    cs.IT eess.SP

    Downlink Transmission with Heterogeneous URLLC Services: Discrete Signaling With Single-User Decoding

    Authors: Min Qiu, Yu-Chih Huang, Jinhong Yuan

    Abstract: The problem of designing downlink transmission schemes for supporting heterogeneous ultra-reliable low-latency communications (URLLC) and/or with other types of services is investigated. We consider the broadcast channel, where the base station sends superimposed signals to multiple users. Under heterogeneous blocklength constraints, strong users who are URLLC users cannot wait to receive the enti… ▽ More

    Submitted 2 May, 2023; v1 submitted 3 December, 2022; originally announced December 2022.

    Comments: 16 pages, 7 figures, accepted by IEEE Journal on Selected Areas in Communications

  42. arXiv:2211.05100  [pdf, other

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop, :, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major , et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access… ▽ More

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  43. arXiv:2210.11674  [pdf, other

    cs.HC

    WristSketcher: Creating Dynamic Sketches in AR with a Sensing Wristband

    Authors: Enting Ying, Tianyang Xiong, Shihui Guo, Ming Qiu, Yipeng Qin, Hongbo Fu

    Abstract: Restricted by the limited interaction area of native AR glasses (e.g., touch bars), it is challenging to create sketches in AR glasses. Recent works have attempted to use mobile devices (e.g., tablets) or mid-air bare-hand gestures to expand the interactive spaces and can work as the 2D/3D sketching input interfaces for AR glasses. Between them, mobile devices allow for accurate sketching but are… ▽ More

    Submitted 26 October, 2022; v1 submitted 20 October, 2022; originally announced October 2022.

  44. arXiv:2210.09049  [pdf, other

    cs.CL

    SpanProto: A Two-stage Span-based Prototypical Network for Few-shot Named Entity Recognition

    Authors: Jianing Wang, Chengcheng Han, Chengyu Wang, Chuanqi Tan, Minghui Qiu, Songfang Huang, Jun Huang, Ming Gao

    Abstract: Few-shot Named Entity Recognition (NER) aims to identify named entities with very little annotated data. Previous methods solve this problem based on token-wise classification, which ignores the information of entity boundaries, and inevitably the performance is affected by the massive non-entity tokens. To this end, we propose a seminal span-based prototypical network (SpanProto) that tackles few… ▽ More

    Submitted 21 November, 2022; v1 submitted 17 October, 2022; originally announced October 2022.

  45. arXiv:2210.08536  [pdf, other

    cs.CL

    Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding

    Authors: Jianing Wang, Wenkang Huang, Qiuhui Shi, Hongbin Wang, Minghui Qiu, Xiang Li, Ming Gao

    Abstract: Knowledge-enhanced Pre-trained Language Model (PLM) has recently received significant attention, which aims to incorporate factual knowledge into PLMs. However, most existing methods modify the internal structures of fixed types of PLMs by stacking complicated modules, and introduce redundant and irrelevant factual knowledge from knowledge bases (KBs). In this paper, to address these problems, we… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

    Comments: 14 pages, 5 figures. This paper has been accepted for the main conference of EMNLP2022 (long paper)

  46. arXiv:2207.08814  [pdf, other

    cs.AI cs.HC

    PBRE: A Rule Extraction Method from Trained Neural Networks Designed for Smart Home Services

    Authors: Mingming Qiu, Elie Najm, Remi Sharrock, Bruno Traverson

    Abstract: Designing smart home services is a complex task when multiple services with a large number of sensors and actuators are deployed simultaneously. It may rely on knowledge-based or data-driven approaches. The former can use rule-based methods to design services statically, and the latter can use learning methods to discover inhabitants' preferences dynamically. However, neither of these approaches i… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

  47. arXiv:2206.13752  [pdf, other

    cs.IT eess.SP

    Sub-Block Rearranged Staircase Codes for Optical Transport Networks

    Authors: Min Qiu, Jinhong Yuan

    Abstract: We propose a new family of spatially coupled product codes, called sub-block rearranged staircase (SR-staircase) codes. Each SR-staircase code block is constructed by encoding rearranged preceding code blocks and new information blocks, where the rearrangement involves sub-blocks decomposition and transposition. The proposed codes can be constructed to have each code block size of $1/q$ to that of… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: 6 pages, 3 figures, 1 table, accepted by the 2022 IEEE International Symposium on Information Theory (ISIT). arXiv admin note: substantial text overlap with arXiv:2201.09415

  48. arXiv:2205.05313  [pdf, other

    cs.CL cs.AI

    Towards Unified Prompt Tuning for Few-shot Text Classification

    Authors: Jianing Wang, Chengyu Wang, Fuli Luo, Chuanqi Tan, Minghui Qiu, Fei Yang, Qiuhui Shi, Songfang Huang, Ming Gao

    Abstract: Prompt-based fine-tuning has boosted the performance of Pre-trained Language Models (PLMs) on few-shot text classification by employing task-specific prompts. Yet, PLMs are unfamiliar with prompt-style expressions during pre-training, which limits the few-shot learning performance on downstream tasks. It would be desirable if the models can acquire some prompting knowledge before adaptation to spe… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

  49. arXiv:2205.03071  [pdf, other

    cs.CL cs.AI

    KECP: Knowledge Enhanced Contrastive Prompting for Few-shot Extractive Question Answering

    Authors: Jianing Wang, Chengyu Wang, Minghui Qiu, Qiuhui Shi, Hongbin Wang, Jun Huang, Ming Gao

    Abstract: Extractive Question Answering (EQA) is one of the most important tasks in Machine Reading Comprehension (MRC), which can be solved by fine-tuning the span selecting heads of Pre-trained Language Models (PLMs). However, most existing approaches for MRC may perform poorly in the few-shot learning scenario. To solve this issue, we propose a novel framework named Knowledge Enhanced Contrastive Prompt-… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

  50. arXiv:2205.00258  [pdf, other

    cs.CL

    EasyNLP: A Comprehensive and Easy-to-use Toolkit for Natural Language Processing

    Authors: Chengyu Wang, Minghui Qiu, Chen Shi, Taolin Zhang, Tingting Liu, Lei Li, Jianing Wang, Ming Wang, Jun Huang, Wei Lin

    Abstract: The success of Pre-Trained Models (PTMs) has reshaped the development of Natural Language Processing (NLP). Yet, it is not easy to obtain high-performing models and deploy them online for industrial practitioners. To bridge this gap, EasyNLP is designed to make it easy to build NLP applications, which supports a comprehensive suite of NLP algorithms. It further features knowledge-enhanced pre-trai… ▽ More

    Submitted 13 March, 2023; v1 submitted 30 April, 2022; originally announced May 2022.

    Comments: 8 pages