Skip to main content

Showing 1–50 of 247 results for author: Bhattacharya, S

  1. arXiv:2407.13377  [pdf, other

    cs.CL cs.AI eess.AS

    Linear-Complexity Self-Supervised Learning for Speech Processing

    Authors: Shucong Zhang, Titouan Parcollet, Rogier van Dalen, Sourav Bhattacharya

    Abstract: Self-supervised learning (SSL) models usually require weeks of pre-training with dozens of high-end GPUs. These models typically have a multi-headed self-attention (MHSA) context encoder. However, MHSA takes quadratic time and space in the input length, contributing to the high pre-training cost. Linear-complexity alternatives to MHSA have been proposed. For instance, in supervised training, the S… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: Interspeech 2024

  2. arXiv:2407.12063  [pdf, other

    physics.soc-ph cs.SI

    Unveiling Scaling Laws in the Regulatory Functions of Reddit

    Authors: Shambhobi Bhattacharya, Jisung Yoon, Hyejin Youn

    Abstract: Online platforms like Reddit, Wikipedia, and Facebook are integral to modern life, enabling content creation and sharing through posts, comments, and discussions. Despite their virtual and often anonymous nature, these platforms need rules and oversight to maintain a safe and productive environment. As these communities grow, a key question arises: how does the need for regulatory functions scale?… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  3. arXiv:2407.05141  [pdf, other

    cs.LG cs.DC

    Impact of Network Topology on Byzantine Resilience in Decentralized Federated Learning

    Authors: Siddhartha Bhattacharya, Daniel Helo, Joshua Siegel

    Abstract: Federated learning (FL) enables a collaborative environment for training machine learning models without sharing training data between users. This is typically achieved by aggregating model gradients on a central server. Decentralized federated learning is a rising paradigm that enables users to collaboratively train machine learning models in a peer-to-peer manner, without the need for a central… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: 8 pages, 6 figures

    ACM Class: I.2.11; C.4; C.2.4

  4. arXiv:2406.15527  [pdf, other

    cs.LG cs.CL

    Data Efficient Evaluation of Large Language Models and Text-to-Image Models via Adaptive Sampling

    Authors: Cong Xu, Gayathri Saranathan, Mahammad Parwez Alam, Arpit Shah, James Lim, Soon Yee Wong, Foltin Martin, Suparna Bhattacharya

    Abstract: Evaluating LLMs and text-to-image models is a computationally intensive task often overlooked. Efficient evaluation is crucial for understanding the diverse capabilities of these models and enabling comparisons across a growing number of new models and benchmarks. To address this, we introduce SubLIME, a data-efficient evaluation framework that employs adaptive sampling techniques, such as cluster… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  5. arXiv:2406.13944  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Generalization error of min-norm interpolators in transfer learning

    Authors: Yanke Song, Sohom Bhattacharya, Pragya Sur

    Abstract: This paper establishes the generalization error of pooled min-$\ell_2$-norm interpolation in transfer learning where data from diverse distributions are available. Min-norm interpolators emerge naturally as implicit regularized limits of modern machine learning algorithms. Previous work characterized their out-of-distribution risk when samples from the test distribution are unavailable during trai… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 53 pages, 2 figures

  6. arXiv:2405.19164  [pdf, other

    cs.AI cs.IR

    Learning from Litigation: Graphs and LLMs for Retrieval and Reasoning in eDiscovery

    Authors: Sounak Lahiri, Sumit Pai, Tim Weninger, Sanmitra Bhattacharya

    Abstract: Electronic Discovery (eDiscovery) involves identifying relevant documents from a vast collection based on legal production requests. The integration of artificial intelligence (AI) and natural language processing (NLP) has transformed this process, helping document review and enhance efficiency and cost-effectiveness. Although traditional approaches like BM25 or fine-tuned pre-trained models are c… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 8 pages, 2 tables, 6 figures

  7. arXiv:2405.15449  [pdf, ps, other

    cs.DS

    Faster $(Δ+ 1)$-Edge Coloring: Breaking the $m \sqrt{n}$ Time Barrier

    Authors: Sayan Bhattacharya, Din Carmon, Martín Costa, Shay Solomon, Tianyi Zhang

    Abstract: Vizing's theorem states that any $n$-vertex $m$-edge graph of maximum degree $Δ$ can be {\em edge colored} using at most $Δ+ 1$ different colors [Diskret.~Analiz, '64]. Vizing's original proof is algorithmic and shows that such an edge coloring can be found in $\tilde{O}(mn)$ time. This was subsequently improved to $\tilde O(m\sqrt{n})$, independently by Arjomandi [1982] and by Gabow et al.~[1985]… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Started to circulate in April 2024

  8. arXiv:2404.14855  [pdf, other

    cs.LG cs.CG

    The Geometry of the Set of Equivalent Linear Neural Networks

    Authors: Jonathan Richard Shewchuk, Sagnik Bhattacharya

    Abstract: We characterize the geometry and topology of the set of all weight vectors for which a linear neural network computes the same linear transformation $W$. This set of weight vectors is called the fiber of $W$ (under the matrix multiplication map), and it is embedded in the Euclidean weight space of all possible weight vectors. The fiber is an algebraic variety that is not necessarily a manifold. We… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 99 pages, 14 figures

  9. arXiv:2404.13855  [pdf, other

    cs.CL

    Understanding the role of FFNs in driving multilingual behaviour in LLMs

    Authors: Sunit Bhattacharya, Ondřej Bojar

    Abstract: Multilingualism in Large Language Models (LLMs) is an yet under-explored area. In this paper, we conduct an in-depth analysis of the multilingual capabilities of a family of a Large Language Model, examining its architecture, activation patterns, and processing mechanisms across languages. We introduce novel metrics to probe the model's multilingual behaviour at different layers and shed light on… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 10 pages

  10. arXiv:2404.07645  [pdf, other

    cs.CV

    Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos

    Authors: Soumyabrata Chaudhuri, Saumik Bhattacharya

    Abstract: Skeleton Action Recognition (SAR) involves identifying human actions using skeletal joint coordinates and their interconnections. While plain Transformers have been attempted for this task, they still fall short compared to the current leading methods, which are rooted in Graph Convolutional Networks (GCNs) due to the absence of structural priors. Recently, a novel selective state space model, Mam… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 20 pages, 6 tables, 1 figure

  11. arXiv:2403.05174  [pdf, other

    cs.LG

    VTruST: Controllable value function based subset selection for Data-Centric Trustworthy AI

    Authors: Soumi Das, Shubhadip Nag, Shreyyash Sharma, Suparna Bhattacharya, Sourangshu Bhattacharya

    Abstract: Trustworthy AI is crucial to the widespread adoption of AI in high-stakes applications with fairness, robustness, and accuracy being some of the key trustworthiness metrics. In this work, we propose a controllable framework for data-centric trustworthy AI (DCTAI)- VTruST, that allows users to control the trade-offs between the different trustworthiness metrics of the constructed training datasets.… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted in ICLR 2024 DMLR workshop

  12. arXiv:2403.03767  [pdf, other

    physics.chem-ph cs.LG

    Predicting the Temperature Dependence of Surfactant CMCs Using Graph Neural Networks

    Authors: Christoforos Brozos, Jan G. Rittig, Sandip Bhattacharya, Elie Akanny, Christina Kohlmann, Alexander Mitsos

    Abstract: The critical micelle concentration (CMC) of surfactant molecules is an essential property for surfactant applications in industry. Recently, classical QSPR and Graph Neural Networks (GNNs), a deep learning technique, have been successfully applied to predict the CMC of surfactants at room temperature. However, these models have not yet considered the temperature dependency of the CMC, which is hig… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  13. arXiv:2403.02765  [pdf, other

    cs.LG q-bio.BM

    G4-Attention: Deep Learning Model with Attention for predicting DNA G-Quadruplexes

    Authors: Shrimon Mukherjee, Pulakesh Pramanik, Partha Basuchowdhuri, Santanu Bhattacharya

    Abstract: G-Quadruplexes are the four-stranded non-canonical nucleic acid secondary structures, formed by the stacking arrangement of the guanine tetramers. They are involved in a wide range of biological roles because of their exceptionally unique and distinct structural characteristics. After the completion of the human genome sequencing project, a lot of bioinformatic algorithms were introduced to predic… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  14. arXiv:2402.17778  [pdf, other

    eess.SP cs.AI eess.SY

    Dynamic Anchor Selection and Real-Time Pose Prediction for Ultra-wideband Tagless Gate

    Authors: Junyoung Choi, Sagnik Bhattacharya, Joohyun Lee

    Abstract: Ultra-wideband (UWB) is emerging as a promising solution that can realize proximity services, such as UWB tagless gate (UTG), thanks to centimeter-level localization accuracy based on two different ranging methods such as downlink time-difference of arrival (DL-TDoA) and double-sided two-way ranging (DS-TWR). The UTG is a UWB-based proximity service that provides a seamless gate pass system withou… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2402.08399

  15. arXiv:2402.13952  [pdf, ps, other

    cs.CC

    Aaronson-Ambainis Conjecture Is True For Random Restrictions

    Authors: Sreejata Kishor Bhattacharya

    Abstract: In an attempt to show that the acceptance probability of a quantum query algorithm making $q$ queries can be well-approximated almost everywhere by a classical decision tree of depth $\leq \text{poly}(q)$, Aaronson and Ambainis proposed the following conjecture: let $f: \{ \pm 1\}^n \rightarrow [0,1]$ be a degree $d$ polynomial with variance $\geq ε$. Then, there exists a coordinate of $f$ with in… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  16. arXiv:2402.10515  [pdf, other

    eess.SP cs.AI

    Power-Efficient Indoor Localization Using Adaptive Channel-aware Ultra-wideband DL-TDOA

    Authors: Sagnik Bhattacharya, Junyoung Choi, Joohyun Lee

    Abstract: Among the various Ultra-wideband (UWB) ranging methods, the absence of uplink communication or centralized computation makes downlink time-difference-of-arrival (DL-TDOA) localization the most suitable for large-scale industrial deployments. However, temporary or permanent obstacles in the deployment region often lead to non-line-of-sight (NLOS) channel path and signal outage effects, which result… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Journal ref: IEEE GLOBECOM 2023

  17. arXiv:2402.08399  [pdf, other

    eess.SP cs.NI

    Deep Learning-based Real-time Smartphone Pose Detection for Ultra-wideband Tagless Gate

    Authors: Junyoung Choi, Sagnik Bhattacharya

    Abstract: As commercial interest in proximity services increased, the development of various wireless localization techniques was promoted. In line with this trend, Ultra-wideband (UWB) is emerging as a promising solution that can realize proximity services thanks to centimeter-level localization accuracy. In addition, since the actual location of the mobile device (MD) on the human body, called pose, affec… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Journal ref: IEEE GLOBECOM 2023

  18. arXiv:2402.06738  [pdf, other

    cs.CL

    EntGPT: Linking Generative Large Language Models with Knowledge Bases

    Authors: Yifan Ding, Amrit Poudel, Qingkai Zeng, Tim Weninger, Balaji Veeramani, Sanmitra Bhattacharya

    Abstract: The ability of Large Language Models (LLMs) to generate factually correct output remains relatively unexplored due to the lack of fact-checking and knowledge grounding during training and inference. In this work, we aim to address this challenge through the Entity Disambiguation (ED) task. We first consider prompt engineering, and design a three-step hard-prompting method to probe LLMs' ED perform… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  19. arXiv:2402.04364  [pdf, ps, other

    cs.CC

    Exponential Separation Between Powers of Regular and General Resolution Over Parities

    Authors: Sreejata Kishor Bhattacharya, Arkadev Chattopadhyay, Pavel Dvořák

    Abstract: Proving super-polynomial lower bounds on the size of proofs of unsatisfiability of Boolean formulas using resolution over parities is an outstanding problem that has received a lot of attention after its introduction by Raz and Tzamaret [Ann. Pure Appl. Log.'08]. Very recently, Efremenko, Garlík and Itsykson [ECCC'23] proved the first exponential lower bounds on the size of ResLin proofs that were… ▽ More

    Submitted 23 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  20. arXiv:2401.09243  [pdf, other

    cs.RO cs.AI cs.LG

    DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning

    Authors: Sabariswaran Mani, Sreyas Venkataraman, Abhranil Chandra, Adyan Rizvi, Yash Sirvi, Soumojit Bhattacharya, Aritra Hazra

    Abstract: Robot learning tasks are extremely compute-intensive and hardware-specific. Thus the avenues of tackling these challenges, using a diverse dataset of offline demonstrations that can be used to train robot manipulation agents, is very appealing. The Train-Offline-Test-Online (TOTO) Benchmark provides a well-curated open-source dataset for offline training comprised mostly of expert data and also be… ▽ More

    Submitted 23 May, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: NeurIPS 2023 Train Offline Test Online Workshop and Competition (Best Paper Oral Presentation / Winning Competition Submission)

  21. arXiv:2401.01874  [pdf, other

    physics.chem-ph cs.LG

    Graph Neural Networks for Surfactant Multi-Property Prediction

    Authors: Christoforos Brozos, Jan G. Rittig, Sandip Bhattacharya, Elie Akanny, Christina Kohlmann, Alexander Mitsos

    Abstract: Surfactants are of high importance in different industrial sectors such as cosmetics, detergents, oil recovery and drug delivery systems. Therefore, many quantitative structure-property relationship (QSPR) models have been developed for surfactants. Each predictive model typically focuses on one surfactant class, mostly nonionics. Graph Neural Networks (GNNs) have exhibited a great predictive perf… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  22. arXiv:2401.01008  [pdf, other

    cs.CV cs.AI

    Fast Sampling Through The Reuse Of Attention Maps In Diffusion Models

    Authors: Rosco Hunter, Łukasz Dudziak, Mohamed S. Abdelfattah, Abhinav Mehrotra, Sourav Bhattacharya, Hongkai Wen

    Abstract: Text-to-image diffusion models have demonstrated unprecedented capabilities for flexible and realistic image synthesis. Nevertheless, these models rely on a time-consuming sampling procedure, which has motivated attempts to reduce their latency. When improving efficiency, researchers often use the original diffusion model to train an additional network designed specifically for fast image generati… ▽ More

    Submitted 24 May, 2024; v1 submitted 13 December, 2023; originally announced January 2024.

  23. arXiv:2312.17726  [pdf, ps, other

    cs.CR cs.SE

    Comparing Effectiveness and Efficiency of Interactive Application Security Testing (IAST) and Runtime Application Self-Protection (RASP) Tools in a Large Java-based System

    Authors: Aishwarya Seth, Saikath Bhattacharya, Sarah Elder, Nusrat Zahan, Laurie Williams

    Abstract: Security resources are scarce, and practitioners need guidance in the effective and efficient usage of techniques and tools available in the cybersecurity industry. Two emerging tool types, Interactive Application Security Testing (IAST) and Runtime Application Self-Protection (RASP), have not been thoroughly evaluated against well-established counterparts such as Dynamic Application Security Test… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

  24. arXiv:2312.04825  [pdf, other

    cs.RO math.AT

    Weighted Combinatorial Laplacian and its Application to Coverage Repair in Sensor Networks

    Authors: Shunsaku Yadokoro, Subhrajit Bhattacharya

    Abstract: We define the weighted combinatorial Laplacian operators on a simplicial complex and investigate their spectral properties. Eigenvalues close to zero and the corresponding eigenvectors of them are especially of our interest, and we show that they can detect almost $n$-dimensional holes in the given complex. Real-valued weights on simplices allow gradient descent based optimization, which in turn g… ▽ More

    Submitted 14 April, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 26 pages, 11 figures

  25. arXiv:2311.12046  [pdf, other

    eess.IV cs.CV

    LATIS: Lambda Abstraction-based Thermal Image Super-resolution

    Authors: Gargi Panda, Soumitra Kundu, Saumik Bhattacharya, Aurobinda Routray

    Abstract: Single image super-resolution (SISR) is an effective technique to improve the quality of low-resolution thermal images. Recently, transformer-based methods have achieved significant performance in SISR. However, in the SR task, only a small number of pixels are involved in the transformers self-attention (SA) mechanism due to the computational complexity of the attention mechanism. The lambda abst… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  26. arXiv:2311.08367  [pdf, other

    cs.DS

    Arboricity-Dependent Algorithms for Edge Coloring

    Authors: Sayan Bhattacharya, Martín Costa, Nadav Panski, Shay Solomon

    Abstract: The problem of edge coloring has been extensively studied over the years. Recently, this problem has received significant attention in the dynamic setting, where we are given a dynamic graph evolving via a sequence of edge insertions and deletions and our objective is to maintain an edge coloring of the graph. Currently, it is not known whether it is possible to maintain a $(Δ+ O(Δ^{1 - μ}))$-ed… ▽ More

    Submitted 7 February, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: Started to circulate in September 2023

  27. arXiv:2311.08027  [pdf, other

    cs.CR

    A practical key-recovery attack on LWE-based key-encapsulation mechanism schemes using Rowhammer

    Authors: Puja Mondal, Suparna Kundu, Sarani Bhattacharya, Angshuman Karmakar, Ingrid Verbauwhede

    Abstract: Physical attacks are serious threats to cryptosystems deployed in the real world. In this work, we propose a microarchitectural end-to-end attack methodology on generic lattice-based post-quantum key encapsulation mechanisms to recover the long-term secret key. Our attack targets a critical component of a Fujisaki-Okamoto transform that is used in the construction of almost all lattice-based key e… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    ACM Class: E.3.3

  28. arXiv:2311.03267  [pdf, ps, other

    cs.DS

    Nibbling at Long Cycles: Dynamic (and Static) Edge Coloring in Optimal Time

    Authors: Sayan Bhattacharya, Martín Costa, Nadav Panski, Shay Solomon

    Abstract: We consider the problem of maintaining a $(1+ε)Δ$-edge coloring in a dynamic graph $G$ with $n$ nodes and maximum degree at most $Δ$. The state-of-the-art update time is $O_ε(\text{polylog}(n))$, by Duan, He and Zhang [SODA'19] and by Christiansen [STOC'23], and more precisely $O(\log^7 n/ε^2)$, where $Δ= Ω(\log^2 n / ε^2)$. The following natural question arises: What is the best possible update… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted at SODA 2024

  29. arXiv:2311.03084  [pdf, other

    cs.CL cs.AI

    A Simple yet Efficient Ensemble Approach for AI-generated Text Detection

    Authors: Harika Abburi, Kalyani Roy, Michael Suesserman, Nirmala Pudota, Balaji Veeramani, Edward Bowen, Sanmitra Bhattacharya

    Abstract: Recent Large Language Models (LLMs) have demonstrated remarkable capabilities in generating text that closely resembles human writing across wide range of styles and genres. However, such capabilities are prone to potential abuse, such as fake news generation, spam email creation, and misuse in academic assignments. Hence, it is essential to build automated approaches capable of distinguishing bet… ▽ More

    Submitted 7 November, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

  30. arXiv:2311.01534  [pdf, other

    cs.MA cs.AI cs.RO

    Approximate Multiagent Reinforcement Learning for On-Demand Urban Mobility Problem on a Large Map (extended version)

    Authors: Daniel Garces, Sushmita Bhattacharya, Dimitri Bertsekas, Stephanie Gil

    Abstract: In this paper, we focus on the autonomous multiagent taxi routing problem for a large urban environment where the location and number of future ride requests are unknown a-priori, but can be estimated by an empirical distribution. Recent theory has shown that a rollout algorithm with a stable base policy produces a near-optimal stable policy. In the routing setting, a policy is stable if its execu… ▽ More

    Submitted 8 March, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: 11 pages, 5 figures, 1 lemma, and 2 theorems

  31. arXiv:2310.20638  [pdf, other

    cs.CV cs.AI q-bio.TO

    Histopathological Image Analysis with Style-Augmented Feature Domain Mixing for Improved Generalization

    Authors: Vaibhav Khamankar, Sutanu Bera, Saumik Bhattacharya, Debashis Sen, Prabir Kumar Biswas

    Abstract: Histopathological images are essential for medical diagnosis and treatment planning, but interpreting them accurately using machine learning can be challenging due to variations in tissue preparation, staining and imaging protocols. Domain generalization aims to address such limitations by enabling the learning models to generalize to new datasets or populations. Style transfer-based data augmenta… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: Paper is published in MedAGI 2023 (MICCAI 2023 1st International Workshop on Foundation Models for General Medical AI) Code link: https://github.com/Vaibhav-Khamankar/FuseStyle Paper link: https://nbviewer.org/github/MedAGI/medagi.github.io/blob/main/src/assets/papers/P17.pdf

  32. arXiv:2310.18371  [pdf, ps, other

    cs.CL cs.AI

    In-Context Ability Transfer for Question Decomposition in Complex QA

    Authors: Venktesh V, Sourangshu Bhattacharya, Avishek Anand

    Abstract: Answering complex questions is a challenging task that requires question decomposition and multistep reasoning for arriving at the solution. While existing supervised and unsupervised approaches are specialized to a certain task and involve training, recently proposed prompt-based approaches offer generalizable solutions to tackle a wide variety of complex question-answering (QA) tasks. However, e… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 10 pages

  33. arXiv:2310.17420  [pdf, other

    cs.DS

    Fully Dynamic $k$-Clustering in $\tilde O(k)$ Update Time

    Authors: Sayan Bhattacharya, Martín Costa, Silvio Lattanzi, Nikos Parotsidis

    Abstract: We present a $O(1)$-approximate fully dynamic algorithm for the $k$-median and $k$-means problems on metric spaces with amortized update time $\tilde O(k)$ and worst-case query time $\tilde O(k^2)$. We complement our theoretical analysis with the first in-depth experimental study for the dynamic $k$-median problem on general metrics, focusing on comparing our dynamic algorithm to the current state… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS 2023

  34. arXiv:2310.15552  [pdf, other

    cs.CL

    Unveiling Multilinguality in Transformer Models: Exploring Language Specificity in Feed-Forward Networks

    Authors: Sunit Bhattacharya, Ondrej Bojar

    Abstract: Recent research suggests that the feed-forward module within Transformers can be viewed as a collection of key-value memories, where the keys learn to capture specific patterns from the input based on the training examples. The values then combine the output from the 'memories' of the keys to generate predictions about the next token. This leads to an incremental process of prediction that gradual… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  35. arXiv:2310.05172  [pdf, other

    cs.CR cs.AR

    On the Amplification of Cache Occupancy Attacks in Randomized Cache Architectures

    Authors: Anirban Chakraborty, Nimish Mishra, Sayandeep Saha, Sarani Bhattacharya, Debdeep Mukhopadhyay

    Abstract: In this work, we explore the applicability of cache occupancy attacks and the implications of secured cache design rationales on such attacks. In particular, we show that one of the well-known cache randomization schemes, MIRAGE, touted to be resilient against eviction-based attacks, amplifies the chances of cache occupancy attack, making it more vulnerable compared to contemporary designs. We lev… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

  36. arXiv:2310.04453  [pdf, other

    cs.CL cs.LG cs.SI

    COVID-19 South African Vaccine Hesitancy Models Show Boost in Performance Upon Fine-Tuning on M-pox Tweets

    Authors: Nicholas Perikli, Srimoy Bhattacharya, Blessing Ogbuokiri, Zahra Movahedi Nia, Benjamin Lieberman, Nidhi Tripathi, Salah-Eddine Dahbi, Finn Stevenson, Nicola Bragazzi, Jude Kong, Bruce Mellado

    Abstract: Very large numbers of M-pox cases have, since the start of May 2022, been reported in non-endemic countries leading many to fear that the M-pox Outbreak would rapidly transition into another pandemic, while the COVID-19 pandemic ravages on. Given the similarities of M-pox with COVID-19, we chose to test the performance of COVID-19 models trained on South African twitter data on a hand-labelled M-p… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  37. arXiv:2310.00917  [pdf, other

    cs.CV

    Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance

    Authors: Alloy Das, Sanket Biswas, Ayan Banerjee, Josep Lladós, Umapada Pal, Saumik Bhattacharya

    Abstract: The adaptation capability to a wide range of domains is crucial for scene text spotting models when deployed to real-world conditions. However, existing state-of-the-art (SOTA) approaches usually incorporate scene text detection and recognition simply by pretraining on natural scene text datasets, which do not directly exploit the intermediate feature representations between multiple domains. Here… ▽ More

    Submitted 1 November, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted to the 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

  38. arXiv:2309.07755  [pdf, other

    cs.CL cs.AI

    Generative AI Text Classification using Ensemble LLM Approaches

    Authors: Harika Abburi, Michael Suesserman, Nirmala Pudota, Balaji Veeramani, Edward Bowen, Sanmitra Bhattacharya

    Abstract: Large Language Models (LLMs) have shown impressive performance across a variety of Artificial Intelligence (AI) and natural language processing tasks, such as content creation, report generation, etc. However, unregulated malign application of these models can create undesirable consequences such as generation of fake news, plagiarism, etc. As a result, accurate detection of AI-generated language… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  39. Finding the Perfect Fit: Applying Regression Models to ClimateBench v1.0

    Authors: Anmol Chaure, Ashok Kumar Behera, Sudip Bhattacharya

    Abstract: Climate projections using data driven machine learning models acting as emulators, is one of the prevailing areas of research to enable policy makers make informed decisions. Use of machine learning emulators as surrogates for computationally heavy GCM simulators reduces time and carbon footprints. In this direction, ClimateBench [1] is a recently curated benchmarking dataset for evaluating the pe… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Journal ref: International Journal of Computer Applications 185(29):31-39, August 2023

  40. arXiv:2308.09104  [pdf, other

    stat.ML cs.LG stat.ME

    A comprehensive study of spike and slab shrinkage priors for structurally sparse Bayesian neural networks

    Authors: Sanket Jantre, Shrijita Bhattacharya, Tapabrata Maiti

    Abstract: Network complexity and computational efficiency have become increasingly significant aspects of deep learning. Sparse deep learning addresses these challenges by recovering a sparse representation of the underlying target function by reducing heavily over-parameterized deep neural networks. Specifically, deep neural architectures compressed via structured sparsity (e.g. node sparsity) provide low… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  41. arXiv:2308.08917  [pdf, other

    cs.IT eess.SP

    Unfolding for Joint Channel Estimation and Symbol Detection in MIMO Communication Systems

    Authors: Swati Bhattacharya, K. V. S. Hari, Yonina C. Eldar

    Abstract: This paper proposes a Joint Channel Estimation and Symbol Detection (JED) scheme for Multiple-Input Multiple-Output (MIMO) wireless communication systems. Our proposed method for JED using Alternating Direction Method of Multipliers (JED-ADMM) and its model-based neural network version JED using Unfolded ADMM (JED-U-ADMM) markedly improve the symbol detection performance over JED using Alternating… ▽ More

    Submitted 21 August, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: 14 pages, 19 figures, submitted to IEEE Transactions on Signal Processing

  42. arXiv:2308.04178  [pdf, ps, other

    cs.AI cs.RO

    Assistive Chatbots for healthcare: a succinct review

    Authors: Basabdatta Sen Bhattacharya, Vibhav Sinai Pissurlenkar

    Abstract: Artificial Intelligence (AI) for supporting healthcare services has never been more necessitated than by the recent global pandemic. Here, we review the state-of-the-art in AI-enabled Chatbots in healthcare proposed during the last 10 years (2013-2023). The focus on AI-enabled technology is because of its potential for enhancing the quality of human-machine interaction via Chatbots, reducing depen… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  43. arXiv:2308.03908  [pdf, other

    cs.CV cs.AI cs.LG

    ViLP: Knowledge Exploration using Vision, Language, and Pose Embeddings for Video Action Recognition

    Authors: Soumyabrata Chaudhuri, Saumik Bhattacharya

    Abstract: Video Action Recognition (VAR) is a challenging task due to its inherent complexities. Though different approaches have been explored in the literature, designing a unified framework to recognize a large number of human actions is still a challenging problem. Recently, Multi-Modal Learning (MML) has demonstrated promising results in this domain. In literature, 2D skeleton or pose modality has ofte… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 7 pages, 3 figures, 2 Tables

  44. arXiv:2308.02905  [pdf, other

    cs.CV cs.MM

    FAST: Font-Agnostic Scene Text Editing

    Authors: Alloy Das, Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal, Michael Blumenstein

    Abstract: Scene Text Editing (STE) is a challenging research problem, and it aims to modify existing texts in an image while preserving the background and the font style of the original text of the image. Due to its various real-life applications, researchers have explored several approaches toward STE in recent years. However, most of the existing STE methods show inferior editing performance because of (1… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: 13 pages, in submission

  45. arXiv:2308.01140  [pdf, other

    cs.LG cs.CV

    Dynamically Scaled Temperature in Self-Supervised Contrastive Learning

    Authors: Siladittya Manna, Soumitri Chattopadhyay, Rakesh Dey, Saumik Bhattacharya, Umapada Pal

    Abstract: In contemporary self-supervised contrastive algorithms like SimCLR, MoCo, etc., the task of balancing attraction between two semantically similar samples and repulsion between two samples of different classes is primarily affected by the presence of hard negative samples. While the InfoNCE loss has been shown to impose penalties based on hardness, the temperature hyper-parameter is the key to regu… ▽ More

    Submitted 10 May, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

  46. Shared Information for a Markov Chain on a Tree

    Authors: Sagnik Bhattacharya, Prakash Narayan

    Abstract: Shared information is a measure of mutual dependence among multiple jointly distributed random variables with finite alphabets. For a Markov chain on a tree with a given joint distribution, we give a new proof of an explicit characterization of shared information. The Markov chain on a tree is shown to possess a global Markov property based on graph separation; this property plays a key role in ou… ▽ More

    Submitted 21 January, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: 13 pages, 4 figures, submitted to IEEE Transactions on Information Theory

  47. arXiv:2307.15072  [pdf, other

    cs.CY cs.CL cs.LG cs.SI

    Detecting the Presence of COVID-19 Vaccination Hesitancy from South African Twitter Data Using Machine Learning

    Authors: Nicholas Perikli, Srimoy Bhattacharya, Blessing Ogbuokiri, Zahra Movahedi Nia, Benjamin Lieberman, Nidhi Tripathi, Salah-Eddine Dahbi, Finn Stevenson, Nicola Bragazzi, Jude Kong, Bruce Mellado

    Abstract: Very few social media studies have been done on South African user-generated content during the COVID-19 pandemic and even fewer using hand-labelling over automated methods. Vaccination is a major tool in the fight against the pandemic, but vaccine hesitancy jeopardizes any public health effort. In this study, sentiment analysis on South African tweets related to vaccine hesitancy was performed, w… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

  48. arXiv:2307.07421  [pdf, other

    cs.CL cs.SD eess.AS

    SummaryMixing: A Linear-Complexity Alternative to Self-Attention for Speech Recognition and Understanding

    Authors: Titouan Parcollet, Rogier van Dalen, Shucong Zhang, Sourav Bhattacharya

    Abstract: Modern speech processing systems rely on self-attention. Unfortunately, token mixing with self-attention takes quadratic time in the length of the speech utterance, slowing down inference and training and increasing memory consumption. Cheaper alternatives to self-attention for ASR have been developed, but they fail to consistently reach the same level of accuracy. This paper, therefore, proposes… ▽ More

    Submitted 11 July, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Interspeech 2024

  49. arXiv:2307.06659  [pdf, other

    cs.CR cs.CV cs.CY

    A Comprehensive Analysis of Blockchain Applications for Securing Computer Vision Systems

    Authors: Ramalingam M, Chemmalar Selvi, Nancy Victor, Rajeswari Chengoden, Sweta Bhattacharya, Praveen Kumar Reddy Maddikunta, Duehee Lee, Md. Jalil Piran, Neelu Khare, Gokul Yendri, Thippa Reddy Gadekallu

    Abstract: Blockchain (BC) and Computer Vision (CV) are the two emerging fields with the potential to transform various sectors.The ability of BC can help in offering decentralized and secure data storage, while CV allows machines to learn and understand visual data. This integration of the two technologies holds massive promise for developing innovative applications that can provide solutions to the challen… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  50. arXiv:2307.02415  [pdf, ps, other

    cs.DS

    Density-Sensitive Algorithms for $(Δ+ 1)$-Edge Coloring

    Authors: Sayan Bhattacharya, Martín Costa, Nadav Panski, Shay Solomon

    Abstract: Vizing's theorem asserts the existence of a {$(Δ+1)$-edge coloring} for any graph $G$, where $Δ= Δ(G)$ denotes the maximum degree of $G$. Several polynomial time $(Δ+1)$-edge coloring algorithms are known, and the state-of-the-art running time (up to polylogarithmic factors) is $\tilde{O}(\min\{m \cdot \sqrt{n}, m \cdot Δ\})$, by Gabow et al.\ from 1985, where $n$ and $m$ denote the number of vert… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.