Showing 1–32 of 32 results for author: Balasubramanian, A

  1. arXiv:2407.07858  [pdf, other]

    cs.LG cs.CL

    FACTS About Building Retrieval Augmented Generation-based Chatbots

    Authors: Rama Akkiraju, Anbang Xu, Deepak Bora, Tan Yu, Lu An, Vishal Seth, Aaditya Shukla, Pritam Gundecha, Hridhay Mehta, Ashwin Jha, Prithvi Raj, Abhinav Balasubramanian, Murali Maram, Guru Muthusamy, Shivakesh Reddy Annepally, Sidney Knowles, Min Du, Nick Burnett, Sean Javiya, Ashok Marannan, Mamta Kumari, Surbhi Jha, Ethan Dereszenski, Anupam Chakraborty, Subhash Ranjan, et al. (13 additional authors not shown)

    Abstract: Enterprise chatbots, powered by generative AI, are emerging as key applications to enhance employee productivity. Retrieval Augmented Generation (RAG), Large Language Models (LLMs), and orchestration frameworks like Langchain and Llamaindex are crucial for building these chatbots. However, creating effective enterprise chatbots is challenging and requires meticulous RAG pipeline engineering. This…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 8 pages, 6 figures, 2 tables, Preprint submission to ACM CIKM 2024

  2. Predicting Visual Attention in Graphic Design Documents

    Authors: Souradeep Chakraborty, Zijun Wei, Conor Kelton, Seoyoung Ahn, Aruna Balasubramanian, Gregory J. Zelinsky, Dimitris Samaras

    Abstract: We present a model for predicting visual attention during the free viewing of graphic design documents. While existing works on this topic have aimed at predicting static saliency of graphic designs, our work is the first attempt to predict both spatial attention and dynamic temporal order in which the document regions are fixated by gaze using a deep learning based model. We propose a two-stage m…

    Submitted 2 July, 2024; originally announced July 2024.

    Journal ref: IEEE Transactions on Multimedia 25 (2022): 4478-4493

  3. arXiv:2405.11085  [pdf, ps, other]

    cs.LO

    Decidability and Complexity of Decision Problems for Affine Continuous VASS

    Authors: A. R. Balasubramanian

    Abstract: Vector addition system with states (VASS) is a popular model for the verification of concurrent systems. VASS consists of finitely many control states and a set of counters which can be incremented and decremented, but not tested for zero. VASS is a relatively well-studied model of computation and many results regarding the decidability of decision problems for VASS are well-known. Given that the…

    Submitted 17 May, 2024; originally announced May 2024.

  4. arXiv:2310.16798  [pdf, other]

    cs.FL cs.PL

    Reachability in Continuous Pushdown VASS

    Authors: A. R. Balasubramanian, Rupak Majumdar, Ramanathan S. Thinniyam, Georg Zetzsche

    Abstract: Pushdown Vector Addition Systems with States (PVASS) consist of finitely many control states, a pushdown stack, and a set of counters that can be incremented and decremented, but not tested for zero. Whether the reachability problem is decidable for PVASS is a long-standing open problem. We consider continuous PVASS, which are PVASS with a continuous semantics. This means, the counter values are…

    Submitted 31 October, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

  5. arXiv:2305.00417   

    cs.SD cs.CV eess.AS

    Transformer-based Sequence Labeling for Audio Classification based on MFCCs

    Authors: C. S. Sonali, Chinmayi B S, Ahana Balasubramanian

    Abstract: Audio classification is vital in areas such as speech and music recognition. Feature extraction from the audio signal, such as Mel-Spectrograms and MFCCs, is a critical step in audio classification. These features are transformed into spectrograms for classification. Researchers have explored various techniques, including traditional machine and deep learning methods to classify spectrograms, but…

    Submitted 5 July, 2023; v1 submitted 30 April, 2023; originally announced May 2023.

    Comments: Error in the explanation as well as inadequate results and conclusion

  6. arXiv:2304.13065  [pdf, other]

    cs.LO cs.DC

    Parameterized Verification of Coverability in Infinite State Broadcast Networks

    Authors: A. R. Balasubramanian

    Abstract: Parameterized verification of coverability in broadcast networks with finite state processes has been studied for different types of models and topologies. In this paper, we attempt to develop a theory of broadcast networks in which the processes can be well-structured transition systems. The resulting formalism is called well-structured broadcast networks. For various types of communication topol…

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: Full journal version of arXiv:1809.03099

  7. arXiv:2304.08917  [pdf, ps, other]

    cs.DC

    Coefficient Synthesis for Threshold Automata

    Authors: A. R. Balasubramanian

    Abstract: Threshold automata are a formalism for modeling fault-tolerant distributed algorithms. The main feature of threshold automata is the notion of a threshold guard, which allows us to compare the number of received messages with the total number of different types of processes. In this paper, we consider the coefficient synthesis problem for threshold automata, in which we are given a sketch of a thr…

    Submitted 18 April, 2023; originally announced April 2023.

  8. arXiv:2302.05771  [pdf, other]

    cs.NI

    Analyzing DCTCP and Cubic Buffer Sharing under Diverse Router Configurations

    Authors: Santiago Vargas, Aruna Balasubramanian, Srikanth Sundaresan

    Abstract: In this work, we look at the impact of router configurations on DCTCP and Cubic traffic when both algorithms share router buffers in the data center. Modern data centers host traffic with mixed congestion controls, including DCTCP and Cubic traffic. Both DCTCP and Cubic in the data center can compete with each other and potentially starve and/or be unfair to each other when sharing buffer space in…

    Submitted 11 February, 2023; originally announced February 2023.

  9. arXiv:2201.10432  [pdf, ps, other]

    cs.LO cs.MA

    Parameterized Analysis of Reconfigurable Broadcast Networks (Long Version)

    Authors: A. R. Balasubramanian, Lucie Guillou, Chana Weil-Kennedy

    Abstract: Reconfigurable broadcast networks (RBN) are a model of distributed computation in which agents can broadcast messages to other agents using some underlying communication topology which can change arbitrarily over the course of executions. In this paper, we conduct parameterized analysis of RBN. We consider cubes, (infinite) sets of configurations in the form of lower and upper bounds on the number…

    Submitted 11 July, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: This is the long version of a paper accepted at FoSSaCS 2022. Erratum: The proof of Theorem 2 contains a mistake, kindly pointed out by Nicolas Waldburger. We are working on a solution

  10. arXiv:2112.14704  [pdf, ps, other]

    cs.IT

    Efficient Data Exchange in Unmanned Aerial Vehicle Networks Utilizing Unsupervised Learning-Based Clustering

    Authors: Hao Song, Lingjia Liu, Ananth Balasubramanian

    Abstract: An unmanned aerial vehicle (UAV) network can serve as an aerial relay to periodically receive packets from macro base stations (BSs). Severe packet loss may happen especially when UAVs have bad wireless connections to a BS. In this paper, a data exchange scheme is proposed utilizing unsupervised learning to enable efficient lost packet retrieval through reliable wireless transmissions between UAVs…

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: 9 pages, 8 figures

  11. arXiv:2109.08315  [pdf, ps, other]

    cs.LO cs.DC cs.MA

    Reconfigurable Broadcast Networks and Asynchronous Shared-Memory Systems are Equivalent

    Authors: A. R. Balasubramanian, Chana Weil-Kennedy

    Abstract: We show the equivalence of two distributed computing models, namely reconfigurable broadcast networks (RBN) and asynchronous shared-memory systems (ASMS), that were introduced independently. Both RBN and ASMS are systems in which a collection of anonymous, finite-state processes run the same protocol. In RBN, the processes communicate by selective broadcast: a process can broadcast a message which…

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: In Proceedings GandALF 2021, arXiv:2109.07798. A long version of this paper, containing all proofs, appears at arXiv:2108.07510

    Journal ref: EPTCS 346, 2021, pp. 18-34

  12. arXiv:2108.07510  [pdf, ps, other]

    cs.LO cs.DC cs.MA

    Reconfigurable Broadcast Networks and Asynchronous Shared-Memory Systems are Equivalent (Long Version)

    Authors: A. R. Balasubramanian, Chana Weil-Kennedy

    Abstract: We show the equivalence of two distributed computing models, namely reconfigurable broadcast networks (RBN) and asynchronous shared-memory systems (ASMS), that were introduced independently. Both RBN and ASMS are systems in which a collection of anonymous, finite-state processes run the same protocol. In RBN, the processes communicate by selective broadcast: a process can broadcast a message which…

    Submitted 26 August, 2021; v1 submitted 17 August, 2021; originally announced August 2021.

    Comments: Long version of the paper accepted at GandALF 2021

  13. arXiv:2106.01199  [pdf, other]

    cs.CL

    IrEne: Interpretable Energy Prediction for Transformers

    Authors: Qingqing Cao, Yash Kumar Lal, Harsh Trivedi, Aruna Balasubramanian, Niranjan Balasubramanian

    Abstract: Existing software-based energy measurements of NLP models are not accurate because they do not consider the complex interactions between energy consumption and model execution. We present IrEne, an interpretable and extensible energy prediction system that accurately predicts the inference energy consumption of a wide range of Transformer-based NLP models. IrEne constructs a model tree graph that…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: ACL 2021 camera ready

  14. arXiv:2104.09716  [pdf, ps, other]

    cs.LO

    Decidability and Complexity in Weakening and Contraction Hypersequent Substructural Logics

    Authors: A. R. Balasubramanian, Timo Lang, Revantha Ramanayake

    Abstract: We establish decidability for the infinitely many axiomatic extensions of the commutative Full Lambek logic with weakening FLew (i.e. IMALLW) that have a cut-free hypersequent proof calculus (specifically: every analytic structural rule extension). Decidability for the corresponding extensions of its contraction counterpart FLec was established recently but their computational complexity was left…

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: Accepted for publication in the proceedings of LICS 2021

  15. arXiv:2102.06897  [pdf, other]

    cs.FL

    Adaptive Synchronisation of Pushdown Automata

    Authors: A. R. Balasubramanian, K. S. Thejaswini

    Abstract: We introduce the notion of adaptive synchronisation for pushdown automata, in which there is an external observer who has no knowledge about the current state of the pushdown automaton, but can observe the contents of the stack. The observer would then like to decide if it is possible to bring the automaton from any state into some predetermined state by giving inputs to it in an adaptive m…

    Submitted 13 February, 2021; originally announced February 2021.

    Comments: 29 pages, 5 figures

    MSC Class: 68Q45; 68Q17

  16. arXiv:2101.07344  [pdf, other]

    cs.LG cs.DC cs.PF

    Accelerating Deep Learning Inference via Learned Caches

    Authors: Arjun Balasubramanian, Adarsh Kumar, Yuhan Liu, Han Cao, Shivaram Venkataraman, Aditya Akella

    Abstract: Deep Neural Networks (DNNs) are witnessing increased adoption in multiple domains owing to their high accuracy in solving real-world problems. However, this high accuracy has been achieved by building deeper networks, posing a fundamental challenge to the low latency inference desired by user-facing applications. Current low latency solutions trade-off on accuracy or fail to exploit the inherent t…

    Submitted 18 January, 2021; originally announced January 2021.

  17. arXiv:2012.05818  [pdf, other]

    cs.IR cs.CL

    Bew: Towards Answering Business-Entity-Related Web Questions

    Authors: Qingqing Cao, Oriana Riva, Aruna Balasubramanian, Niranjan Balasubramanian

    Abstract: We present BewQA, a system specifically designed to answer a class of questions that we call Bew questions. Bew questions are related to businesses/services such as restaurants, hotels, and movie theaters; for example, "Until what time is happy hour?". These questions are challenging to answer because the answers are found in open-domain Web, are present in short sentences without surrounding cont…

    Submitted 10 December, 2020; originally announced December 2020.

  18. Finding Cut-Offs in Leaderless Rendez-Vous Protocols is Easy

    Authors: A. R. Balasubramanian, Javier Esparza, Mikhail Raskin

    Abstract: In rendez-vous protocols an arbitrarily large number of indistinguishable finite-state agents interact in pairs. The cut-off problem asks if there exists a number $B$ such that all initial configurations of the protocol with at least $B$ agents in a given initial state can reach a final configuration with all agents in a given final state. In a recent paper (Horn and Sangnier, CONCUR 2020), Horn a…

    Submitted 11 October, 2023; v1 submitted 19 October, 2020; originally announced October 2020.

    Journal ref: Logical Methods in Computer Science, Volume 19, Issue 4 (October 12, 2023) lmcs:8354

  19. arXiv:2010.05248  [pdf, other]

    cs.CL

    Towards Accurate and Reliable Energy Measurement of NLP Models

    Authors: Qingqing Cao, Aruna Balasubramanian, Niranjan Balasubramanian

    Abstract: Accurate and reliable measurement of energy consumption is critical for making well-informed design choices when choosing and training large scale NLP models. In this work, we show that existing software-based energy measurements are not accurate because they do not take into account hardware differences and how resource utilization affects energy consumption. We conduct energy measurement experim…

    Submitted 11 October, 2020; originally announced October 2020.

    Comments: Accepted to SustaiNLP 2020 (co-located with EMNLP 2020)

  20. arXiv:2007.06248  [pdf, ps, other]

    cs.LO cs.DC

    Complexity of Verification and Synthesis of Threshold Automata

    Authors: A. R. Balasubramanian, Javier Esparza, Marijana Lazic

    Abstract: Threshold automata are a formalism for modeling and analyzing fault-tolerant distributed algorithms, recently introduced by Konnov, Veith, and Widder, describing protocols executed by a fixed but arbitrary number of processes. We conduct the first systematic study of the complexity of verification and synthesis problems for threshold automata. We prove that the coverability, reachability, safety,…

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: Accepted at ATVA20

  21. arXiv:2005.00697  [pdf, other]

    cs.CL cs.AI cs.LG

    DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering

    Authors: Qingqing Cao, Harsh Trivedi, Aruna Balasubramanian, Niranjan Balasubramanian

    Abstract: Transformer-based QA models use input-wide self-attention -- i.e. across both the question and the input passage -- at all layers, causing them to be slow and memory-intensive. It turns out that we can get by without input-wide self-attention at all layers, especially in the lower layers. We introduce DeFormer, a decomposed transformer, which substitutes the full self-attention with question-wide…

    Submitted 2 May, 2020; originally announced May 2020.

    Comments: ACL 2020 camera ready

  22. arXiv:2004.09621  [pdf, other]

    cs.LO

    Characterizing consensus in the Heard-Of model

    Authors: A. R. Balasubramanian, Igor Walukiewicz

    Abstract: The Heard-Of model is a simple and relatively expressive model of distributed computation. Because of this, it has gained considerable attention from the verification community. We give a characterization of all algorithms solving consensus in a fragment of this model. The fragment is big enough to cover many prominent consensus algorithms. The characterization is purely syntactic: it is expressed…

    Submitted 20 April, 2020; originally announced April 2020.

  23. arXiv:2002.02645  [pdf, other]

    cs.LG stat.ML

    Accelerating Deep Learning Inference via Freezing

    Authors: Adarsh Kumar, Arjun Balasubramanian, Shivaram Venkataraman, Aditya Akella

    Abstract: Over the last few years, Deep Neural Networks (DNNs) have become ubiquitous owing to their high accuracy on real-world tasks. However, this increase in accuracy comes at the cost of computationally expensive models leading to higher prediction latencies. Prior efforts to reduce this latency such as quantization, model distillation, and any-time prediction models typically trade off accuracy for pe…

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: 11th USENIX Workshop on Hot Topics in Cloud Computing, HotCloud 2019

  24. arXiv:1911.09849  [pdf, other]

    cs.DC

    Archipelago: A Scalable Low-Latency Serverless Platform

    Authors: Arjun Singhvi, Kevin Houck, Arjun Balasubramanian, Mohammed Danish Shaikh, Shivaram Venkataraman, Aditya Akella

    Abstract: The increased use of micro-services to build web applications has spurred the rapid growth of Function-as-a-Service (FaaS) or serverless computing platforms. While FaaS simplifies provisioning and scaling for application developers, it introduces new challenges in resource management that need to be handled by the cloud provider. Our analysis of popular serverless workloads indicates that schedule…

    Submitted 21 November, 2019; originally announced November 2019.

    Comments: 14 pages

  25. arXiv:1909.01667  [pdf, ps, other]

    cs.LO cs.CC

    Complexity of controlled bad sequences over finite sets of $\mathbb{N}^d$

    Authors: A. R. Balasubramanian

    Abstract: We provide upper and lower bounds for the length of controlled bad sequences over the majoring and the minoring orderings of finite sets of $\mathbb{N}^d$. The results are obtained by bounding the length of such sequences by functions from the Cichon hierarchy. This allows us to translate these results to bounds over the fast-growing complexity classes. The obtained bounds are proven to be tight…

    Submitted 8 June, 2020; v1 submitted 4 September, 2019; originally announced September 2019.

  26. arXiv:1907.01484  [pdf, other]

    cs.DC

    Themis: Fair and Efficient GPU Cluster Scheduling

    Authors: Kshiteej Mahajan, Arjun Balasubramanian, Arjun Singhvi, Shivaram Venkataraman, Aditya Akella, Amar Phanishayee, Shuchi Chawla

    Abstract: Modern distributed machine learning (ML) training workloads benefit significantly from leveraging GPUs. However, significant contention ensues when multiple such workloads are run atop a shared cluster of GPUs. A key question is how to fairly apportion GPUs across workloads. We find that established cluster scheduling disciplines are a poor fit because of ML workloads' unique attributes: ML jobs h…

    Submitted 29 October, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

  27. arXiv:1904.08903  [pdf, ps, other]

    math.CO cs.DM

    Generalized threshold arrangements

    Authors: A. R. Balasubramanian

    Abstract: An arrangement of hyperplanes is a finite collection of hyperplanes in a real Euclidean space. To such a collection one associates the characteristic polynomial that encodes the combinatorics of intersections of the hyperplanes. Finding the characteristic polynomial of the Shi threshold and the Catalan threshold arrangements was an open problem in Stanley's list of problems in [1]. Seunghyun Seo s…

    Submitted 18 April, 2019; originally announced April 2019.

  28. arXiv:1904.01095  [pdf]

    physics.comp-ph cond-mat.mtrl-sci cs.CE cs.LG cs.NE

    Fast, accurate, and transferable many-body interatomic potentials by symbolic regression

    Authors: Alberto Hernandez, Adarsh Balasubramanian, Fenglin Yuan, Simon Mason, Tim Mueller

    Abstract: The length and time scales of atomistic simulations are limited by the computational cost of the methods used to predict material properties. In recent years there has been great progress in the use of machine learning algorithms to develop fast and accurate interatomic potential models, but it remains a challenge to develop models that generalize well and are fast enough to be used at extreme tim…

    Submitted 17 August, 2019; v1 submitted 1 April, 2019; originally announced April 2019.

  29. Parameterized Verification of Coverability in Well-Structured Broadcast Networks

    Authors: A. R. Balasubramanian

    Abstract: Parameterized verification of coverability in broadcast networks with finite state processes has been studied for different types of models and topologies. In this paper, we attempt to develop a theory of broadcast networks in which the processes can be well-structured transition systems. The resulting formalism is called well-structured broadcast networks. We give an algorithm to decide coverabil…

    Submitted 9 September, 2018; originally announced September 2018.

    Comments: In Proceedings GandALF 2018, arXiv:1809.02416

    Journal ref: EPTCS 277, 2018, pp. 133-146

  30. arXiv:1802.08469  [pdf, ps, other]

    cs.LO cs.DC cs.FL

    Parameterized verification of synchronization in constrained reconfigurable broadcast networks

    Authors: A. R. Balasubramanian, Nathalie Bertrand, Nicolas Markey

    Abstract: Reconfigurable broadcast networks provide a convenient formalism for modelling and reasoning about networks of mobile agents broadcasting messages to other agents following some (evolving) communication topology. The parameterized verification of such models aims at checking whether a given property holds irrespective of the initial configuration (number of agents, initial states and initial commu…

    Submitted 23 February, 2018; originally announced February 2018.

    Comments: Accepted for publication in TACAS 2018

  31. arXiv:1706.00878  [pdf, other]

    cs.DC cs.LG cs.NE

    MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU

    Authors: Qingqing Cao, Niranjan Balasubramanian, Aruna Balasubramanian

    Abstract: In this paper, we explore optimizations to run Recurrent Neural Network (RNN) models locally on mobile devices. RNN models are widely used for Natural Language Processing, Machine Translation, and other tasks. However, existing mobile applications that use RNN models do so on the cloud. To address privacy and efficiency concerns, we show how RNN models can be run locally on mobile devices. Existin…

    Submitted 2 June, 2017; originally announced June 2017.

    Comments: Published at 1st International Workshop on Embedded and Mobile Deep Learning colocated with MobiSys 2017

  32. Secure Symmetrical Multilevel Diversity Coding

    Authors: Anantharaman Balasubramanian, Hung D. Ly, Shuo Li, Tie Liu, Scott L. Miller

    Abstract: Symmetrical Multilevel Diversity Coding (SMDC) is a network compression problem introduced by Roche (1992) and Yeung (1995). In this setting, a simple separate coding strategy known as superposition coding was shown to be optimal in terms of achieving the minimum sum rate (Roche, Yeung, and Hau, 1997) and the entire admissible rate region (Yeung and Zhang, 1999) of the problem. This paper consider…

    Submitted 9 January, 2012; originally announced January 2012.

    Comments: Submitted to the IEEE Transactions on Information Theory in May 2011. Minor revision made to the current version in January 2012