Showing 1–23 of 23 results for author: Ranjan, S

  1. arXiv:2407.07858  [pdf, other]

    cs.LG cs.CL

    FACTS About Building Retrieval Augmented Generation-based Chatbots

    Authors: Rama Akkiraju, Anbang Xu, Deepak Bora, Tan Yu, Lu An, Vishal Seth, Aaditya Shukla, Pritam Gundecha, Hridhay Mehta, Ashwin Jha, Prithvi Raj, Abhinav Balasubramanian, Murali Maram, Guru Muthusamy, Shivakesh Reddy Annepally, Sidney Knowles, Min Du, Nick Burnett, Sean Javiya, Ashok Marannan, Mamta Kumari, Surbhi Jha, Ethan Dereszenski, Anupam Chakraborty, Subhash Ranjan, et al. (13 additional authors not shown)

    Abstract: Enterprise chatbots, powered by generative AI, are emerging as key applications to enhance employee productivity. Retrieval Augmented Generation (RAG), Large Language Models (LLMs), and orchestration frameworks like Langchain and Llamaindex are crucial for building these chatbots. However, creating effective enterprise chatbots is challenging and requires meticulous RAG pipeline engineering. This…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 8 pages, 6 figures, 2 tables, Preprint submission to ACM CIKM 2024

  2. arXiv:2406.09443  [pdf, other]

    eess.AS cs.HC cs.LG

    Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness

    Authors: Satyam Kumar, Sai Srujana Buddi, Utkarsh Oggy Sarawgi, Vineet Garg, Shivesh Ranjan, Ognjen Rudovic, Ahmed Hussen Abdelaziz, Saurabh Adya

    Abstract: Voice activity detection (VAD) is a critical component in various applications such as speech recognition, speech enhancement, and hands-free communication systems. With the increasing demand for personalized and context-aware technologies, the need for effective personalized VAD systems has become paramount. In this paper, we present a comparative analysis of Personalized Voice Activity Detection…

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2405.07730  [pdf, other]

    cs.CL

    Does Dependency Locality Predict Non-canonical Word Order in Hindi?

    Authors: Sidharth Ranjan, Marten van Schijndel

    Abstract: Previous work has shown that isolated non-canonical sentences with Object-before-Subject (OSV) order are initially harder to process than their canonical counterparts with Subject-before-Object (SOV) order. Although this difficulty diminishes with appropriate discourse context, the underlying cognitive factors responsible for alleviating processing challenges in OSV sentences remain a question. In…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Accepted at CogSci-2024 with full paper publication

  4. arXiv:2404.18684  [pdf, other]

    cs.CL econ.TH math.OC

    Work Smarter...Not Harder: Efficient Minimization of Dependency Length in SOV Languages

    Authors: Sidharth Ranjan, Titus von der Malsburg

    Abstract: Dependency length minimization is a universally observed quantitative property of natural languages. However, the extent of dependency length minimization, and the cognitive mechanisms through which the language processor achieves this minimization remain unclear. This research offers mechanistic insights by postulating that moving a short preverbal constituent next to the main verb explains preve…

    Submitted 10 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted at CogSci-2024 as talk with full paper publication

  5. arXiv:2312.10092  [pdf, other]

    cs.CY

    Introspecting the Happiness amongst University Students using Machine Learning

    Authors: Sakshi Ranjan, Pooja Priyadarshini, Subhankar Mishra

    Abstract: Happiness underlines the intuitive constructs of a specified population based on positive psychological outcomes. It is the cornerstone of cognitive skills, and exploring university students' happiness has been a focus of researchers lately. In this study, we have analyzed university students' happiness and its facets using statistical distribution charts; designing research questio…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 5 Figures, 10 tables, 12 pages. Accepted at Happiness Meet IIT Kharagpur-2022

  6. Perceiving University Student's Opinions from Google App Reviews

    Authors: Sakshi Ranjan, Subhankar Mishra

    Abstract: Google app market captures the school of thought of users from every corner of the globe via ratings and text reviews, in a multilinguistic arena. The potential information from the reviews cannot be extracted manually, due to its exponential growth. So sentiment analysis, by machine learning and deep learning algorithms employing NLP, explicitly uncovers and interprets the emotions. This study p…

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: Accepted in Concurrency and Computation Practice and Experience

    Journal ref: Concurrency and Computation: Practice and Experience, 34(10), p.e6800 (2022)

  7. arXiv:2306.04332  [pdf]

    cs.HC

    A Systematic Study Of Various Fingertip Detection Techniques For Air Writing Using Machine Learning

    Authors: Heena, Sandeep Ranjan

    Abstract: The recent advancement in technology breaks the barriers to communication between users and computers. The communication between humans and computers includes emotion and gesture recognition. Emotions can be recognized on the face of humans whereas gesture recognition includes hand and body gesture recognition. Fingertip detection is also part of it. Gesture recognition is the way of interaction t…

    Submitted 7 June, 2023; originally announced June 2023.

  8. arXiv:2304.11410  [pdf, other]

    cs.CL econ.TH

    A bounded rationality account of dependency length minimization in Hindi

    Authors: Sidharth Ranjan, Titus von der Malsburg

    Abstract: The principle of DEPENDENCY LENGTH MINIMIZATION, which seeks to keep syntactically related words close in a sentence, is thought to universally shape the structure of human languages for effective communication. However, the extent to which dependency length minimization is applied in human language systems is not yet fully understood. Preverbally, the placement of long-before-short constituents a…

    Submitted 22 April, 2023; originally announced April 2023.

    Comments: Accepted at CogSci-2023

  9. arXiv:2302.04577  [pdf, other]

    cs.SD eess.AS

    Incorporating Total Variation Regularization in the design of an intelligent Query by Humming system

    Authors: Shivangi Ranjan, Vishal Srivastava

    Abstract: A Query-By-Humming (QBH) system constitutes a particular case of music information retrieval where the input is a user-hummed melody and the output is the original song which contains that melody. A typical QBH system consists of melody extraction and candidate melody retrieval. For melody extraction, accurate note transcription is the key enabling technology. However, current transcription meth…

    Submitted 9 February, 2023; originally announced February 2023.

  10. arXiv:2210.14380  [pdf, other]

    cs.CL

    Progressive Sentiment Analysis for Code-Switched Text Data

    Authors: Sudhanshu Ranjan, Dheeraj Mekala, Jingbo Shang

    Abstract: Multilingual transformer language models have recently attracted much attention from researchers and are used in cross-lingual transfer learning for many NLP tasks such as text classification and named entity recognition. However, similar methods for transfer learning from monolingual text to code-switched text have not been extensively explored mainly due to the following challenges: (1) Code-swi…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: To appear in Findings of EMNLP 2022

  11. arXiv:2210.13940  [pdf, other]

    cs.CL cs.AI cs.IT

    Discourse Context Predictability Effects in Hindi Word Order

    Authors: Sidharth Ranjan, Marten van Schijndel, Sumeet Agarwal, Rajakrishnan Rajkumar

    Abstract: We test the hypothesis that discourse predictability influences Hindi syntactic choice. While prior work has shown that a number of factors (e.g., information status, dependency length, and syntactic surprisal) influence Hindi word order preferences, the role of discourse predictability is underexplored in the literature. Inspired by prior work on syntactic priming, we investigate how the words an…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

  12. arXiv:2210.13938  [pdf, other]

    cs.CL cs.AI cs.IT

    Dual Mechanism Priming Effects in Hindi Word Order

    Authors: Sidharth Ranjan, Marten van Schijndel, Sumeet Agarwal, Rajakrishnan Rajkumar

    Abstract: Word order choices during sentence production can be primed by preceding sentences. In this work, we test the DUAL MECHANISM hypothesis that priming is driven by multiple different sources. Using a Hindi corpus of text productions, we model lexical priming with an n-gram cache model and we capture more abstract syntactic priming with an adaptive neural language model. We permute the preverbal cons…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted to AACL 2022

  13. arXiv:2204.02455  [pdf, other]

    cs.SD cs.LG eess.AS

    Improving Voice Trigger Detection with Metric Learning

    Authors: Prateeth Nayak, Takuya Higuchi, Anmol Gupta, Shivesh Ranjan, Stephen Shum, Siddharth Sigtia, Erik Marchi, Varun Lakshminarasimhan, Minsik Cho, Saurabh Adya, Chandra Dhir, Ahmed Tewfik

    Abstract: Voice trigger detection is an important task, which enables activating a voice assistant when a target user speaks a keyword phrase. A detector is typically trained on speech data independent of speaker information and used for the voice trigger detection task. However, such a speaker independent voice trigger detector typically suffers from performance degradation on speech from underrepresented…

    Submitted 13 September, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: Accepted at InterSpeech 2022

  14. arXiv:2201.13029  [pdf, other]

    cs.NI

    A Flexible IAB Architecture for Beyond 5G Network

    Authors: Shashi Ranjan, Pranav Jha, Abhay Karandikar, Prasanna Chaporkar

    Abstract: IAB is an innovative wireless backhaul solution to provide cost-efficient deployment of small cells for successful 5G adoption. Besides, IAB can utilize the same spectrum for access and backhaul purposes. The 3GPP standardized IAB in Release 16 and would incorporate a few enhancements in the upcoming releases. The 3GPP IAB architecture, however, suffers from some limitations, such as it does not s…

    Submitted 31 January, 2022; originally announced January 2022.

    Comments: 7 pages, 5 figures, journal

  15. arXiv:2112.15589  [pdf, ps, other]

    cs.CV

    3-D Material Style Transfer for Reconstructing Unknown Appearance in Complex Natural Materials

    Authors: Shashank Ranjan, Corey Toler-Franklin

    Abstract: We propose a 3-D material style transfer framework for reconstructing invisible (or faded) appearance properties in complex natural materials. Our algorithm addresses the technical challenge of transferring appearance properties from one object to another of the same material when both objects have intricate, noncorresponding color patterns. Eggshells, exoskeletons, and minerals, for example, have…

    Submitted 31 December, 2021; originally announced December 2021.

    Comments: 15 pages, 22 figures

    ACM Class: I.3; I.3.5; I.3.7; I.3.8

  16. arXiv:2109.00780  [pdf, ps, other]

    cs.GR cs.CV

    Non-Photorealistic Rendering of Layered Materials: A Multispectral Approach

    Authors: Corey Toler-Franklin, Shashank Ranjan

    Abstract: We present multispectral rendering techniques for visualizing layered materials found in biological specimens. We are the first to use acquired data from the near-infrared and ultraviolet spectra for non-photorealistic rendering (NPR). Several plant and animal species are more comprehensively understood by multispectral analysis. However, traditional NPR techniques ignore unique information outsid…

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: 15 pages, 35 figures

    ACM Class: I.3.3; I.3.8; I.4.0; I.4.1; I.4.3; I.4.8; I.4.9

  17. arXiv:2101.02628  [pdf]

    econ.GN cs.LG cs.SI

    Analyzing the response to TV serials retelecast during COVID19 lockdown in India

    Authors: Sandeep Ranjan

    Abstract: TV serials are a popular source of entertainment. The ongoing COVID19 lockdown has a high probability of degrading the public's mental health. The Government of India started the retelecast of yesteryear's popular TV serials on public broadcaster Doordarshan from 28th March 2020 to 31st July 2020. Tweets corresponding to the Doordarshan hashtag were mined to create a dataset. The experiment aims to…

    Submitted 10 January, 2021; v1 submitted 22 December, 2020; originally announced January 2021.

  18. arXiv:2011.05186  [pdf, other]

    eess.IV cs.CV cs.LG

    Pristine annotations-based multi-modal trained artificial intelligence solution to triage chest X-ray for COVID-19

    Authors: Tao Tan, Bipul Das, Ravi Soni, Mate Fejes, Sohan Ranjan, Daniel Attila Szabo, Vikram Melapudi, K S Shriram, Utkarsh Agrawal, Laszlo Rusko, Zita Herczeg, Barbara Darazs, Pal Tegzes, Lehel Ferenczi, Rakesh Mullick, Gopal Avinash

    Abstract: The COVID-19 pandemic continues to spread and impact the well-being of the global population. The front-line modalities including computed tomography (CT) and X-ray play an important role in triaging COVID patients. Considering the limited access to resources (both hardware and trained personnel) and decontamination considerations, CT may not be ideal for triaging suspected subjects. Artificial i…

    Submitted 10 November, 2020; originally announced November 2020.

  19. Using LSTM for the Prediction of Disruption in ADITYA Tokamak

    Authors: Aman Agarwal, Aditya Mishra, Priyanka Sharma, Swati Jain, Sutapa Ranjan, Ranjana Manchanda

    Abstract: Major disruptions in tokamaks pose a serious threat to the vessel and its surrounding equipment. The ability of the systems to detect any behavior that can lead to disruption can help in alerting the system beforehand and prevent its harmful effects. Many machine learning techniques have already been in use at large tokamaks like JET and ASDEX, but are not suitable for ADITYA, which is co…

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: 7 pages, 4 figures

    Journal ref: Plasma Physics and Controlled Fusion, Volume 63, Number 11, 2021

  20. arXiv:2006.09739  [pdf, other]

    cs.IR cs.LG stat.ML

    Comparative Sentiment Analysis of App Reviews

    Authors: Sakshi Ranjan, Subhankar Mishra

    Abstract: Google app market captures the school of thought of users via ratings and text reviews. The critique's viewpoint regarding an app is proportional to their satisfaction level. Consequently, this helps other users to gain insights before downloading or purchasing the apps. The potential information from the reviews can't be extracted manually, due to its exponential growth. Sentiment analysis, by ma…

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: 10 pages, 7 figures, Accepted to the 11th ICCCNT, 2020, IIT KGP

  21. arXiv:1904.07386  [pdf, other]

    eess.AS cs.CL cs.SD

    I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences

    Authors: Kong Aik Lee, Ville Hautamaki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Hector Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, et al. (21 additional authors not shown)

    Abstract: The I4U consortium was established to facilitate a joint entry to NIST speaker recognition evaluations (SRE). The latest edition of such joint submission was in SRE 2018, in which the I4U submission was among the best-performing systems. SRE'18 also marks the 10-year anniversary of I4U consortium into NIST SRE series of evaluation. The primary objective of the current paper is to summarize the res…

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: 5 pages

  22. arXiv:1610.07651  [pdf, ps, other]

    cs.CL

    UTD-CRSS Systems for 2016 NIST Speaker Recognition Evaluation

    Authors: Chunlei Zhang, Fahimeh Bahmaninezhad, Shivesh Ranjan, Chengzhu Yu, Navid Shokouhi, John H. L. Hansen

    Abstract: This document briefly describes the systems submitted by the Center for Robust Speech Systems (CRSS) from The University of Texas at Dallas (UTD) to the 2016 National Institute of Standards and Technology (NIST) Speaker Recognition Evaluation (SRE). We developed several UBM and DNN i-Vector based speaker recognition systems with different data sets and feature representations. Given that the empha…

    Submitted 24 October, 2016; originally announced October 2016.

    Comments: 5 pages

  23. arXiv:1311.4900  [pdf]

    cs.IR cs.DB

    Query Interface Integrator For Domain Specific Hidden Web

    Authors: Sudhakar Ranjan, Komal K. Bhatia

    Abstract: Web access today relies mainly on search engines. A large amount of data is hidden in the databases behind the search interfaces, referred to as the Hidden web, which needs to be indexed in order to serve user queries. In this paper, database and data mining techniques are used for query interface integration. The query interface must resemble the look and feel of local interface as much a…

    Submitted 16 November, 2013; originally announced November 2013.

    Comments: 8 Pages. International Journal of Computer Engineering and Applications, 2013