Subbarao Kambhampati’s Post

Prof at ASU (Former President of AAAI)

1mo Edited

𝕎𝕙𝕪 #𝔸𝕀 𝕗𝕠𝕝𝕜𝕤 𝕟𝕖𝕖𝕕 𝕒 𝕓𝕣𝕠𝕒𝕕 𝕓𝕒𝕤𝕖𝕕 𝕀𝕟𝕥𝕣𝕠 𝕥𝕠 #𝔸𝕀 👉 As I go around giving talks/tutorials on the planning and reasoning abilities of LLMs, I am constantly surprised at the rather narrow ML-centric background of grad students/young researchers have about #AI. This seems to be especially the case with those who think LLMs are already doing planning and reasoning etc. Most of them don't seem to know much about the many topics that are taught in a broad-based Intro to #AI course--such as combinatorial search, logic, CSP, difference between inductive vs. deductive reasoning (aka learning vs. inference), soundness vs. completeness of inference/reasoning etc. I can understand why a strong background in ML and DL is sine qua non these days in using/applying the current #AI technology. That doesn't however mean that other things, that are typically not covered in ML courses, but are covered in Intro #AI courses, are expendable. If you don't know those concepts, you are more likely than not to re-invent crooked wheels (see this for examples of how people get tripped up: https://lnkd.in/gUPPb7s4) All this is particularly relevant for those busy building empirical scaffolds over LLMs (the "LLMs are Zero-shot <XXX>" variety). Most often, these young researchers are coming from NLP. At one point, NLP used to be NLU and students had quite a firm grasp of logic (e..g Montague Semantics!). But over the years, NLU became NLP which in turn has become Applied Machine Learning, and students don't quite have the background in logic/reasoning etc. Now that LLMs have basically "solved" the "processing" tasks--such as information extraction, format conversion etc., NLP folks are trying to turn to reasoning tasks--but often lack the necessary background. (See this unsolicited advice to NLP students: https://lnkd.in/gKTdsH2P) Background in the standard Intro AI topics like search/CSP/logic are useful even if you don't plan on directly using those techniques (e.g. because you want differentiable everything to make use of your SGD hammer). Like MDPs, they provide a normative basis for many deeper reasoning tasks AI systems would have to carry out when they broaden their scope beyond statistical learning. Without that background, you will likely try to pigeon hole everything into "in/out of distribution" framework, when what you need to think of is "in/out of deductive/knowledge closure; see https://lnkd.in/gTWVibdt ) One of the other things that you get exposed to in the standard Intro #AI is computational complexity of the various reasoning tasks. People who jumped in directly via applied ML might understand a bit of sample complexity (maybe?), but are not that attuned to reasoning complexity. (Contd. in the comment below)

46 Comments

Subbarao Kambhampati

Prof at ASU (Former President of AAAI)

1mo

(Contd. from the main post) While computational complexity has increasingly been sidelined in these days where we mostly ignore the costs of offline training (see this thread on the "death of computational complexity" https://x.com/rao2z/status/1500178336504442880), it will eventually rise up and bite you (especially if you are trying to make money without an #AI Pyramid scheme..). For example, trying to get LLMs to fake reasoning via fine tuning often ignores the amortization costs associated with memoization (as sent off in this GoFAI vs. LLaMAI satire: https://x.com/rao2z/status/1749104832450072953). Understanding conservation of computational complexity also makes you question/avoid unwarranted optimism about reasoning being solved by just approximate retrieval on LLMs, (even with pre-training on web-scale corpora, synthetic data etc). given that they take constant time per completion token! (c.f: https://x.com/rao2z/status/1766087877216371072) (Finally, fwiw, here is the link to the Intro to #AI I teach at ASU that brings all these things together: https://rakaposhi.eas.asu.edu/cse471/ )

24 Reactions

Brody Kutt

Sr. Principal MLE, AI Research at Palo Alto Networks

1mo

Symbolic AI is a tool, just as DL is, and has its own shortcomings. Its a useful tool to learn, no doubt, especially for practitioners. But I am a bit more hesistant to just assume that many people "simply don't know enough about symbolic AI." Every CS degree (with specialization in AI) I've ever heard of features the course you're referring to. They likely do know a thing or two about it but want to steer away from it in general. Many times what gets people into AI is the desire to understand and model human intelligence. There is little to nothing that is biologically plausible about symbolic AI including search, CSP, logic, etc. To be clear, DL and SGD are not exactly plausible either, but it is at least subsymbolic.

Nicos Kekchidis

AI and Cloud-Native Transformer | Meaningful AI enabler | Team Builder | People are No.1 Priority

1mo

Though I agree with the need to have much broader understanding of the pillars of modern AI as pointed out in your post, Prof. Subbarao Kambhampati, I disagree with assertion that LLMs outright lack planning. The terseness of the LI threads don’t make justice to complex topic, but I will try to provide schematic: - LLMs through training capture human experience recorded in millenia worth texts - the acquired knowledge is codified as parameters and embeddings - as a specialized and effective world model (with its limitations, of course) - that’s an empirical planning for all kinds of generalized human related scenarios - when fine-tuning LLM gets instructed to prioritize certain scenarios at the expense of others (ie embeddings and parameters get updated accordingly) - at inference times prompts get overlayed by transformers on the previously created model and actionable outcomes are generated - CSP and generally other search space methods are simply intractable in otherwise combinatorially explosive search space - DL cleverly optimizes the search space I have a brief reader’s digest in the end of this article about - How LLMs work: https://enterai.world/how-intelligent-ai-really-is/

Elizabeth Reilley

Enterprise Technology, Al Acceleration

1mo

Agree! I usully start my AI workshops with basic AI 101: what is AI, ML, DL, neural networks, gen AI, LLMs. And how do all these concepts relate to another. We don’t know what we don’t know. I initially found it surprising how many people with deep ML expertise and experience would come up and thank me for the big picture overview and tell me how much they learned and how helpful it is to understand the broader landscape. I worried I might come across as talking down to some of these brilliant folks I have the honor of speaking to. 😅

3 Reactions

Christopher Riesbeck

Associate Professor at Northwestern University

1mo

Not only has NLU devolved to NLP, but natural language generation used to mean “express some conceptual intent in a natural language.” Now it means “extend this text with more text.”

6 Reactions

Michael Rovatsos

Professor of AI at The University of Edinburgh

I would actually go further - I have no problem swallowing the bitter pill that arguably none of sybolic AI ever truly worked on real-world problems. What I worry more about is that we are losing the methodological foundation of computer science, which is to rigorously define a computational problem and think about what representations and algorithms will solve it best. That's what people are no longer interested in.

2 Reactions

Shiwali Mohan

Principal AI Scientist | Collaborative Human-AI Systems | Embodied AI | Reasoning and Learning with World Models | Human Cognition

1mo

So, what are universities, CS departments, and academics doing about this? This knowledge gap in a whole generation of researchers is reflective of the hiring bias US departments have shown in the past decade.

1 Reaction

Soma Dhavala

1mo

Totally agree. Also, we can apply our own mind to learn, and understand these (classical AI) topics, rather than throw data, compute at them, and expect our hypothesis to not turn out false (as David Donoho says, Deep Learning is a magical mirror. You see what you want to see).

3 Reactions

Dan Selman

1mo

If all you know about is hammers…

2 Reactions

Ovidiu M.

I share insights on Big and Fast Data tools, Teaching, Distributed Systems, HPC, AI, and system design with TLA+

1mo

Go here https://rakaposhi.eas.asu.edu/cse471/

See more comments

To view or add a comment, sign in

More Relevant Posts

Hai Huang

Machine Learning | Deep Learning | Senior Staff Engineer at Google
1mo
Report this post
Prof. Rao's post is a good starting point for LLM practitioners to become familiar with classic (i.e., predating LLMs) AI concepts and theories, for two reasons: 📌 To understand that some things are theoretically beyond the reach of LLMs and not waste resources chasing them. 📌 To hedge against the possibility that LLMs may be replaced by other technologies and/or undergo fundamental technological revamps, which would render all existing investments in LLMs obsolete. #artificialintelligence #machinelearning #deeplearning

Subbarao Kambhampati

Prof at ASU (Former President of AAAI)
1mo Edited

𝕎𝕙𝕪 #𝔸𝕀 𝕗𝕠𝕝𝕜𝕤 𝕟𝕖𝕖𝕕 𝕒 𝕓𝕣𝕠𝕒𝕕 𝕓𝕒𝕤𝕖𝕕 𝕀𝕟𝕥𝕣𝕠 𝕥𝕠 #𝔸𝕀 👉 As I go around giving talks/tutorials on the planning and reasoning abilities of LLMs, I am constantly surprised at the rather narrow ML-centric background of grad students/young researchers have about #AI. This seems to be especially the case with those who think LLMs are already doing planning and reasoning etc. Most of them don't seem to know much about the many topics that are taught in a broad-based Intro to #AI course--such as combinatorial search, logic, CSP, difference between inductive vs. deductive reasoning (aka learning vs. inference), soundness vs. completeness of inference/reasoning etc. I can understand why a strong background in ML and DL is sine qua non these days in using/applying the current #AI technology. That doesn't however mean that other things, that are typically not covered in ML courses, but are covered in Intro #AI courses, are expendable. If you don't know those concepts, you are more likely than not to re-invent crooked wheels (see this for examples of how people get tripped up: https://lnkd.in/gUPPb7s4) All this is particularly relevant for those busy building empirical scaffolds over LLMs (the "LLMs are Zero-shot <XXX>" variety). Most often, these young researchers are coming from NLP. At one point, NLP used to be NLU and students had quite a firm grasp of logic (e..g Montague Semantics!). But over the years, NLU became NLP which in turn has become Applied Machine Learning, and students don't quite have the background in logic/reasoning etc. Now that LLMs have basically "solved" the "processing" tasks--such as information extraction, format conversion etc., NLP folks are trying to turn to reasoning tasks--but often lack the necessary background. (See this unsolicited advice to NLP students: https://lnkd.in/gKTdsH2P) Background in the standard Intro AI topics like search/CSP/logic are useful even if you don't plan on directly using those techniques (e.g. because you want differentiable everything to make use of your SGD hammer). Like MDPs, they provide a normative basis for many deeper reasoning tasks AI systems would have to carry out when they broaden their scope beyond statistical learning. Without that background, you will likely try to pigeon hole everything into "in/out of distribution" framework, when what you need to think of is "in/out of deductive/knowledge closure; see https://lnkd.in/gTWVibdt ) One of the other things that you get exposed to in the standard Intro #AI is computational complexity of the various reasoning tasks. People who jumped in directly via applied ML might understand a bit of sample complexity (maybe?), but are not that attuned to reasoning complexity. (Contd. in the comment below)

3 Comments
Like Comment
To view or add a comment, sign in
Chris Hart

Senior Consultant @ Monks
1mo
Report this post
💯 This is also a huge issue for enterprises which requires change management around their data cultures. If I had a dollar for every time a bright eyed ML enthusiast claimed that we need 18 months of “data readiness” efforts before we could ship a recommendation engine, or that all “insights” require batch training and statistical modeling… Lots of very capable data scientists and engineers like to throw their own version of cold water onto AI hype (a quality to be admired). But when they missed out on the full history of techniques and double down on making everything mathematically rigorous, they just make business problems worse. There is a whole world of human-friendly logic and taxonomy modeling that integrates seamlessly with search and recommendation tasks, and it’s way faster and cheaper to ship. The real path forward is a hybrid of the classical symbolic, Good Old Fashioned AI (GOFAI), with the statistical modeling of data science (nowadays generative models to get started faster), plus human critics and curators to guide the knowledge requirements and oversee compliance for the system. Then, as generative models make small gains on retrieval by training on the exact content of a given domain (or that it is discovered where uncertainty is harmless for less consequential tasks), or if certain niche areas of the domain turn out to require more rigorous statistical metrics, system designers can decide, empirically at the margins, to both ease off a bit on the initial logic controls, and to train very specialized models for the metrics discovered. We need GOFAI, data, and UX teams to be allies in the face of hype, because if we don’t join forces, then we’ll produce janky, siloed systems and our organizations will be weighed down by confused executives and managers just trying to solve problems for their customers while we bikeshed the exact pattern of Conway’s Law to follow. Into a vacuum of confusion and competing disciplinary silos, comes vendors with crappy products that don’t scale or work with customer knowledge properly, or worse, having only one of the silos dominate the path to market, and hitting a wall when it is realized that their one third of the whole picture is necessary but insufficient. This internal competition and disunity has played out in every large organization I’ve analyzed over my career, but only got worse with the advent of LLMs, while so much waste could be avoided, and value delivered instead. GOFAI + (data + ML) + UX = responsible AI 😎

Subbarao Kambhampati

Prof at ASU (Former President of AAAI)
1mo Edited

𝕎𝕙𝕪 #𝔸𝕀 𝕗𝕠𝕝𝕜𝕤 𝕟𝕖𝕖𝕕 𝕒 𝕓𝕣𝕠𝕒𝕕 𝕓𝕒𝕤𝕖𝕕 𝕀𝕟𝕥𝕣𝕠 𝕥𝕠 #𝔸𝕀 👉 As I go around giving talks/tutorials on the planning and reasoning abilities of LLMs, I am constantly surprised at the rather narrow ML-centric background of grad students/young researchers have about #AI. This seems to be especially the case with those who think LLMs are already doing planning and reasoning etc. Most of them don't seem to know much about the many topics that are taught in a broad-based Intro to #AI course--such as combinatorial search, logic, CSP, difference between inductive vs. deductive reasoning (aka learning vs. inference), soundness vs. completeness of inference/reasoning etc. I can understand why a strong background in ML and DL is sine qua non these days in using/applying the current #AI technology. That doesn't however mean that other things, that are typically not covered in ML courses, but are covered in Intro #AI courses, are expendable. If you don't know those concepts, you are more likely than not to re-invent crooked wheels (see this for examples of how people get tripped up: https://lnkd.in/gUPPb7s4) All this is particularly relevant for those busy building empirical scaffolds over LLMs (the "LLMs are Zero-shot <XXX>" variety). Most often, these young researchers are coming from NLP. At one point, NLP used to be NLU and students had quite a firm grasp of logic (e..g Montague Semantics!). But over the years, NLU became NLP which in turn has become Applied Machine Learning, and students don't quite have the background in logic/reasoning etc. Now that LLMs have basically "solved" the "processing" tasks--such as information extraction, format conversion etc., NLP folks are trying to turn to reasoning tasks--but often lack the necessary background. (See this unsolicited advice to NLP students: https://lnkd.in/gKTdsH2P) Background in the standard Intro AI topics like search/CSP/logic are useful even if you don't plan on directly using those techniques (e.g. because you want differentiable everything to make use of your SGD hammer). Like MDPs, they provide a normative basis for many deeper reasoning tasks AI systems would have to carry out when they broaden their scope beyond statistical learning. Without that background, you will likely try to pigeon hole everything into "in/out of distribution" framework, when what you need to think of is "in/out of deductive/knowledge closure; see https://lnkd.in/gTWVibdt ) One of the other things that you get exposed to in the standard Intro #AI is computational complexity of the various reasoning tasks. People who jumped in directly via applied ML might understand a bit of sample complexity (maybe?), but are not that attuned to reasoning complexity. (Contd. in the comment below)
Like Comment
To view or add a comment, sign in
Pritam Mohanty

CEO of Product
2mo
Report this post
🚀 Enhancing NLP Model Performance with Prompt Engineering 🌟 Are you a product manager looking to take your NLP models to the next level? Prompt engineering is a powerful technique that can help you improve the performance of your models and deliver better results. Here are some tips and best practices to help you leverage prompt engineering effectively: 🔍 Understand Your Data: Before diving into prompt engineering, make sure you have a clear understanding of your data and the specific problem you're trying to solve. This will help you create prompts that are tailored to your use case and goals. 📝 Craft Clear and Specific Prompts: The key to effective prompt engineering is crafting prompts that are clear, specific, and relevant to the task at hand. Avoid ambiguity and make sure your prompts provide enough context for the model to generate accurate responses. 🎯 Experiment with Different Prompt Formats: Don't be afraid to experiment with different prompt formats to see what works best for your NLP model. Try using different types of prompts, such as fill-in-the-blank or multiple-choice, to see which one yields the best results. 🔧 Fine-Tune Your Prompts: Regularly fine-tune and refine your prompts based on the performance of your NLP model. Analyze the outputs generated by your model and make adjustments to your prompts to address any issues or improve accuracy. 🧠 Incorporate Domain Knowledge: Leverage your domain knowledge and expertise to create prompts that are tailored to the specific nuances of your industry or use case. This can help you create more effective prompts that produce better results. 🚦 Monitor and Measure Performance: Keep a close eye on the performance of your NLP models and regularly monitor key metrics to track the impact of your prompt engineering efforts. Use this data to make informed decisions and iterate on your prompts. By leveraging prompt engineering techniques effectively, product managers can unlock the full potential of their NLP models and deliver impactful solutions that meet the needs of their users. Stay curious, experiment with different approaches, and never stop learning to continuously improve your NLP model performance! 💡✨ #ProductManagement #NLP #PromptEngineering #AI #GenAI #BestPractices #Innovation #DataScience #ProductDevelopment
Like Comment
To view or add a comment, sign in
Research Graph

167 followers
4mo Edited
Report this post
Summary of the Article “BERT Explained: State of the art language model for NLP ” Article Link: https://lnkd.in/dzFesuy In this article, the author talks about the working of BERT. The article emphasises on two key things, bidirectional training and the Masked LM technique used in BERT. The author states that a transformer has two mechanisms - an encoder that reads the input and a decoder that produces the prediction. In addition, the article also explains how the bidirectional nature of the transformer helps the model understand the context of a word better. The article also describes the Masked LM technique used in BERT. During this technique, 15% of the words in each sequence are replaced with a [MASK] token. The model then attempts to predict the original value of these tokens using the context provided by other non-masked words. In addition to Masked LM, the BERT model is also trained using a technique called Next Sentence Prediction (NSP). During this process, the model is fed with a pair of sentences as input and the model learns to predict if the second sentence is the subsequent sentence in the original document. Lastly, the article talks about the wide variety of applications that BERT can be used for. These applications include classification tasks, question-answering tasks, name entity recognition etc. In addition to this, it can also be used for transfer learning purposes and be applied to other NLP tasks as well. In conclusion, BERT is undoubtedly a breakthrough in the field of NLP which has a vast array of applications. Reference: https://lnkd.in/dzFesuy #artificialintelligence #naturallanguage #BERT
Like Comment
To view or add a comment, sign in
AUEB NLP Group

684 followers
5mo Edited
Report this post
Next AUEB NLP Meeting, Friday March 1st, 2024, 17:15-18:30 (Greek time) Title: Discussion of "RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval" (https://lnkd.in/ecKj2pTe) Room: T414, AUEB Troias building (2 Troias Str., Kimolou wing, 4th floor) and virtually via MS Teams: https://lnkd.in/dP6PmvD5 • Meeting ID: 372 651 892 794 • Passcode: iNTWbb Abstract: Retrieval-augmented language models can better adapt to changes in world state and incorporate long-tail knowledge. However, most existing methods retrieve only short contiguous chunks from a retrieval corpus, limiting holistic understanding of the overall document context. We introduce the novel approach of recursively embedding, clustering, and summarizing chunks of text, constructing a tree with differing levels of summarization from the bottom up. At inference time, our RAPTOR model retrieves from this tree, integrating information across lengthy documents at different levels of abstraction. Controlled experiments show that retrieval with recursive summaries offers significant improvements over traditional retrieval-augmented LMs on several tasks. On question-answering tasks that involve complex, multi-step reasoning, we show state-of-the-art results; for example, by coupling RAPTOR retrieval with the use of GPT-4, we can improve the best performance on the QuALITY benchmark by 20% in absolute accuracy. For ways to receive news about the NLP Group and its meetings, check https://lnkd.in/dECavNu4. To subscribe to the mailing list of AUEB NLP Group, send a message with subject "subscribe nodigest" to nlpmeetings-request@cs.aueb.gr. If you have an AUEB account and want to view all scheduled AUEB NLP Group Meetings in your MS Teams calendar, "AUEB NLP Group meetings" group on MS Teams (code: 01j65ny). Team members can also send text messages (chat) to other team members. If you are an AI researcher or practitioner, please consider becoming a member of the Hellenic Artificial Intelligence Society (EETN, http://www.eetn.gr/en/).
Like Comment
To view or add a comment, sign in
Pritam Mohanty

CEO of Product
1mo
Report this post
🚀 Enhancing NLP Models Through Prompt Engineering 🧠✨ As product managers, leveraging prompt engineering techniques can be a game-changer in improving the performance of NLP models. Here are some key tips and best practices to consider on this exciting journey: 🔹 **Understand Your Data**: Before diving into prompt engineering, ensure you have a deep understanding of your data. Analyze the patterns, nuances, and biases present in your dataset to tailor prompts effectively. 🔹 **Craft Clear and Specific Prompts**: The success of prompt engineering lies in the clarity and specificity of the prompts. Avoid vague language and ambiguity to guide the NLP model towards accurate responses. 🔹 **Experiment and Iterate**: Don't be afraid to experiment with different prompt variations. Test out diverse approaches, iterate on the language used, and analyze the model's responses to refine your prompt strategy. 🔹 **Domain Expert Collaboration**: Collaborating with domain experts can provide valuable insights into crafting prompts that resonate with the context of your product or industry. Their expertise can help in formulating prompts that capture the intricacies of the domain. 🔹 **Continuous Monitoring and Evaluation**: Monitoring the performance of your NLP model is crucial. Regularly evaluate the outputs, gather feedback, and make adjustments to the prompts based on the model's responses to ensure ongoing improvements. 🔹 **Utilize Transfer Learning**: Leverage transfer learning techniques to fine-tune your NLP models. By transferring knowledge from pre-trained models, you can enhance the performance and efficiency of your prompts. 🔹 **Consider Bias and Fairness**: Be mindful of biases that may inadvertently seep into your prompts. Regularly assess the data inputs and outputs for any biases and work towards creating fair and inclusive prompts. By incorporating these tips and best practices into your prompt engineering approach, you can unlock the full potential of NLP models and empower your product with enhanced performance and accuracy. Stay curious, iterate relentlessly, and watch your NLP models flourish! 💡🌟 #NLP #PromptEngineering #ProductManagement #AI #MachineLearning #GenAI #ProductPerformance #Innovation #TechTips
Like Comment
To view or add a comment, sign in
Vijeta Wasnik

Trainee Analyst @Sociometrik , Ex-GET @Birla Opus | Natural Language Processing, Deep Learning, Machine Learning | B.Tech in Information Technology
1mo
Report this post
🌟 Day 6 of 100 Days of NLP 🌟 Today, let's dive into Text Preprocessing Level 3 with a focus on Word Embeddings! 🚀 📍 Word Embeddings: Definition: Word embeddings are a type of word representation that allows words to be represented as vectors in a continuous vector space. They capture semantic meanings, relationships, and contexts of words. Popular word embedding techniques include Word2Vec, GloVe, and FastText. How It Works: Word embeddings are trained using neural networks on large text corpora. The result is a vector for each word, where semantically similar words have similar vectors. Popular Word Embedding Techniques: Word2Vec: Developed by Google. Uses two architectures: Continuous Bag of Words (CBOW) and Skip-gram. Example: Words like "king" and "queen" will have similar vectors. GloVe (Global Vectors for Word Representation): Developed by Stanford. Combines the advantages of global matrix factorization and local context window methods. Example: Captures the context of words by looking at their co-occurrence probabilities in a corpus. FastText: Developed by Facebook. Extends Word2Vec by representing words as bags of character n-grams. Example: Can generate embeddings for rare or misspelled words. Benefits of Word Embeddings: Capture semantic meanings and relationships between words. Efficient representation of words in a dense vector space. Improve performance in various NLP tasks like sentiment analysis, machine translation, and text classification. Limitations: Require large corpora for training effective embeddings. May not capture context-specific meanings (e.g., "bank" as in river vs. financial institution). Stay tuned for more insights and questions as we continue our journey into NLP! Let's keep learning and exploring the fascinating world of Natural Language Processing. 🚀💡 #100DaysOfNLP #NaturalLanguageProcessing #AI #MachineLearning #DataScience #WordEmbeddings #Word2Vec #GloVe #FastText
Like Comment
To view or add a comment, sign in
kaikai luo

CEO
2mo
Report this post
📌 Monosemanticity in NLP 1. **Monosemanticity**: Single, well-defined word meanings. Improves NLP accuracy and efficiency by reducing ambiguity. 2. **Challenges of Polysemy**: Multiple meanings cause confusion in NLP models. Traditional solutions require large, annotated datasets, which are impractical. 3. **Scaling Monosemanticity**: Involves creating word embeddings with single meanings, reducing ambiguity, and improving model performance. 4. **Techniques**: - **Contrastive Learning**: Distinguishes semantically distinct words. - **Semantic Clustering**: Groups similar words. - **Multi-Task Learning**: Trains on multiple tasks. - **Attention Mechanisms**: Focuses on relevant input parts. 5. **Benefits**: - Improved accuracy in tasks like sentiment analysis. - Faster processing without disambiguating meanings. - Less need for annotated data. - Applicable to any language. 6. **Applications**: - Chatbots and virtual assistants. - Machine translation. - Text summarization and generation. - Information retrieval. - Sentiment analysis. 7. **Challenges**: - Requires large, high-quality datasets. - Potential bias in training data. - Interpretability issues. 8. **Future**: Can revolutionize NLP and other AI fields. Important to address challenges like bias and interpretability. #NLP #AI #Monosemanticity #MachineLearning #TechInnovation
Like Comment
To view or add a comment, sign in
Pritam Mohanty

CEO of Product
1mo
Report this post
🚀 Enhancing NLP Model Performance: Leveraging Prompt Engineering 🚀 Are you a product manager looking to supercharge your NLP models' performance? Look no further than prompt engineering! Here are some tips and best practices to help you utilize this technique effectively: - **Understand Your Data**: Before diving into prompt engineering, ensure you have a deep understanding of your data. Analyze the language patterns, key phrases, and context to craft prompts that resonate with your specific use case. - **Start Simple**: Begin with straightforward prompts that target the core of your NLP model's objective. As you fine-tune the prompts, gradually introduce complexity to improve the model's understanding and accuracy. - **Iterate and Experiment**: Don't be afraid to experiment with different prompts. Iterate on your approach, test various formulations, and analyze the results to identify what works best for your specific NLP task. - **Context is Key**: Incorporate relevant context into your prompts to provide the model with valuable information for better comprehension. Contextual prompts can significantly enhance the model's performance by guiding it towards more accurate predictions. - **Domain-Specific Prompts**: Tailor your prompts to the domain or industry your NLP model operates in. By incorporating domain-specific terminology and context, you can improve the model's ability to generate meaningful outputs within that particular domain. - **Fine-Tune Regularly**: Continuous fine-tuning of prompts is essential for maintaining optimal NLP model performance. Keep track of the model's output, gather feedback, and adjust your prompts accordingly to ensure consistent improvement. - **Collaborate with Data Scientists**: Work closely with data scientists to create prompts that align with the model's architecture and objectives. Collaborative efforts can help bridge the gap between product management and model development, leading to more effective prompt engineering strategies. By leveraging prompt engineering techniques effectively, product managers can unlock the full potential of NLP models, improving performance, accuracy, and user experience. Stay curious, keep experimenting, and watch your NLP models transform into powerful assets for your product! 💡✨ #NLP #ProductManagement #PromptEngineering #GenAI #TechInnovation
Like Comment
To view or add a comment, sign in
Astha Chaudhary

Senior Engineer | MTech in Data Science and Engineering
2w
Report this post
Day 18: NLP vs LLM - What should I use in my new usecase? LLM might be the most trendiest NLP model these days, but its not the answer to all of your problems. Let's discuss why. NLP (Natural Language Processing) and LLMs (Large Language Models) are both fields of AI concerned with how computers understand and interact with human language, but they approach it in fundamentally different ways. Here's a breakdown to illustrate the contrast: NLP: The meticulous rule follower Think of NLP like studying a language textbook. It delves into the building blocks of language – grammar rules, syntax, and semantics – and uses various techniques (like machine learning) to tackle specific tasks. NLP excels at well-defined problems. It can be incredibly accurate at tasks like sentence structure analysis, named entity recognition (identifying things like people and places in text), or sentiment analysis (determining the emotional tone of a piece of writing). Because NLP is built for specific goals, it often requires less computational power and can be more efficient. LLMs: The language powerhouse Imagine LLMs as being immersed in a massive ocean of text data. They learn by analyzing vast amounts of text, identifying patterns and relationships between words and phrases. This allows them to generate human-quality text, translate languages, write different kinds of creative content, and answer your questions in an informative way. LLMs are like the ultimate language swiss army knife – a single model that can handle a wide range of tasks. However, because they are data-driven, they may not always be as precise as NLP for specific tasks. Additionally, their training on massive datasets requires significant computational resources. The synergy between NLP and LLMs While NLP and LLMs have distinct strengths, they can be a powerful combination. NLP can provide the structure and fine-tuning, while LLMs bring the adaptability and broad capabilities. This teamwork is leading to exciting advancements in AI, allowing for more nuanced and comprehensive interactions with machines. #100DaysOfReviseDS #100dayschallenge #ai #ml #nlp #datascience
Like Comment
To view or add a comment, sign in

8,406 followers

288 Posts

View Profile Follow

Subbarao Kambhampati’s Post

More Relevant Posts

Explore topics