- Generative AI: Supercharging Llama 3.1 across NVIDIA Platforms
- Generative AI: Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever
- Generative AI: Customize Generative AI Models for Enterprise Applications with Llama 3.1
- Generative AI: NVIDIA NeMo Accelerates LLM Innovation with Hybrid State Space Model Support
- Simulation / Modeling / Design: NVIDIA Transitions Fully Towards Open-Source GPU Kernel Modules
Recent
Jul 26, 2024
Power Text-Generation Applications with Mistral NeMo 12B Running on a Single GPU
NVIDIA collaborated with Mistral to co-build the next-generation language model that achieves leading performance across benchmarks in its class. With a growing...
6 MIN READ
Jul 26, 2024
Faster Insights from Luminary Cloud's Engineering Simulations with NVIDIA GPUs
Engineering simulation is used across industries to accelerate product development. Simulations are used to check the safety of aircraft, cars, and buildings,...
8 MIN READ
Jul 26, 2024
Computed Tomography Organ and Disease Segmentation Using the NVIDIA VISTA-3D NIM Microservice
Over 300M computed tomography (CT) scans are performed globally, 85M in the US alone. Radiologists are looking for ways to speed up their workflow and generate...
9 MIN READ
Jul 25, 2024
Revolutionizing Code Completion with Codestral Mamba, the Next-Gen Coding LLM
In the rapidly evolving field of generative AI, coding models have become indispensable tools for developers, enhancing productivity and precision in software...
5 MIN READ
Jul 25, 2024
Simulate Elastic Objects in Any Representation with NVIDIA Kaolin Library
Recent advancements in generative AI and multi-view reconstruction have introduced new ways to rapidly generate 3D content. However, to be useful for downstream...
2 MIN READ
Jul 24, 2024
Cell Imaging Feature Extraction and Morphology Clustering for Spatial Omics
VISTA-2D is a new foundational model from NVIDIA that can quickly and accurately perform cell segmentation, a fundamental task in cell imaging and spatial omics...
8 MIN READ
Jul 24, 2024
Researchers Use AI to Resurrect Extinct DNA for Fighting Pathogens
Researchers using AI are mining the DNA of long-extinct species like woolly mammoths and giant sloths, looking for ancient genomic secrets to help fight...
5 MIN READ
Jul 24, 2024
Developing Product Configurators with OpenUSD
Developers from advertising agencies to software vendors are empowering global brands to deliver hyperpersonalization for digital experiences and visual...
5 MIN READ
Jul 23, 2024
Accelerate AI Infrastructure Using an NVIDIA BlueField-3 DPU Integration with DDN Storage
As AI becomes integral to organizational innovation and competitive advantage, the need for efficient and scalable infrastructure is more critical than ever. A...
6 MIN READ
Jul 23, 2024
Customize Generative AI Models for Enterprise Applications with Llama 3.1
The newly unveiled Llama 3.1 collection of 8B, 70B, and 405B large language models (LLMs) is narrowing the gap between proprietary and open-source models. Their...
10 MIN READ
Jul 23, 2024
Supercharging Llama 3.1 across NVIDIA Platforms
Meta's Llama collection of large language models is the most popular family of foundation models in the open-source community today, supporting a variety of use cases....
8 MIN READ
Jul 23, 2024
Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever
Enterprises are sitting on a goldmine of data waiting to be used to improve efficiency, save money, and ultimately enable higher productivity. With generative...
6 MIN READ
Generative AI
Jul 26, 2024
Power Text-Generation Applications with Mistral NeMo 12B Running on a Single GPU
NVIDIA collaborated with Mistral to co-build the next-generation language model that achieves leading performance across benchmarks in its class. With a growing...
6 MIN READ
Jul 25, 2024
Revolutionizing Code Completion with Codestral Mamba, the Next-Gen Coding LLM
In the rapidly evolving field of generative AI, coding models have become indispensable tools for developers, enhancing productivity and precision in software...
5 MIN READ
Jul 25, 2024
Simulate Elastic Objects in Any Representation with NVIDIA Kaolin Library
Recent advancements in generative AI and multi-view reconstruction have introduced new ways to rapidly generate 3D content. However, to be useful for downstream...
2 MIN READ
Jul 24, 2024
Researchers Use AI to Resurrect Extinct DNA for Fighting Pathogens
Researchers using AI are mining the DNA of long-extinct species like woolly mammoths and giant sloths, looking for ancient genomic secrets to help fight...
5 MIN READ
Jul 24, 2024
Developing Product Configurators with OpenUSD
Developers from advertising agencies to software vendors are empowering global brands to deliver hyperpersonalization for digital experiences and visual...
5 MIN READ
Jul 23, 2024
Accelerate AI Infrastructure Using an NVIDIA BlueField-3 DPU Integration with DDN Storage
As AI becomes integral to organizational innovation and competitive advantage, the need for efficient and scalable infrastructure is more critical than ever. A...
6 MIN READ
Jul 23, 2024
Supercharging Llama 3.1 across NVIDIA Platforms
Meta's Llama collection of large language models is the most popular family of foundation models in the open-source community today, supporting a variety of use cases....
8 MIN READ
Jul 23, 2024
Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever
Enterprises are sitting on a goldmine of data waiting to be used to improve efficiency, save money, and ultimately enable higher productivity. With generative...
6 MIN READ
Jul 23, 2024
Build an Agentic RAG Pipeline with Llama 3.1 and NVIDIA NeMo Retriever NIMs
Employing retrieval-augmented generation (RAG) is an effective strategy for ensuring large language model (LLM) responses are up-to-date and not...
7 MIN READ
Jul 23, 2024
Creating Synthetic Data Using Llama 3.1 405B
Synthetic data isn’t about creating new information. It's about transforming existing information to create different variants. For over a decade, synthetic...
15 MIN READ
Jul 23, 2024
Customize Generative AI Models for Enterprise Applications with Llama 3.1
The newly unveiled Llama 3.1 collection of 8B, 70B, and 405B large language models (LLMs) is narrowing the gap between proprietary and open-source models. Their...
10 MIN READ
Jul 23, 2024
Transforming Telco Network Operations Centers with NVIDIA NeMo Retriever and NVIDIA NIM
Telecom companies are challenged with consistently meeting service level agreements (SLAs) for end customers that ensure network quality of service. This...
8 MIN READ
AI Foundation Models
Jul 26, 2024
Power Text-Generation Applications with Mistral NeMo 12B Running on a Single GPU
NVIDIA collaborated with Mistral to co-build the next-generation language model that achieves leading performance across benchmarks in its class. With a growing...
6 MIN READ
Jun 28, 2024
Transforming Financial Analysis with NVIDIA NIM
In financial services, portfolio managers and research analysts diligently sift through vast amounts of data to gain a competitive edge in investments. Making...
13 MIN READ
Jun 24, 2024
Addressing Medical Imaging Limitations with Synthetic Data Generation
Synthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is...
9 MIN READ
Jun 10, 2024
Introducing SDXL-Lightning: New Lightning-Fast Model on NVIDIA API Catalog
Create high-resolution images with remarkable efficiency using the advanced text-to-image generation model SDXL-Lightning, available and optimized now on the...
1 MIN READ
Jun 10, 2024
SOLAR-10.7B: Optimized Model Tailored for Instruction Following, Reasoning, and Mathematical Tasks
Enhance efficiency and performance in instruction-based NLP tasks with SOLAR-10.7B, especially in following instructions, reasoning, and mathematical tasks.
1 MIN READ
Jun 03, 2024
Breeze-7B: LLM Specialized for Traditional Chinese
The model demonstrates strong performance for tasks such as Q&A, multi-round chat, and summarization in both traditional Chinese and English.
1 MIN READ
Jun 03, 2024
BGE-M3: Advanced Multilingual Text Retrieval Model
Experience the versatile embedding model designed for multilingual, multi-functional, and multi-granularity text retrieval tasks, excelling in dense,...
1 MIN READ
May 30, 2024
Convert Natural Language to Code with CodeGemma
Experience the advanced LLM API for code generation, completion, mathematical reasoning, and instruction following with free cloud credits.
1 MIN READ
May 14, 2024
Generate Text Responses from Visual and Text Inputs with Google's New PaliGemma Model
With free NVIDIA cloud credits, you can start testing the model at scale on the API Catalog.
1 MIN READ
May 13, 2024
Regional LLMs SEA-LION and SeaLLM Serve Languages and Cultures of Southeast Asia
At the recent World Governments Summit in Dubai, NVIDIA CEO Jensen Huang emphasized the importance of sovereign AI, which refers to a nation’s capability to...
3 MIN READ
Apr 30, 2024
Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks
This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...
3 MIN READ
Apr 26, 2024
New LLM: Snowflake Arctic Model for SQL and Code Generation
Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text...
3 MIN READ
Simulation / Modeling / Design
Jul 26, 2024
Faster Insights from Luminary Cloud's Engineering Simulations with NVIDIA GPUs
Engineering simulation is used across industries to accelerate product development. Simulations are used to check the safety of aircraft, cars, and buildings,...
8 MIN READ
Jul 25, 2024
Simulate Elastic Objects in Any Representation with NVIDIA Kaolin Library
Recent advancements in generative AI and multi-view reconstruction have introduced new ways to rapidly generate 3D content. However, to be useful for downstream...
2 MIN READ
Jul 24, 2024
Developing Product Configurators with OpenUSD
Developers from advertising agencies to software vendors are empowering global brands to deliver hyperpersonalization for digital experiences and visual...
5 MIN READ
Jul 23, 2024
Transforming Telco Network Operations Centers with NVIDIA NeMo Retriever and NVIDIA NIM
Telecom companies are challenged with consistently meeting service level agreements (SLAs) for end customers that ensure network quality of service. This...
8 MIN READ
Jul 23, 2024
Automating Telco Network Design using NVIDIA NIM and NVIDIA NeMo
Telecom wireless network design demands streamlined processes and standardized approaches. Network architects, engineers, and IT professionals are challenged...
7 MIN READ
Jul 22, 2024
Get Hands-On Training at SIGGRAPH 2024
Complimentary training on OpenUSD, Digital Humans, LLMs, and more, with hands-on labs for Full Conference and Experience attendees.
1 MIN READ
Jul 22, 2024
Spotlight: HP 3D Printing Open Sources AI Surrogates for Additive Manufacturing Using NVIDIA Modulus
An open ecosystem for physics-informed machine learning (physics-ML) fosters innovation and AI engineering applications. Physics-ML embeds into the learning...
7 MIN READ
Jul 22, 2024
Fast and Differentiable Radio Maps with NVIDIA Instant RM
Wireless networks are the essential infrastructure that enables seamless connectivity. To ensure the best performance, whether in a single building or a whole...
4 MIN READ
Jul 19, 2024
Boosting AI-Driven Innovation in 6G with the AI-RAN Alliance, 3GPP, and O-RAN
The pace of 6G research and development is picking up as the 5G era crosses the midpoint of the decade-long cellular generation time frame. In this blog post,...
13 MIN READ
Jul 17, 2024
NVIDIA Transitions Fully Towards Open-Source GPU Kernel Modules
With the R515 driver, NVIDIA released a set of Linux GPU kernel modules in May 2022 as open source with dual GPL and MIT licensing. The initial release targeted...
7 MIN READ
Jul 16, 2024
Building an AI Agent for Supply Chain Optimization with NVIDIA NIM and cuOpt
Enterprises face significant challenges in making supply chain decisions that maximize profits while adapting quickly to dynamic changes. Optimal supply chain...
8 MIN READ
Jul 12, 2024
Boosting Mathematical Optimization Performance and Energy Efficiency on the NVIDIA Grace CPU
Mathematical optimization is a powerful tool that enables businesses and people to make smarter decisions and reach any number of goals—from improving...
4 MIN READ
Robotics
Jul 18, 2024
Webinar: Improving Robot Uptime Featuring Nav2 Autonomous Docking with NVIDIA Isaac ROS
Join Isaac ROS engineers and the founder of Open Navigation to explore the new Nav2 autonomous docking feature.
1 MIN READ
Jul 11, 2024
Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries
Most objects in home and industrial settings consist of multiple parts that must be assembled. While human workers typically perform assembly, in certain...
10 MIN READ
Jul 11, 2024
Spotlight: Siemens Energy Accelerates Power Grid Asset Simulation 10,000x Using NVIDIA Modulus
The world’s energy system is increasingly complex and distributed due to increasing renewable energy generation, decentralization of energy resources, and...
9 MIN READ
Jul 10, 2024
Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data
Large-scale, use-case-specific synthetic data has become increasingly important in real-world computer vision and AI workflows. That’s because digital twins...
14 MIN READ
Jun 25, 2024
AI-Enhanced Navigation Charts Safer Waters for Massive Ships
Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...
5 MIN READ
Jun 24, 2024
Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim
As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...
12 MIN READ
Jun 17, 2024
Closing the Sim-to-Real Gap: Training Spot Quadruped Locomotion with NVIDIA Isaac Lab
Developing effective locomotion policies for quadrupeds poses significant challenges in robotics due to the complex dynamics involved. Training quadrupeds to...
12 MIN READ
Jun 17, 2024
Supercharge Robotics Workflows with AI and Simulation Using NVIDIA Isaac Sim 4.0 and NVIDIA Isaac Lab
The era of AI robots powered by physical AI has arrived. Physical AI models understand their environments and autonomously complete complex tasks in the...
11 MIN READ
Jun 14, 2024
Level Up Your Skills with Five New NVIDIA Technical Courses
With AI introducing an unprecedented pace of technological innovation, staying ahead means keeping your skills up to date. The NVIDIA Developer Program gives...
4 MIN READ
Jun 13, 2024
Build OpenUSD Applications for the Cloud with NVIDIA Omniverse Kit 106 Milestone Release
NVIDIA Omniverse is a platform that enables you to build applications for complex 3D and industrial digitalization workflows based on Universal Scene...
5 MIN READ
Jun 05, 2024
Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK
NVIDIA Holoscan is the NVIDIA domain-agnostic multimodal real-time AI sensor processing platform that delivers the foundation for developers to build their...
6 MIN READ
Jun 04, 2024
Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA
NVIDIA JetPack SDK powers NVIDIA Jetson modules, offering a comprehensive solution for building end-to-end accelerated AI applications. JetPack 6 expands the...
12 MIN READ
Computer Vision / Video Analytics
Jul 26, 2024
Computed Tomography Organ and Disease Segmentation Using the NVIDIA VISTA-3D NIM Microservice
Over 300M computed tomography (CT) scans are performed globally, 85M in the US alone. Radiologists are looking for ways to speed up their workflow and generate...
9 MIN READ
Jul 17, 2024
Develop Generative AI-Powered Visual AI Agents for the Edge
An exciting breakthrough in AI technology—Vision Language Models (VLMs)—offers a more dynamic and flexible method for video analysis. VLMs enable users to...
9 MIN READ
Jul 10, 2024
Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data
Large-scale, use-case-specific synthetic data has become increasingly important in real-world computer vision and AI workflows. That’s because digital twins...
14 MIN READ
Jun 28, 2024
Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning
Full fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...
6 MIN READ
Jun 26, 2024
Improving Video Quality with the NVIDIA Video Codec SDK 12.2 for HEVC
NVIDIA Video Codec SDK provides a comprehensive set of APIs for hardware-accelerated video encode and decode on Windows and Linux. The 12.2 release improves...
7 MIN READ
Jun 26, 2024
Transforming Microsoft XLS and PPT Files into a Factory Digital Twin with OpenUSD
SyncTwin GmbH, a company that builds software to optimize production, intralogistics, and assembly, is on a mission to unlock industrial digital twins for small...
7 MIN READ
Jun 25, 2024
AI-Enhanced Navigation Charts Safer Waters for Massive Ships
Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...
5 MIN READ
Jun 24, 2024
Addressing Medical Imaging Limitations with Synthetic Data Generation
Synthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is...
9 MIN READ
Jun 24, 2024
Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim
As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...
12 MIN READ
Jun 18, 2024
Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0
Intelligent Transportation Systems (ITS) applications are becoming increasingly valuable and prevalent in modern urban environments. The benefits of using ITS...
11 MIN READ
Jun 17, 2024
Supercharge Robotics Workflows with AI and Simulation Using NVIDIA Isaac Sim 4.0 and NVIDIA Isaac Lab
The era of AI robots powered by physical AI has arrived. Physical AI models understand their environments and autonomously complete complex tasks in the...
11 MIN READ
Jun 06, 2024
MediaTek Integrates NVIDIA TAO Toolkit for IoT Edge AI Development
MediaTek is teaming with NVIDIA to integrate NVIDIA TAO training and pretrained models into its development workflow, bringing advanced AI and visual perception...
1 MIN READ
Data Science
Jul 24, 2024
Cell Imaging Feature Extraction and Morphology Clustering for Spatial Omics
VISTA-2D is a new foundational model from NVIDIA that can quickly and accurately perform cell segmentation, a fundamental task in cell imaging and spatial omics...
8 MIN READ
Jul 18, 2024
Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 2, Performance Tuning
In the first part of the series, we presented an overview of the IVF-PQ algorithm and explained how it builds on top of the IVF-Flat algorithm, using the...
14 MIN READ
Jul 18, 2024
Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 1, Deep Dive
In this blog post, we continue the series on accelerating vector search using cuVS. Our previous post in the series introduced IVF-Flat, a fast algorithm for...
14 MIN READ
Jul 17, 2024
Encoding and Compression Guide for Parquet String Data Using RAPIDS
Parquet writers provide encoding and compression options that are turned off by default. Enabling these options may provide better lossless compression for your...
10 MIN READ
Jul 16, 2024
Building an AI Agent for Supply Chain Optimization with NVIDIA NIM and cuOpt
Enterprises face significant challenges in making supply chain decisions that maximize profits while adapting quickly to dynamic changes. Optimal supply chain...
8 MIN READ
Jul 15, 2024
Unlock Gene Networks Using Limited Data with AI Model Geneformer
Geneformer is a recently introduced and powerful AI model that learns gene network dynamics and interactions using transfer learning from vast single-cell...
6 MIN READ
Jul 12, 2024
Event: WeAreDevelopers World Congress 2024
Join NVIDIA at WeAreDevelopers July 17-19 to learn how accelerated computing tools powered by GPUs are shaping the future.
1 MIN READ
Jul 12, 2024
Boosting Mathematical Optimization Performance and Energy Efficiency on the NVIDIA Grace CPU
Mathematical optimization is a powerful tool that enables businesses and people to make smarter decisions and reach any number of goals—from improving...
4 MIN READ
Jul 11, 2024
Defending AI Model Files from Unauthorized Access with Canaries
As AI models grow in capability and cost of creation, and hold more sensitive or proprietary data, securing them at rest is increasingly important....
6 MIN READ
Jul 11, 2024
Optimize AI Model Performance and Maintain Data Privacy with Hybrid RAG
The rapidly evolving field of generative AI is focused on building neural networks that can create realistic content such as text, images, audio, and synthetic...
7 MIN READ
Jul 09, 2024
Just Released: nvmath-python
nvmath-python is an open-source Python library that provides high-performance access to the core mathematical operations in the NVIDIA Math Libraries. Available...
1 MIN READ
Jul 05, 2024
Explainer: What Is K-Means?
K-means is a clustering algorithm—one of the simplest and most popular unsupervised machine learning (ML) algorithms for data scientists.
1 MIN READ
Content Creation / Rendering
Jul 25, 2024
Simulate Elastic Objects in Any Representation with NVIDIA Kaolin Library
Recent advancements in generative AI and multi-view reconstruction have introduced new ways to rapidly generate 3D content. However, to be useful for downstream...
2 MIN READ
Jul 24, 2024
Developing Product Configurators with OpenUSD
Developers from advertising agencies to software vendors are empowering global brands to deliver hyperpersonalization for digital experiences and visual...
5 MIN READ
Jul 22, 2024
Get Hands-On Training at SIGGRAPH 2024
Complimentary training on OpenUSD, Digital Humans, LLMs, and more, with hands-on labs for Full Conference and Experience attendees.
1 MIN READ
Jul 18, 2024
Spotlight: UneeQ Revolutionizes Customer Engagement with AI-Powered Digital Humans
With the rise of chatbots and virtual assistants, customer interactions have evolved to embrace the versatility of voice and text inputs. However, integrating...
4 MIN READ
Jul 10, 2024
Understanding Diffusion Models: An Essential Guide for AEC Professionals
Generative AI, the ability of algorithms to process various types of inputs—such as text, images, audio, video, and code—and generate new content, is...
13 MIN READ
Jun 26, 2024
Improving Video Quality with the NVIDIA Video Codec SDK 12.2 for HEVC
NVIDIA Video Codec SDK provides a comprehensive set of APIs for hardware-accelerated video encode and decode on Windows and Linux. The 12.2 release improves...
7 MIN READ
Jun 13, 2024
Build OpenUSD Applications for the Cloud with NVIDIA Omniverse Kit 106 Milestone Release
NVIDIA Omniverse is a platform that enables you to build applications for complex 3D and industrial digitalization workflows based on Universal Scene...
5 MIN READ
Jun 10, 2024
Reallusion Brings Digital Characters to Life with NVIDIA AI
In today's digital age, creating realistic animated characters is crucial for filmmakers, game developers, and content creators looking to bring their visions...
6 MIN READ
Jun 06, 2024
Enhancing Low-Resolution SDR Video with the NVIDIA RTX Video SDK
NVIDIA RTX Video is a collection of AI video enhancements that improve the visual quality of lower-quality video. RTX Video Super Resolution was announced...
2 MIN READ
Jun 04, 2024
Build Lifelike Digital Humans with NVIDIA ACE, Now Generally Available
NVIDIA ACE—a suite of technologies bringing digital humans to life with generative AI—is now generally available for developers. Packaged as NVIDIA NIMs,...
5 MIN READ
May 31, 2024
How to Train an Object Detection Model for Visual Inspection with Synthetic Data
AI is rapidly changing industrial visual inspection. In a factory setting, visual inspection is used for many issues, including detecting defects and missing or...
8 MIN READ
May 16, 2024
Webinar: Path Traced Visuals in Unreal Engine
Integrate RTX into your own game and understand what ReSTIR means for the future of real-time lighting in this May 21 webinar.
1 MIN READ
Conversational AI
Jul 18, 2024
Spotlight: UneeQ Revolutionizes Customer Engagement with AI-Powered Digital Humans
With the rise of chatbots and virtual assistants, customer interactions have evolved to embrace the versatility of voice and text inputs. However, integrating...
4 MIN READ
Jul 18, 2024
Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 2, Performance Tuning
In the first part of the series, we presented an overview of the IVF-PQ algorithm and explained how it builds on top of the IVF-Flat algorithm, using the...
14 MIN READ
Jul 18, 2024
Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 1, Deep Dive
In this blog post, we continue the series on accelerating vector search using cuVS. Our previous post in the series introduced IVF-Flat, a fast algorithm for...
14 MIN READ
Jul 17, 2024
NVIDIA NeMo Accelerates LLM Innovation with Hybrid State Space Model Support
Today’s large language models (LLMs) are based on the transformer model architecture introduced in 2017. Since then, rapid advances in AI compute performance...
7 MIN READ
Jul 16, 2024
New Workshops: Customize LLMs, Build and Deploy Large Neural Networks
Register now for an instructor-led public workshop in July, August or September. Space is limited.
1 MIN READ
Jul 12, 2024
Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities
First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of...
11 MIN READ
Jul 02, 2024
Addressing Hallucinations in Speech Synthesis LLMs with the NVIDIA NeMo T5-TTS Model
NVIDIA NeMo has released the T5-TTS model, a significant advancement in text-to-speech (TTS) technology. Based on large language models (LLMs), T5-TTS produces...
4 MIN READ
Jun 28, 2024
Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning
Full fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...
6 MIN READ
Jun 26, 2024
Generate High-Quality, Context-Aware Responses for Chatbots and Search Engines with Llama 3-ChatQA
Experience and test Llama3-ChatQA models at scale with the performance-optimized NVIDIA NIM inference microservice, using the NVIDIA API catalog.
1 MIN READ
Jun 20, 2024
AI Brain Implant Restores Bilingual Communication for Stroke Survivor
Scientists have enabled a stroke survivor, who is unable to speak, to communicate in both Spanish and English by training a neuroprosthesis implant to decode...
3 MIN READ
Jun 12, 2024
Introducing Grouped GEMM APIs in cuBLAS and More Performance Updates
The latest release of the NVIDIA cuBLAS library, version 12.5, continues to deliver functionality and performance to deep learning (DL) and high-performance...
7 MIN READ
Jun 10, 2024
NVIDIA Text Embedding Model Tops MTEB Leaderboard
The latest embedding model from NVIDIA—NV-Embed—set a new record for embedding accuracy with a score of 69.32 on the Massive Text Embedding Benchmark...
6 MIN READ
Edge Computing
Jul 22, 2024
Spotlight: HP 3D Printing Open Sources AI Surrogates for Additive Manufacturing Using NVIDIA Modulus
An open ecosystem for physics-informed machine learning (physics-ML) fosters innovation and AI engineering applications. Physics-ML embeds into the learning...
7 MIN READ
Jul 19, 2024
Boosting AI-Driven Innovation in 6G with the AI-RAN Alliance, 3GPP, and O-RAN
The pace of 6G research and development is picking up as the 5G era crosses the midpoint of the decade-long cellular generation time frame. In this blog post,...
13 MIN READ
Jul 18, 2024
Webinar: Improving Robot Uptime Featuring Nav2 Autonomous Docking with NVIDIA Isaac ROS
Join Isaac ROS engineers and the founder of Open Navigation to explore the new Nav2 autonomous docking feature.
1 MIN READ
Jul 17, 2024
Develop Generative AI-Powered Visual AI Agents for the Edge
An exciting breakthrough in AI technology—Vision Language Models (VLMs)—offers a more dynamic and flexible method for video analysis. VLMs enable users to...
9 MIN READ
Jul 03, 2024
Powering the Future of AI-Enabled Medical Devices with NVIDIA Holoscan and RTI Connext
The demand for real-time insights and autonomous decision-making is growing across industries, and healthcare and medical devices are no exception. Relying on...
8 MIN READ
Jun 28, 2024
Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning
Full fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...
6 MIN READ
Jun 25, 2024
AI-Enhanced Navigation Charts Safer Waters for Massive Ships
Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...
5 MIN READ
Jun 18, 2024
Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0
Intelligent Transportation Systems (ITS) applications are becoming increasingly valuable and prevalent in modern urban environments. The benefits of using ITS...
11 MIN READ
Jun 11, 2024
Maximum Performance and Minimum Footprint for AI Apps with NVIDIA TensorRT Weight-Stripped Engines
NVIDIA TensorRT, an established inference library for data centers, has rapidly emerged as a desirable inference backend for NVIDIA GeForce RTX and NVIDIA RTX...
8 MIN READ
Jun 06, 2024
MediaTek Integrates NVIDIA TAO Toolkit for IoT Edge AI Development
MediaTek is teaming with NVIDIA to integrate NVIDIA TAO training and pretrained models into its development workflow, bringing advanced AI and visual perception...
1 MIN READ
Jun 05, 2024
Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK
NVIDIA Holoscan is the NVIDIA domain-agnostic multimodal real-time AI sensor processing platform that delivers the foundation for developers to build their...
6 MIN READ
Jun 04, 2024
Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA
NVIDIA JetPack SDK powers NVIDIA Jetson modules, offering a comprehensive solution for building end-to-end accelerated AI applications. JetPack 6 expands the...
12 MIN READ
Data Center / Cloud
Jul 26, 2024
Faster Insights from Luminary Cloud's Engineering Simulations with NVIDIA GPUs
Engineering simulation is used across industries to accelerate product development. Simulations are used to check the safety of aircraft, cars, and buildings,...
8 MIN READ
Jul 26, 2024
Computed Tomography Organ and Disease Segmentation Using the NVIDIA VISTA-3D NIM Microservice
Over 300M computed tomography (CT) scans are performed globally, 85M in the US alone. Radiologists are looking for ways to speed up their workflow and generate...
9 MIN READ
Jul 24, 2024
Cell Imaging Feature Extraction and Morphology Clustering for Spatial Omics
VISTA-2D is a new foundational model from NVIDIA that can quickly and accurately perform cell segmentation, a fundamental task in cell imaging and spatial omics...
8 MIN READ
Jul 23, 2024
Accelerate AI Infrastructure Using an NVIDIA BlueField-3 DPU Integration with DDN Storage
As AI becomes integral to organizational innovation and competitive advantage, the need for efficient and scalable infrastructure is more critical than ever. A...
6 MIN READ
Jul 22, 2024
Spotlight: HP 3D Printing Open Sources AI Surrogates for Additive Manufacturing Using NVIDIA Modulus
An open ecosystem for physics-informed machine learning (physics-ML) fosters innovation and AI engineering applications. Physics-ML embeds into the learning...
7 MIN READ
Jul 19, 2024
Boosting AI-Driven Innovation in 6G with the AI-RAN Alliance, 3GPP, and O-RAN
The pace of 6G research and development is picking up as the 5G era crosses the midpoint of the decade-long cellular generation time frame. In this blog post,...
13 MIN READ
Jul 18, 2024
Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 2, Performance Tuning
In the first part of the series, we presented an overview of the IVF-PQ algorithm and explained how it builds on top of the IVF-Flat algorithm, using the...
14 MIN READ
Jul 18, 2024
Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 1, Deep Dive
In this blog post, we continue the series on accelerating vector search using cuVS. Our previous post in the series introduced IVF-Flat, a fast algorithm for...
14 MIN READ
Jul 17, 2024
NVIDIA Transitions Fully Towards Open-Source GPU Kernel Modules
With the R515 driver, NVIDIA released a set of Linux GPU kernel modules in May 2022 as open source with dual GPL and MIT licensing. The initial release targeted...
7 MIN READ
Jul 15, 2024
Power Your AI Projects with New NVIDIA NIMs for Mistral and Mixtral Models
Large language models (LLMs) are growing in adoption across enterprise organizations, with many building them into their AI applications. Foundation models are...
5 MIN READ
Jul 12, 2024
Event: WeAreDevelopers World Congress 2024
Join NVIDIA at WeAreDevelopers July 17-19 to learn how accelerated computing tools powered by GPUs are shaping the future.
1 MIN READ
Jul 09, 2024
Just Released: nvmath-python
nvmath-python is an open-source Python library that provides high-performance access to the core mathematical operations in the NVIDIA Math Libraries. Available...
1 MIN READ