- Generative AI: NVIDIA Text Embedding Model Tops MTEB Leaderboard
- Generative AI: Build Lifelike Digital Humans with NVIDIA ACE, Now Generally Available
- Robotics: Create, Design, and Deploy Robotics Applications Using New NVIDIA Isaac Foundation Models and Workflows
- Generative AI: Generative AI Agents Developer Contest: Top Tips for Getting Started
- Generative AI: Create Content, Conversations, and Code with New Phi-3 and Granite Code Model Families
Recent
Jun 11, 2024
Maximum Performance and Minimum Footprint for AI Apps with NVIDIA TensorRT Weight-Stripped Engines
NVIDIA TensorRT, an established inference library for data centers, has rapidly emerged as a desirable inference backend for NVIDIA GeForce RTX and NVIDIA RTX...
8 MIN READ
Jun 10, 2024
Confidential and Self-Sovereign AI: Best Practices for Enhancing Security and Autonomy
Join the webinar on June 11th with NVIDIA and Super Protocol to learn about the benefits of Confidential Computing for Web3 AI.
1 MIN READ
Jun 10, 2024
Spotlight: Cisco Enhances Workload Security and Operational Efficiency with NVIDIA BlueField-3 DPUs
As cyberattacks become more sophisticated, organizations must constantly adapt with cutting-edge solutions to protect their critical assets. One such solution...
7 MIN READ
Jun 10, 2024
Reallusion Brings Digital Characters to Life with NVIDIA AI
In today's digital age, creating realistic animated characters is crucial for filmmakers, game developers, and content creators looking to bring their visions...
6 MIN READ
Jun 10, 2024
NVIDIA Text Embedding Model Tops MTEB Leaderboard
The latest embedding model from NVIDIA—NV-Embed—set a new record for embedding accuracy with a score of 69.32 on the Massive Text Embedding Benchmark...
6 MIN READ
Jun 10, 2024
Introducing SDXL-Lightning: New Lightning-Fast Model on NVIDIA API Catalog
Create high-resolution images with remarkable efficiency using the advanced text-to-image generation model, SDXL-Lightning, available and optimized now on the...
1 MIN READ
Jun 10, 2024
SOLAR-10.7B: Optimized Model Tailored Instruction Following, Reasoning, and Mathematical Tasks
Enhance efficiency and performance in instruction-based NLP tasks with SOLAR-10.7B, especially in following instructions, reasoning, and mathematical tasks.
1 MIN READ
Jun 07, 2024
Explainer: What Is Generative AI?
Generative AI enables users to quickly generate new content based on a variety of inputs. Inputs and outputs to these models can include text, images, sounds,...
1 MIN READ
Jun 07, 2024
Seamlessly Deploying a Swarm of LoRA Adapters with NVIDIA NIM
The latest state-of-the-art foundation large language models (LLMs) have billions of parameters and are pretrained on trillions of tokens of input text. They...
11 MIN READ
Jun 06, 2024
MediaTek Integrates NVIDIA TAO Toolkit for IoT Edge AI Development
MediaTek is teaming with NVIDIA to integrate NVIDIA TAO training and pretrained models into its development workflow, bringing advanced AI and visual perception...
1 MIN READ
Jun 06, 2024
Enhancing Low-Resolution SDR Video with the NVIDIA RTX Video SDK
NVIDIA RTX Video is a collection of AI video enhancements that improve the visual quality of lower-quality video. RTX Video Super Resolution was announced...
2 MIN READ
Jun 05, 2024
Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK
NVIDIA Holoscan is the NVIDIA domain-agnostic multimodal real-time AI sensor processing platform that delivers the foundation for developers to build their...
6 MIN READ
Generative AI
Jun 11, 2024
Maximum Performance and Minimum Footprint for AI Apps with NVIDIA TensorRT Weight-Stripped Engines
NVIDIA TensorRT, an established inference library for data centers, has rapidly emerged as a desirable inference backend for NVIDIA GeForce RTX and NVIDIA RTX...
8 MIN READ
Jun 10, 2024
NVIDIA Text Embedding Model Tops MTEB Leaderboard
The latest embedding model from NVIDIA—NV-Embed—set a new record for embedding accuracy with a score of 69.32 on the Massive Text Embedding Benchmark...
6 MIN READ
Jun 10, 2024
Introducing SDXL-Lightning: New Lightning-Fast Model on NVIDIA API Catalog
Create high-resolution images with remarkable efficiency using the advanced text-to-image generation model, SDXL-Lightning, available and optimized now on the...
1 MIN READ
Jun 10, 2024
SOLAR-10.7B: Optimized Model Tailored Instruction Following, Reasoning, and Mathematical Tasks
Enhance efficiency and performance in instruction-based NLP tasks with SOLAR-10.7B, especially in following instructions, reasoning, and mathematical tasks.
1 MIN READ
Jun 07, 2024
Explainer: What Is Generative AI?
Generative AI enables users to quickly generate new content based on a variety of inputs. Inputs and outputs to these models can include text, images, sounds,...
1 MIN READ
Jun 07, 2024
Seamlessly Deploying a Swarm of LoRA Adapters with NVIDIA NIM
The latest state-of-the-art foundation large language models (LLMs) have billions of parameters and are pretrained on trillions of tokens of input text. They...
11 MIN READ
Jun 06, 2024
MediaTek Integrates NVIDIA TAO Toolkit for IoT Edge AI Development
MediaTek is teaming with NVIDIA to integrate NVIDIA TAO training and pretrained models into its development workflow, bringing advanced AI and visual perception...
1 MIN READ
Jun 06, 2024
Enhancing Low-Resolution SDR Video with the NVIDIA RTX Video SDK
NVIDIA RTX Video is a collection of AI video enhancements that improve the visual quality of lower-quality video. RTX Video Super Resolution was announced...
2 MIN READ
Jun 04, 2024
Unlock Deeper Insights of Somatic Mutations with Deep Learning
NVIDIA Parabricks expands the NVIDIA emphasis on solving omics challenges with deep learning and continues accelerating genomics instruments. NVIDIA Parabricks...
5 MIN READ
Jun 04, 2024
Build Lifelike Digital Humans with NVIDIA ACE, Now Generally Available
NVIDIA ACE—a suite of technologies bringing digital humans to life with generative AI—is now generally available for developers. Packaged as NVIDIA NIMs,...
5 MIN READ
Jun 03, 2024
Breeze-7B: LLM Specialized for Traditional Chinese
The model demonstrates strong performance for tasks such as Q&A, multi-round chat, and summarization in both traditional Chinese and English.
1 MIN READ
Jun 03, 2024
BGE-M3: Advanced Multilingual Text Retrieval Model
Experience the versatile embedding model designed for multilingual, multi-functional, and multi-granularity text retrieval tasks, excelling in dense,...
1 MIN READ
AI Foundation Models
Jun 10, 2024
Introducing SDXL-Lightning: New Lightning-Fast Model on NVIDIA API Catalog
Create high-resolution images with remarkable efficiency using the advanced text-to-image generation model, SDXL-Lightning, available and optimized now on the...
1 MIN READ
Jun 10, 2024
SOLAR-10.7B: Optimized Model Tailored Instruction Following, Reasoning, and Mathematical Tasks
Enhance efficiency and performance in instruction-based NLP tasks with SOLAR-10.7B, especially in following instructions, reasoning, and mathematical tasks.
1 MIN READ
Jun 03, 2024
BGE-M3: Advanced Multilingual Text Retrieval Model
Experience the versatile embedding model designed for multilingual, multi-functional, and multi-granularity text retrieval tasks, excelling in dense,...
1 MIN READ
Jun 03, 2024
Breeze-7B: LLM Specialized for Traditional Chinese
The model demonstrates strong performance for tasks such as Q&A, multi-round chat, and summarization in both traditional Chinese and English.
1 MIN READ
May 30, 2024
Convert Natural Language to Code with CodeGemma
Experience the advanced LLM API for code generation, completion, mathematical reasoning, and instruction following with free cloud credits.
1 MIN READ
May 14, 2024
Generate Text Responses from Visual and Text Inputs with Google's New PaliGemma Model
With free NVIDIA cloud credits, you can start testing the model at scale on the API Catalog.
1 MIN READ
May 13, 2024
Regional LLMs SEA-LION and SeaLLM Serve Languages and Cultures of Southeast Asia
At the recent World Governments Summit in Dubai, NVIDIA CEO Jensen Huang emphasized the importance of sovereign AI, which refers to a nation’s capability to...
3 MIN READ
Apr 30, 2024
Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks
This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...
3 MIN READ
Apr 26, 2024
New LLM: Snowflake Arctic Model for SQL and Code Generation
Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text...
3 MIN READ
Apr 22, 2024
Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API
This week’s model release features two new NVIDIA AI Foundation models, Mistral Large and Mixtral 8x22B, both developed by Mistral AI. These cutting-edge...
4 MIN READ
Mar 18, 2024
Scale AI-Enabled Robotics Development Workloads with NVIDIA OSMO
Autonomous machine development is an iterative process of data generation and gathering, model training, and deployment characterized by complex multi-stage,...
4 MIN READ
Mar 04, 2024
Solve Complex AI Tasks with Leaderboard-Topping Smaug 72B from NVIDIA AI Foundation Models
This week’s model release features the NVIDIA-optimized language model Smaug 72B, which you can experience directly from your browser. NVIDIA AI Foundation...
2 MIN READ
Simulation / Modeling / Design
Jun 02, 2024
Wistron Advances Energy Efficiency in Manufacturing with AI and NVIDIA Omniverse
With the growing emphasis on environmental, social, and governance (ESG) investments and initiatives, manufacturers are looking for new ways to increase energy...
5 MIN READ
Jun 02, 2024
Pegatron Simulates and Optimizes Factory Operations with AI-Enabled Digital Twins
Manufacturers face increased pressures to shorten production cycles, enhance productivity, and improve quality, all while reducing costs. To address these...
5 MIN READ
Jun 02, 2024
Optimize Processes for Large Spaces with the Multi-Camera Tracking Workflow
Large areas like warehouses, factories, stadiums, and airports are typically monitored by hundreds of cameras to improve safety and optimize operations....
11 MIN READ
May 22, 2024
Just Released: NVIDIA HPC SDK 24.5
NVIDIA HPC SDK 24.5 updates include support for new NVPL components and CUDA 12.4.
1 MIN READ
May 22, 2024
Just Released: Nsight Compute 2024.2
Nsight Compute 2024.2 adds Python syntax highlighting and call stacks, a redesigned report header, and source page statistics to make CUDA optimization easier.
1 MIN READ
May 21, 2024
Just Released: CUDA Toolkit 12.5
CUDA Toolkit 12.5 supports new NVIDIA L20 and H20 GPUs and simultaneous compute and graphics to DirectX, and updates Nsight Compute and CUDA-X Libraries.
1 MIN READ
May 21, 2024
Quantum Mechanics-Enhanced Drug Discovery Using QUELO-G and CUDA Graphs
In drug discovery, approaches based on the so-called classical force field have been routinely used and considered useful. However, it is also widely recognized...
9 MIN READ
May 16, 2024
Webinar: Path Traced Visuals in Unreal Engine
Integrate RTX into your own game and understand what ReSTIR means for the future of real-time lighting in this May 21 webinar.
1 MIN READ
May 15, 2024
NVIDIA Presents New Robotics Research on Geometric Fabrics, Surgical Robots, and More at ICRA
During the IEEE International Conference on Robotics and Automation (ICRA) May 13-17 in Yokohama, Japan, many people will be discussing geometric fabrics. That...
7 MIN READ
May 12, 2024
New NVIDIA CUDA-Q Features Boost Quantum Application Performance
NVIDIA CUDA-Q (formerly NVIDIA CUDA Quantum) is an open-source programming model for building quantum accelerated supercomputing applications that take full...
5 MIN READ
May 12, 2024
Advanced AI and Retrieval-Augmented Generation for Code Development in High-Performance Computing
In the rapidly evolving field of software development, AI tools such as chatbots and GitHub Copilot have significantly transformed how developers write and...
8 MIN READ
May 12, 2024
Enabling Quantum Computing with AI
Building a useful quantum computer in practice is incredibly challenging. Significant improvements are needed in the scale, fidelity, speed, reliability, and...
6 MIN READ
Robotics
Jun 05, 2024
Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK
NVIDIA Holoscan is the NVIDIA domain-agnostic multimodal real-time AI sensor processing platform that delivers the foundation for developers to build their...
6 MIN READ
Jun 04, 2024
Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA
NVIDIA JetPack SDK powers NVIDIA Jetson modules, offering a comprehensive solution for building end-to-end accelerated AI applications. JetPack 6 expands the...
12 MIN READ
Jun 02, 2024
Create, Design, and Deploy Robotics Applications Using New NVIDIA Isaac Foundation Models and Workflows
The application of robotics is rapidly expanding in diverse environments such as smart manufacturing facilities, commercial kitchens, hospitals, warehouse...
10 MIN READ
Jun 02, 2024
Optimize Processes for Large Spaces with the Multi-Camera Tracking Workflow
Large areas like warehouses, factories, stadiums, and airports are typically monitored by hundreds of cameras to improve safety and optimize operations....
11 MIN READ
Jun 02, 2024
Production-Ready, Enterprise-Grade Software on NVIDIA IGX Platform, Support for NVIDIA RTX 6000 ADA, and More
Real-time AI at the edge is crucial for medical, industrial, and scientific computing because these mission-critical applications require immediate data...
6 MIN READ
May 15, 2024
NVIDIA Presents New Robotics Research on Geometric Fabrics, Surgical Robots, and More at ICRA
During the IEEE International Conference on Robotics and Automation (ICRA) May 13-17 in Yokohama, Japan, many people will be discussing geometric fabrics. That...
7 MIN READ
May 14, 2024
NVIDIA DeepStream 7.0 Milestone Release for Next-Gen Vision AI Development
NVIDIA DeepStream is a powerful SDK that unlocks GPU-accelerated building blocks to build end-to-end vision AI pipelines. With more than 40 plugins available...
11 MIN READ
May 08, 2024
Mitigating Occlusions in Visual Perception Using Single-View 3D Tracking in NVIDIA DeepStream
When it comes to perception for Intelligent Video Analytics (IVA) applications such as traffic monitoring, warehouse safety, and retail shopper analytics, one...
8 MIN READ
May 06, 2024
Automating Smart Pick-and-Place with Intrinsic Flowstate and NVIDIA Isaac Manipulator
We are announcing our collaboration with Intrinsic.ai on learning foundation skill models for industrial robotics tasks. Many pick-and-place problems in...
4 MIN READ
May 03, 2024
Visual Language Intelligence and Edge AI 2.0
VILA is a family of high-performance vision language models developed by NVIDIA Research and MIT. The largest model comes with ~40B parameters and the smallest...
8 MIN READ
Apr 26, 2024
Perception Model Training for Autonomous Vehicles with Tensor Parallelism
Due to the adoption of multicamera inputs and deep convolutional backbone networks, the GPU memory footprint for training autonomous driving perception models...
10 MIN READ
Apr 22, 2024
Developing Virtual Factory Solutions with OpenUSD and NVIDIA Omniverse
With NVIDIA AI, NVIDIA Omniverse, and the Universal Scene Description (OpenUSD) ecosystem, industrial developers are building virtual factory solutions that...
4 MIN READ
Computer Vision / Video Analytics
Jun 06, 2024
MediaTek Integrates NVIDIA TAO Toolkit for IoT Edge AI Development
MediaTek is teaming with NVIDIA to integrate NVIDIA TAO training and pretrained models into its development workflow, bringing advanced AI and visual perception...
1 MIN READ
Jun 05, 2024
Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK
NVIDIA Holoscan is the NVIDIA domain-agnostic multimodal real-time AI sensor processing platform that delivers the foundation for developers to build their...
6 MIN READ
Jun 04, 2024
Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA
NVIDIA JetPack SDK powers NVIDIA Jetson modules, offering a comprehensive solution for building end-to-end accelerated AI applications. JetPack 6 expands the...
12 MIN READ
Jun 02, 2024
Optimize Processes for Large Spaces with the Multi-Camera Tracking Workflow
Large areas like warehouses, factories, stadiums, and airports are typically monitored by hundreds of cameras to improve safety and optimize operations....
11 MIN READ
May 17, 2024
Enhancing the Apparel Shopping Experience with AI, Emoji-Aware OCR, and Snapchat's Screenshop
Ever spotted someone in a photo wearing a cool shirt or some unique apparel and wondered where they got it? How much did it cost? Maybe you've even thought...
8 MIN READ
May 14, 2024
NVIDIA DeepStream 7.0 Milestone Release for Next-Gen Vision AI Development
NVIDIA DeepStream is a powerful SDK that unlocks GPU-accelerated building blocks to build end-to-end vision AI pipelines. With more than 40 plugins available...
11 MIN READ
May 08, 2024
Mitigating Occlusions in Visual Perception Using Single-View 3D Tracking in NVIDIA DeepStream
When it comes to perception for Intelligent Video Analytics (IVA) applications such as traffic monitoring, warehouse safety, and retail shopper analytics, one...
8 MIN READ
May 03, 2024
Visual Language Intelligence and Edge AI 2.0
VILA is a family of high-performance vision language models developed by NVIDIA Research and MIT. The largest model comes with ~40B parameters and the smallest...
8 MIN READ
May 03, 2024
Visual Language Models on NVIDIA Hardware with VILA
Visual language models have evolved significantly recently. However, the existing technology typically only supports one single image. They cannot reason among...
11 MIN READ
Apr 26, 2024
Perception Model Training for Autonomous Vehicles with Tensor Parallelism
Due to the adoption of multicamera inputs and deep convolutional backbone networks, the GPU memory footprint for training autonomous driving perception models...
10 MIN READ
Apr 22, 2024
Advancing Cell Segmentation and Morphology Analysis with NVIDIA AI Foundation Model VISTA-2D
Genomics researchers use different sequencing techniques to better understand biological systems, including single-cell and spatial omics. Unlike single-cell,...
7 MIN READ
Apr 17, 2024
Advancing Medical Image Decoding with GPU-Accelerated nvImageCodec
This post delves into the capabilities of decoding DICOM medical images within AWS HealthImaging using the nvJPEG2000 library. We'll guide you through the...
16 MIN READ
Data Science
Jun 07, 2024
Explainer: What Is Generative AI?
Generative AI enables users to quickly generate new content based on a variety of inputs. Inputs and outputs to these models can include text, images, sounds,...
1 MIN READ
Jun 04, 2024
Unlock Deeper Insights of Somatic Mutations with Deep Learning
NVIDIA Parabricks expands the NVIDIA emphasis on solving omics challenges with deep learning and continues accelerating genomics instruments. NVIDIA Parabricks...
5 MIN READ
May 31, 2024
Explainer: What Is a Recommendation System?
A recommendation system (or recommender system) is a class of machine learning that uses data to help predict, narrow down, and find what people are looking for...
1 MIN READ
May 24, 2024
Explainer: What Is Deep Learning?
Deep learning is a subset of artificial intelligence (AI) and machine learning (ML) that uses multi-layered artificial neural networks to deliver...
1 MIN READ
May 23, 2024
Applying Generative AI for CVE Analysis at an Enterprise Scale
The software development and deployment process is complex. Modern enterprise applications have complex software dependencies, forming an interconnected web...
11 MIN READ
May 21, 2024
Just Released: CUDA Toolkit 12.5
CUDA Toolkit 12.5 supports new NVIDIA L20 and H20 GPUs and simultaneous compute and graphics to DirectX, and updates Nsight Compute and CUDA-X Libraries.
1 MIN READ
May 21, 2024
Curating Custom Datasets for LLM Training with NVIDIA NeMo Curator
Data curation is the first, and arguably the most important, step in the pretraining and continuous training of large language models (LLMs) and small language...
14 MIN READ
May 17, 2024
Explainer: What Is Regression?
Classification and regression are two groups of supervised machine-learning algorithm problems. Supervised machine learning uses algorithms to train a model to...
1 MIN READ
May 14, 2024
RAPIDS on Databricks: A Guide to GPU-Accelerated Data Processing
In today's data-driven landscape, maximizing performance and efficiency in data processing and analytics is critical. While many Databricks users are familiar...
10 MIN READ
May 14, 2024
RAPIDS cuDF Instantly Accelerates pandas up to 50x on Google Colab
At Google I/O'24, Laurence Moroney, head of AI Advocacy at Google, announced that RAPIDS cuDF is now integrated into Google Colab. Developers can now instantly...
3 MIN READ
May 10, 2024
Explainer: What Is Artificial Intelligence?
In its most fundamental form, AI is the capability of a computer program or a machine to think, learn, and take action without being explicitly encoded with...
1 MIN READ
May 09, 2024
Revolutionizing Graph Analytics: Next-Gen Architecture with NVIDIA cuGraph Acceleration
In our previous exploration of graph analytics, we uncovered the transformative power of GPU-CPU fusion using NVIDIA cuGraph. Building upon those insights, we...
9 MIN READ
Content Creation / Rendering
Jun 10, 2024
Reallusion Brings Digital Characters to Life with NVIDIA AI
In today's digital age, creating realistic animated characters is crucial for filmmakers, game developers, and content creators looking to bring their visions...
6 MIN READ
Jun 06, 2024
Enhancing Low-Resolution SDR Video with the NVIDIA RTX Video SDK
NVIDIA RTX Video is a collection of AI video enhancements that improve the visual quality of lower-quality video. RTX Video Super Resolution was announced...
2 MIN READ
Jun 04, 2024
Build Lifelike Digital Humans with NVIDIA ACE, Now Generally Available
NVIDIA ACE—a suite of technologies bringing digital humans to life with generative AI—is now generally available for developers. Packaged as NVIDIA NIMs,...
5 MIN READ
May 16, 2024
Webinar: Path Traced Visuals in Unreal Engine
Integrate RTX into your own game and understand what ReSTIR means for the future of real-time lighting in this May 21 webinar.
1 MIN READ
Apr 29, 2024
GPU-Powered Windows 365 Cloud PCs with NVIDIA RTX Virtual Workstation for High-End Graphics Workloads
Professional workflows have become more complex with the increased demand for graphics-intensive scenarios. From regular office applications to demanding...
7 MIN READ
Apr 26, 2024
Enhance Text-to-Image Fine-Tuning with DRaFT+, Now Part of NVIDIA NeMo
Text-to-image diffusion models have been established as a powerful method for high-fidelity image generation based on given text. Nevertheless, diffusion models...
10 MIN READ
Apr 11, 2024
New Video Series: OpenUSD for Developers
Universal Scene Description, also called OpenUSD or USD, is an open and extensible framework for creating, editing, querying, rendering, collaborating, and...
3 MIN READ
Apr 09, 2024
Next-Generation Live Media Apps on Repurposable Clusters with NVIDIA Holoscan for Media
NVIDIA Holoscan for Media is now available to all developers looking to build next-generation live media applications on fully repurposable clusters. ...
4 MIN READ
Mar 21, 2024
Speed Up Your AI Development: NVIDIA AI Workbench Goes GA
NVIDIA AI Workbench, a toolkit for AI and ML developers, is now generally available as a free download. It features automation that removes roadblocks for...
4 MIN READ
Mar 21, 2024
Upgrade Your Graphics: Explore New Ray Tracing Features for NVIDIA Nsight Tools
The union of ray tracing and AI is pushing graphics fidelity and performance to new heights. Helping you build optimized, bug-free applications in this era of...
5 MIN READ
Mar 19, 2024
Generative AI for Digital Humans and New AI-powered NVIDIA RTX Lighting
At GDC 2024, NVIDIA announced that leading AI application developers such as Inworld AI are using NVIDIA digital human technologies to accelerate the deployment...
5 MIN READ
Mar 14, 2024
Powerful Shader Insights: Using Shader Debug Info with NVIDIA Nsight Graphics
As ray tracing becomes the predominant rendering technique in modern game engines, a single GPU RayGen shader can now perform most of the light simulation of a...
7 MIN READ
Conversational AI
Jun 10, 2024
NVIDIA Text Embedding Model Tops MTEB Leaderboard
The latest embedding model from NVIDIA—NV-Embed—set a new record for embedding accuracy with a score of 69.32 on the Massive Text Embedding Benchmark...
6 MIN READ
Jun 04, 2024
Build Lifelike Digital Humans with NVIDIA ACE, Now Generally Available
NVIDIA ACE—a suite of technologies bringing digital humans to life with generative AI—is now generally available for developers. Packaged as NVIDIA NIMs,...
5 MIN READ
Jun 02, 2024
Streamline Development of AI-Powered Apps with NVIDIA RTX AI Toolkit for Windows RTX PCs
NVIDIA today launched the NVIDIA RTX AI Toolkit, a collection of tools and SDKs for Windows application developers to customize, optimize, and deploy AI models...
8 MIN READ
May 31, 2024
Building Safer LLM Apps with LangChain Templates and NVIDIA NeMo Guardrails
An easily deployable reference architecture can help developers get to production faster with custom LLM use cases. LangChain Templates are a new way of...
7 MIN READ
May 30, 2024
Personalized Learning with Gipi, NVIDIA TensorRT-LLM, and AI Foundation Models
Over 1.2B people are actively learning new languages, with over 500M learners on digital learning platforms such as Duolingo. At the same time, a significant...
6 MIN READ
May 29, 2024
Generative AI Agents Developer Contest: Top Tips for Getting Started
Join our contest that runs through June 17 and showcase your innovation using cutting-edge generative AI-powered applications using NVIDIA and LangChain...
3 MIN READ
May 24, 2024
Accelerating Transformers with NVIDIA cuDNN 9
The NVIDIA CUDA Deep Neural Network library (cuDNN) is a GPU-accelerated library for accelerating deep learning primitives with state-of-the-art performance....
12 MIN READ
May 17, 2024
Training Localized Multilingual LLMs with NVIDIA NeMo, Part 2
In Part 1, we discussed how to train a monolingual tokenizer and merge it with a pretrained LLM’s tokenizer to form a multilingual tokenizer. In this post, we...
8 MIN READ
May 17, 2024
Training Localized Multilingual LLMs with NVIDIA NeMo, Part 1
In today's globalized world, the ability of AI systems to understand and communicate in diverse languages is increasingly crucial. Large language models (LLMs)...
14 MIN READ
May 15, 2024
Develop Secure, Reliable Medical Apps with RAG and NVIDIA NeMo Guardrails
Imagine an application that can sift through mountains of patient data, intelligently searching and answering questions about diagnoses, health histories, and...
6 MIN READ
May 13, 2024
Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 2
In the first post, we walked through the prerequisites for a neural machine translation example from English to Chinese, running the pretrained model with NeMo,...
11 MIN READ
May 13, 2024
Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 1
Neural machine translation (NMT) is an automatic task of translating a sequence of words from one language to another. In recent years, the development of...
8 MIN READ
Edge Computing
Jun 11, 2024
Maximum Performance and Minimum Footprint for AI Apps with NVIDIA TensorRT Weight-Stripped Engines
NVIDIA TensorRT, an established inference library for data centers, has rapidly emerged as a desirable inference backend for NVIDIA GeForce RTX and NVIDIA RTX...
8 MIN READ
Jun 06, 2024
MediaTek Integrates NVIDIA TAO Toolkit for IoT Edge AI Development
MediaTek is teaming with NVIDIA to integrate NVIDIA TAO training and pretrained models into its development workflow, bringing advanced AI and visual perception...
1 MIN READ
Jun 05, 2024
Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK
NVIDIA Holoscan is the NVIDIA domain-agnostic multimodal real-time AI sensor processing platform that delivers the foundation for developers to build their...
6 MIN READ
Jun 04, 2024
Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA
NVIDIA JetPack SDK powers NVIDIA Jetson modules, offering a comprehensive solution for building end-to-end accelerated AI applications. JetPack 6 expands the...
12 MIN READ
Jun 02, 2024
Production-Ready, Enterprise-Grade Software on NVIDIA IGX Platform, Support for NVIDIA RTX 6000 ADA, and More
Real-time AI at the edge is crucial for medical, industrial, and scientific computing because these mission-critical applications require immediate data...
6 MIN READ
May 22, 2024
Enhancing AI Cloud Data Centers and NVIDIA Spectrum-X with NVIDIA DOCA 2.7
The NVIDIA DOCA acceleration framework empowers developers with extensive libraries, drivers, and APIs to create high-performance applications and services for...
10 MIN READ
May 20, 2024
Supercharge Generative AI Development with Firebase Genkit, Optimized by NVIDIA RTX GPUs
At Google I/O 2024, Google announced Firebase Genkit, a new open-source framework for developers to add generative AI to web and mobile applications using...
4 MIN READ
May 14, 2024
NVIDIA DeepStream 7.0 Milestone Release for Next-Gen Vision AI Development
NVIDIA DeepStream is a powerful SDK that unlocks GPU-accelerated building blocks to build end-to-end vision AI pipelines. With more than 40 plugins available...
11 MIN READ
May 08, 2024
Mitigating Occlusions in Visual Perception Using Single-View 3D Tracking in NVIDIA DeepStream
When it comes to perception for Intelligent Video Analytics (IVA) applications such as traffic monitoring, warehouse safety, and retail shopper analytics, one...
8 MIN READ
May 07, 2024
NVIDIA GTC Training Labs On Demand Available Now
Missed GTC or want to replay your favorite training labs? Find it on demand with the NVIDIA GTC Training Labs playlist.
1 MIN READ
May 03, 2024
Visual Language Intelligence and Edge AI 2.0
VILA is a family of high-performance vision language models developed by NVIDIA Research and MIT. The largest model comes with ~40B parameters and the smallest...
8 MIN READ
May 03, 2024
Visual Language Models on NVIDIA Hardware with VILA
Visual language models have evolved significantly recently. However, the existing technology typically only supports one single image. They cannot reason among...
11 MIN READ
Data Center / Cloud
Jun 10, 2024
Confidential and Self-Sovereign AI: Best Practices for Enhancing Security and Autonomy
Join the webinar on June 11th with NVIDIA and Super Protocol to learn about the benefits of Confidential Computing for Web3 AI.
1 MIN READ
Jun 10, 2024
Spotlight: Cisco Enhances Workload Security and Operational Efficiency with NVIDIA BlueField-3 DPUs
As cyberattacks become more sophisticated, organizations must constantly adapt with cutting-edge solutions to protect their critical assets. One such solution...
7 MIN READ
Jun 04, 2024
Unlock Deeper Insights of Somatic Mutations with Deep Learning
NVIDIA Parabricks expands the NVIDIA emphasis on solving omics challenges with deep learning and continues accelerating genomics instruments. NVIDIA Parabricks...
5 MIN READ
Jun 02, 2024
A Simple Guide to Deploying Generative AI with NVIDIA NIM
Whether you’re working on-premises or in the cloud, NVIDIA NIM inference microservices provide enterprise developers with easy-to-deploy optimized AI models...
4 MIN READ
May 24, 2024
Accelerating Transformers with NVIDIA cuDNN 9
The NVIDIA CUDA Deep Neural Network library (cuDNN) is a GPU-accelerated library for accelerating deep learning primitives with state-of-the-art performance....
12 MIN READ
May 23, 2024
Applying Generative AI for CVE Analysis at an Enterprise Scale
The software development and deployment process is complex. Modern enterprise applications have complex software dependencies, forming an interconnected web...
11 MIN READ
May 22, 2024
Enhancing AI Cloud Data Centers and NVIDIA Spectrum-X with NVIDIA DOCA 2.7
The NVIDIA DOCA acceleration framework empowers developers with extensive libraries, drivers, and APIs to create high-performance applications and services for...
10 MIN READ
May 22, 2024
Just Released: NVIDIA HPC SDK 24.5
NVIDIA HPC SDK 24.5 updates include support for new NVPL components and CUDA 12.4.
1 MIN READ
May 22, 2024
Just Released: Nsight Compute 2024.2
Nsight Compute 2024.2 adds Python syntax highlighting and call stacks, a redesigned report header, and source page statistics to make CUDA optimization easier.
1 MIN READ
May 21, 2024
Quantum Mechanics-Enhanced Drug Discovery Using QUELO-G and CUDA Graphs
In drug discovery, approaches based on the so-called classical force field have been routinely used and considered useful. However, it is also widely recognized...
9 MIN READ
May 17, 2024
Enhancing the Apparel Shopping Experience with AI, Emoji-Aware OCR, and Snapchat's Screenshop
Ever spotted someone in a photo wearing a cool shirt or some unique apparel and wondered where they got it? How much did it cost? Maybe you've even thought...
8 MIN READ
May 14, 2024
RAPIDS on Databricks: A Guide to GPU-Accelerated Data Processing
In today's data-driven landscape, maximizing performance and efficiency in data processing and analytics is critical. While many Databricks users are familiar...
10 MIN READ