English

NV-Tesseract-AD 2.0 Overview

Description:

NVIDIA NV-Tesseract-AD-2.0 provides anomaly detection functionality. Rather than relying only on transformers, it introduces diffusion modeling, stabilized through curriculum learning, and pairs it with adaptive thresholding methods in a model purpose-built for anomaly detection. Together, these elements address some of the most challenging issues in the field: noisy, high-dimensional signals that drift over time and contain rare, irregular events.

This model is for research and development only.

License/Terms of Use:

Governing Terms: Model: The model is provided under the Apache License, Version 2.0.

Deployment Geography:

Global

Use Case:

Companies, organizations, research hubs looking to do anomaly detection on temporal data.

Release Date:

29th June 2026 via https://huggingface.co/nvidia/nv-tesseract-ad-diffusion

Reference(s):

Segmented Confidence Sequences and Multi-Scale Adaptive Confidence Segments for Anomaly Detection in Nonstationary Time Series
ImDiffusion

Model Architecture:

Architecture Type: Diffusion based transformer

Network Architecture: ResNet34

Number of model parameters: 2 million

The NV-Tesseract-AD-2.0 is a diffusion-based model for time-series imputation and anomaly detection.

Input:

Input Type(s): Tabular numeric
Input Format(s): Tabular Pandas DataFrame or CSV/JSON
Input Parameters: Two-Dimensional (2D)
Other Properties Related to Input: Pre-Processing Needed
Anomaly detection: Contains timestamp column and one or more numeric value columns.

Output:

Output Type(s): Tabular numeric
Output Format: Tabular Pandas DataFrame
Output Parameters: Two-Dimensional (2D)
Other Properties Related to Output: Post-Processing Needed
Anomaly detection: Contains timestamp, value, and anomaly label columns.

Our AI models are designed and/or optimized to run on NVIDIA GPU-accelerated systems. By leveraging NVIDIA's hardware (e.g. GPU cores) and software frameworks (e.g., CUDA libraries), the model achieves faster training and inference times compared to CPU-only solutions.

Software Integration:

Runtime Engine(s):
PyTorch

Supported Hardware Microarchitecture Compatibility:
NVIDIA Ampere
NVIDIA Hopper

[Preferred/Supported] Operating System(s):
Linux

The integration of foundation and fine-tuned models into AI systems requires additional testing using use-case-specific data to ensure safe and effective deployment. Following the V-model methodology, iterative testing and validation at both unit and system levels are essential to mitigate risks, meet technical and functional requirements, and ensure compliance with safety and ethical standards before deployment.

Model Version(s):

NV-Tesseract-AD-2.0

Training & Testing Datasets:

80/20 split per dataset.

Data Modality: Other: Numeric time series

Training Data Size: 3 million data points

TSB-AD-M

Data Collection Method by dataset Hybrid: Synthetic, Automatic/Sensors, Human
Labeling Method by dataset Hybrid: Synthetic, Automatic/Sensors, Human
Properties: The TSB-AD-M benchmark includes a public category containing 18 previously proposed datasets with a total of 1980 multi-variate time series. These time series span different domains and exhibit a high variability of anomaly types, ratios, and sizes.

Evaluation Datasets:

Detecting Anomalies in Wafer Manufacturing

Data Collection Method by dataset Hybrid: Synthetic, Automatic/Sensors, Human
Labeling Method by dataset Hybrid: Synthetic, Automatic/Sensors, Human
Properties: 151 inline process-control traces recorded by semiconductor sensors during wafer fabrication.

CalIt2 Building People Counts

Data Collection Method by dataset Hybrid: Synthetic, Automatic/Sensors, Human
Labeling Method by dataset Hybrid: Synthetic, Automatic/Sensors, Human
Properties: People-count sensor at the main entrance of UC-Irvine's CalIt2 building (15 weeks, 48 half-hour slots per day)

Network Traffic

Data Collection Method by dataset Hybrid: Synthetic, Automatic/Sensors, Human
Labeling Method by dataset Hybrid: Synthetic, Automatic/Sensors, Human
Properties: This dataset contains network traffic data generated for the purpose of anomaly detection in embedded systems, specifically targeting security threats such as malicious activities. It includes both normal and anomalous (malicious) behavior, which are labeled accordingly for supervised learning tasks.

Genesis Demonstrator

Data Collection Method by dataset Hybrid: Synthetic, Automatic/Sensors, Human
Labeling Method by dataset Hybrid: Synthetic, Automatic/Sensors, Human
Properties: The Genesis Demonstrator was created during the OPAK Project and further revised during the European IMPROVE project. It is a portable pick-and-place demonstrator which uses an air tank to supply all the gripping and storage units. It records 5(+4) continuous signals, 13 discrete signals and 1 Unix Timestamp.

Falling People

Data Collection Method by dataset Hybrid: Synthetic, Automatic/Sensors, Human
Labeling Method by dataset Hybrid: Synthetic, Automatic/Sensors, Human
Properties: This data set was used during a thesis to develop safer smart environments. The origin is a care independent smart home environment to detect the falling of elderly people.

Inference:

Engine: PyTorch, Transformer Engine
Test Hardware:

  • A100 (8 GPUs; each is 74 GB)
  • H100 (8 GPUs; each is 80 GB)

Ethical Considerations:

NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.

Please report model quality, risk, security vulnerabilities or NVIDIA AI Concerns here.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for nvidia/nv-tesseract-ad-diffusion