FakeMark: Deepfake Speech Attribution with Watermarked Artifacts

Official pretrained checkpoints for FakeMark, a deepfake speech attribution system. FakeMark injects system-specific watermark artifacts into synthesized speech to attribute waveforms back to their originating Text-to-Speech (TTS) architecture.



📌 Overview

FakeMark is a framework for speech provenance. It pairs SEANet-based watermark generators with MMS-300M collaborators (attribution classifiers) to produce high-fidelity watermarked audio whose watermark survives common distortions while attribution accuracy remains high.

Key Features:

  • FakeMarkA: SEANet-based watermark generator with ColProcessor conditioning.
  • FakeMarkT: Timbre-based watermark generator.
  • Multi-System Support: Includes retrained checkpoints for AudioSeal and Timbre-4bit for benchmarking.

📂 Checkpoint Layout

The repository is organized as follows:

checkpoints/
├── FakeMarkA/
│   ├── encoder.ckpt          # SEANet watermark generator
│   ├── decoder.ckpt          # SEANet decoder
│   ├── colprocessor.ckpt     # ColProcessor conditioning module
│   └── collaborator.ckpt     # MMS-300M collaborator (attribution classifier)
├── FakeMarkT/
│   ├── encoder.ckpt          # Timbre watermark generator
│   └── collaborator.ckpt     # MMS-300M collaborator
├── AudioSeal/
│   ├── checkpoint_generator_epoch260.pth   # AudioSeal generator (retrained)
│   └── checkpoint_detector_epoch260.pth    # AudioSeal detector (retrained)
├── Timbre-4bit.pth.tar       # Timbre generator/detector (retrained)
├── MMS_300M.ckpt             # Standalone MMS-300M classifier
└── ResNet.ckpt               # Standalone ResNet34 + LFB + LMCL classifier
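
After downloading, it can be useful to confirm that a local copy matches the layout above before wiring the files into any loading code. The following is a minimal sketch using only the Python standard library. The file list is transcribed from the tree above; the function name `find_missing_checkpoints` and the assumption that the files live under a local `checkpoints/` directory are illustrative and not part of the FakeMark codebase.

```python
from pathlib import Path

# Expected files, transcribed from the layout listing in this README.
EXPECTED_FILES = [
    "FakeMarkA/encoder.ckpt",
    "FakeMarkA/decoder.ckpt",
    "FakeMarkA/colprocessor.ckpt",
    "FakeMarkA/collaborator.ckpt",
    "FakeMarkT/encoder.ckpt",
    "FakeMarkT/collaborator.ckpt",
    "AudioSeal/checkpoint_generator_epoch260.pth",
    "AudioSeal/checkpoint_detector_epoch260.pth",
    "Timbre-4bit.pth.tar",
    "MMS_300M.ckpt",
    "ResNet.ckpt",
]


def find_missing_checkpoints(root):
    """Return the expected relative paths that are not regular files under root.

    Hypothetical helper for illustration; it only checks file presence,
    not file contents or integrity.
    """
    root = Path(root)
    return [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]


if __name__ == "__main__":
    # Assumption: the checkpoints were downloaded into ./checkpoints.
    missing = find_missing_checkpoints("checkpoints")
    if missing:
        print("Missing checkpoint files:")
        for rel in missing:
            print("  " + rel)
    else:
        print("All expected checkpoint files are present.")
```

The check is deliberately limited to file presence, so it runs without PyTorch or any FakeMark code installed.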

Paper for nii-yamagishilab/fakemark