danielrosehill's picture
commit
3dc0b3d

A newer version of the Gradio SDK is available: 6.2.0

Upgrade
metadata
title: Basic STT Transcript Cleanup
emoji: 🎤
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
short_description: Clean up speech-to-text transcripts with AI

Basic STT Transcript Cleanup Tool (Version 3)

A foundational speech-to-text transcript remediation tool that provides purpose-agnostic text cleanup instructions. This is the daily workhorse for cleaning up raw speech-to-text transcripts that naturally contain undesirable material.

Purpose & Philosophy

This tool implements Version 3 of the Basic Speech-to-Text Cleanup prompt - a carefully crafted system prompt that provides sufficiently deterministic guidance without overstepping into actual content editing. The challenge in developing this prompt was ensuring it cleans up technical artifacts of speech-to-text conversion while preserving the authentic voice and intent of the original speaker.

Foundational Design

This basic cleanup prompt serves as a foundation layer that can be combined with specialized text transformation prompts:

  • Standalone Use: Perfect for general transcript cleanup
  • Modular Design: Can be concatenated with purpose-specific prompts from extensive libraries
  • Purpose-Agnostic: Works across all content types and domains
  • Extensible: Hundreds of specialized transformation prompts can be layered on top

Features

  • AI-Powered Cleanup: Uses OpenAI's GPT models with a refined system prompt
  • BYOK (Bring Your Own Key): Secure - uses your own OpenAI API key
  • Copy to Clipboard: Easy copying of cleaned text
  • Re-run Capability: Quickly re-process the same text
  • System Prompt Viewer: Transparent - see exactly how the AI processes your text
  • Deterministic Processing: Consistent, predictable cleanup results

How to Use

  1. Enter API Key: Provide your OpenAI API key (required for processing)
  2. Paste Transcript: Add your raw speech-to-text transcript
  3. Process: Click "Clean Up Transcript" to apply remediation
  4. Copy Results: Use the cleaned output or re-run if needed

What It Does

The tool applies these foundational improvements to your transcripts:

Core Remediations

  • Removes filler words (like "um")
  • Adds punctuation, sentence structure, and paragraph spacing
  • Fixes obvious STT hallucinations and mistranscriptions (e.g., "McDonuts" → "McDonalds")
  • Removes repetitive or run-on thoughts that would not be helpful to readers
  • Follows inferred instructions to omit certain clauses (e.g., "wait .. scratch that from the note")

What It Preserves

  • All important content and meaning
  • Original speaker's voice and intent
  • Factual accuracy and details
  • Natural flow of conversation

Design Principles

  1. Light Touch Editing: Minimal intervention while maximizing clarity
  2. Content Preservation: Never removes or alters important information
  3. Deterministic Guidance: Consistent, predictable results
  4. Purpose Agnostic: Works across all content domains
  5. Modular Foundation: Ready for specialized prompt layering

Extended Ecosystem

This basic cleanup prompt is part of a larger ecosystem:

  • Hundreds of specialized prompts available in shared libraries
  • Domain-specific transformations for various use cases
  • Concatenation-ready design for complex workflows
  • Shared on Hugging Face and other platforms

System Prompt

The tool uses a carefully crafted system prompt (Version 3, September 2025) that balances cleanup effectiveness with content preservation. View the complete prompt using the "Show System Prompt" feature in the interface.

Created By

Daniel Rosehill - Specializing in AI-powered text processing and speech-to-text optimization workflows.