Spaces:
Sleeping
Sleeping
metadata
title: Text Summarization
emoji: 📝
colorFrom: blue
colorTo: indigo
sdk: docker
app_port: 8501
pinned: false
license: mit
short_description: Summarize YouTube, web pages, and uploaded docs.
Text Summarization
This Space runs a Streamlit app for summarizing:
- YouTube videos
- website URLs
- uploaded PDF, TXT, MD, CSV, and DOCX files
Required Secret
Add this secret in the Space settings:
GROQ_API_KEY
YouTube On Hugging Face Spaces
YouTube transcript loading may work locally but fail on Hugging Face Spaces because YouTube frequently blocks or rate-limits datacenter IP ranges. The app now retries transient HTTPS failures and supports proxy configuration through Space secrets:
YOUTUBE_HTTP_PROXYYOUTUBE_HTTPS_PROXY
You can also use the standard HTTP_PROXY and HTTPS_PROXY environment variables if that matches your setup.
Space-Only YouTube Fallbacks
The Hugging Face Space version now supports multiple YouTube retrieval strategies:
- Direct transcript fetch
- External transcript API
- Audio transcription via
yt-dlp+ Groq Whisper - Manual transcript paste/upload
Optional secrets for external transcript API
YOUTUBE_TRANSCRIPT_API_URLYOUTUBE_TRANSCRIPT_API_KEYYOUTUBE_TRANSCRIPT_API_METHOD(GETorPOST, defaultGET)YOUTUBE_TRANSCRIPT_API_KEY_HEADER(defaultAuthorization)YOUTUBE_TRANSCRIPT_API_TIMEOUT(default45)
YOUTUBE_TRANSCRIPT_API_URL may contain placeholders such as {video_id}, {url}, and {language_code}.
Optional secrets for Groq audio transcription fallback
GROQ_AUDIO_TRANSCRIPTION_MODEL
Default model: whisper-large-v3-turbo