Extract audio from a video and transcribe it
Extract text from a PDF file
Convert PDFs to text using OCR
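
The three tasks above could be sketched with common command-line tools driven from Python. This is only one possible approach, and it assumes that ffmpeg, the openai-whisper CLI (`whisper`), poppler's `pdftotext`, and `ocrmypdf` are installed; none of these tools is named in the list itself. The function names and file paths are hypothetical. The helpers only build the command lines; `run` executes one if the tool is available.

```python
import shutil
import subprocess
from typing import List


def audio_extract_cmd(video: str, wav: str) -> List[str]:
    """ffmpeg command: drop the video stream and write mono 16 kHz PCM WAV."""
    return ["ffmpeg", "-y", "-i", video, "-vn",
            "-acodec", "pcm_s16le", "-ar", "16000", "-ac", "1", wav]


def transcribe_cmd(wav: str, out_dir: str) -> List[str]:
    """Assumed transcriber: the openai-whisper CLI, writing a .txt transcript."""
    return ["whisper", wav, "--model", "base",
            "--output_format", "txt", "--output_dir", out_dir]


def pdf_text_cmd(pdf: str, txt: str) -> List[str]:
    """pdftotext command for PDFs that already contain a text layer."""
    return ["pdftotext", "-layout", pdf, txt]


def pdf_ocr_cmd(pdf: str, ocr_pdf: str, txt: str) -> List[str]:
    """ocrmypdf command for scanned PDFs; --sidecar saves the recognized text."""
    return ["ocrmypdf", "--sidecar", txt, pdf, ocr_pdf]


def run(cmd: List[str]) -> None:
    """Run a command, failing early with a clear error if the tool is missing."""
    if shutil.which(cmd[0]) is None:
        raise FileNotFoundError(f"required tool not found on PATH: {cmd[0]}")
    subprocess.run(cmd, check=True)
```

For example, `run(audio_extract_cmd("talk.mp4", "talk.wav"))` followed by `run(transcribe_cmd("talk.wav", "out"))` would cover the first task. Use `pdf_text_cmd` when the PDF has selectable text and `pdf_ocr_cmd` when it is a scan.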