import gradio as gr from gradio_simpletextextractfrompdf import SimpleTextExtractFromPDF def first_200_chars(text): return text[:200] demo = gr.Interface( fn=first_200_chars, inputs=SimpleTextExtractFromPDF(label="Upload a PDF file"), outputs="text", title="SimpleTextExtractFromPDF Demo - First 200 Characters", description=""" ## Component Description This space is to demo the usage of the SimpleTextExtractFromPDF component. This component provides a simple interface to extract text from a PDF file. The extracted text can be submitted as a string input to a function for further processing. - **Text Extraction Only:** Only the text content is extracted from the PDF. Images and table structures are not preserved. - **Flexible Upload Options:** Users can upload a PDF file from their device or provide a URL to the PDF. - **Input Component:** The component is primarily designed to be used as an input, allowing users to submit the extracted text to other functions. - **Output Display:** When used as an output component, the extracted string content is displayed in a textarea. The demo app here uses the SimpleTextExtractFromPDF component as an input component to extract the text from a PDF file and then show the first 200 characters of the extracted text. """, article="""
pip install gradio-simpletextextractfrompdf
https://pypi.org/project/gradio-simpletextextractfrompdf/
Gradio Custom Utility Components Project:
https://github.com/HuggyMonkey/gradio-custom-component