--- title: Multi Document Summarizer emoji: 🔥 colorFrom: blue colorTo: green sdk: gradio sdk_version: 5.25.2 app_file: app.py pinned: false license: mit short_description: Summarizes multiple docs (docx, pdf, etc.) with NLP & visual --- # Multi Document Summarization This project provides a multi-document summarization tool using state-of-the-art NLP models like BART and Longformer. It supports various file formats and generates summaries along with visualizations like dendrograms, t-SNE plots, TF-IDF plots, and word clouds. ## Features - Summarizes multiple documents into concise summaries. - Supports file formats: `.docx`, `.txt`, `.html`, `.pdf`, `.csv`, `.xlsx`, `.json`, `.xml`, `.ppt`, `.pptx`. - Visualizations: Dendrogram, t-SNE, TF-IDF, Word Cloud. ## Installation 1. Clone the repository: ```bash git clone https://github.com/your-username/abstractive-text-summarization.git ``` 2. Navigate to the project directory: ```bash cd abstractive-text-summarization ``` 3. Install dependencies: ```bash pip install -r requirements.txt ``` ## Usage 1. Run the application: ```bash python major_project_main.py ``` 2. Open the Gradio interface in your browser. 3. Upload files and click "Summarize" to generate summaries and visualizations. ## License This project is licensed under the MIT License. See the LICENSE file for details.