---
title: Multi Document Summarizer
emoji: 🔥
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 5.25.2
app_file: app.py
pinned: false
license: mit
short_description: Summarizes multiple docs (docx, pdf, etc.) with NLP & visual
---
# Multi Document Summarization
This project is a multi-document summarization tool built on transformer models such as BART and Longformer. It accepts a range of file formats and produces summaries together with visualizations: dendrograms, t-SNE plots, TF-IDF plots, and word clouds.
## Features
- Condenses multiple documents into concise summaries.
- Supports file formats: `.docx`, `.txt`, `.html`, `.pdf`, `.csv`, `.xlsx`, `.json`, `.xml`, `.ppt`, `.pptx`.
- Visualizations: Dendrogram, t-SNE, TF-IDF, Word Cloud.
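To give a feel for the numbers behind a TF-IDF plot, here is a minimal standard-library sketch. It is not taken from this project's code: the function names `tokenize`, `tf_idf`, and `top_terms` are hypothetical, and the weighting (relative term frequency times `log(N / document frequency)`) is one common textbook variant that may differ from what the app actually uses.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tf_idf(documents):
    """Return one {term: score} dict per document (hypothetical helper)."""
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid division by zero for empty documents
        scores.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


def top_terms(score, k=3):
    """Return the k highest-scoring terms, ties broken alphabetically."""
    return sorted(score, key=lambda term: (-score[term], term))[:k]


if __name__ == "__main__":
    docs = ["the cat sat on the mat", "the dog chased the cat", "a bird sang"]
    for doc, score in zip(docs, tf_idf(docs)):
        print(doc, "->", top_terms(score))
```

Terms that occur in every document score zero under this weighting, which is why common words tend to drop out of a TF-IDF plot.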
## Installation
1. Clone the repository:
```bash
git clone https://github.com/your-username/abstractive-text-summarization.git
```
2. Navigate to the project directory:
```bash
cd abstractive-text-summarization
```
3. Install dependencies:
```bash
pip install -r requirements.txt
```
## Usage
1. Run the application:
```bash
python major_project_main.py
```
2. Open the Gradio interface in your browser.
3. Upload files and click "Summarize" to generate summaries and visualizations.
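Before uploading a batch, it can help to check which files have a supported extension. The sketch below is an illustration only, not part of the app: `SUPPORTED_EXTENSIONS` is copied from the Features list, and `split_by_support` is a hypothetical helper name.

```python
from pathlib import Path

# Extensions copied from the Features list; the app itself may check differently.
SUPPORTED_EXTENSIONS = {
    ".docx", ".txt", ".html", ".pdf", ".csv",
    ".xlsx", ".json", ".xml", ".ppt", ".pptx",
}


def split_by_support(paths):
    """Split paths into (supported, unsupported) by file extension."""
    supported, unsupported = [], []
    for path in paths:
        # Compare case-insensitively so "REPORT.PDF" counts as a PDF.
        if Path(path).suffix.lower() in SUPPORTED_EXTENSIONS:
            supported.append(path)
        else:
            unsupported.append(path)
    return supported, unsupported


if __name__ == "__main__":
    ok, skipped = split_by_support(["report.PDF", "notes.md", "data.csv"])
    print("upload:", ok)
    print("skip:", skipped)
```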
## License
This project is licensed under the MIT License. See the LICENSE file for details.