Spaces:

Davidsamuel101
/

PPTGenerator

Runtime error

Davidsamuel101 commited on Jun 9, 2023

Commit

66a005f

1 Parent(s): 4ceaee9

Fix Convert2Markdown

Files changed (4) hide show

BDP LEC REPORT 3.md ADDED Viewed

+BDP LEC REPORT 3 Presentation
+=============================
+---
+# Group Members:
+Stella Shania Mintara, David Samuel and Egivenia.
+---
+# I. Visualization Layer
+## Web Framework
+We use Django to develop a web application to display analysis results.
+Seaborn is an open-source Python library based on matplotlib. It is used for exploratory data analysis and data visualization.
+Django is a popular Python web framework to develop web applications. The model serves as a definition for stored data and manages database interactions.
+Django is a web framework for big data analytics applications.
+Django is a great match with MongoDB for building powerful, secure, easy-to- maintain applications. Support for a non-relational database like MongoDB can be implemented by installing additional Django-MongoDB engines for MongoDB.
+## Serving Database
+A MongoDB database is also used to store inferences after the deep learning models are applied to the data streams as well. For saving large images such as image and video files up to 16MB per file, we can use MongoDB specification which is GridFS.
+## Interactive Querying
+The MongoDB Connector for Spark provides integration between MongoDB and Apache Spark. Spark also includes a cost-based that is an optimization technique in Spark that uses table statistics to determine the most efficient query execution plan.

BDP LEC REPORT.md ADDED Viewed

+BDP LEC REPORT Presentation
+===========================
+---
+# Group Members:
+Stella Shania Mintara, David Samuel and Egivenia.
+---
+# Case Problem
+FreshMart is already well-established, they have enough resources to buy and own servers. They prefer to outsource the server management to another party so they don’t need to search and hire talents to run and manage the servers.
+---
+# I. Data Source
+The data source is obtained from Fresh Mart’s surveillance cameras. This data will be ingested in the ingestion layer using Apache Kafka.

src/__pycache__/app.cpython-38.pyc CHANGED Viewed

Binary files a/src/__pycache__/app.cpython-38.pyc and b/src/__pycache__/app.cpython-38.pyc differ

src/app.py CHANGED Viewed

@@ -1,4 +1,4 @@
-from text_extractor import TextExtractor
 from tqdm import tqdm
 from transformers import PegasusForConditionalGeneration, PegasusTokenizer
 from transformers import pipeline
@@ -35,7 +35,7 @@ def summarize(slides):
     return generated_slides
 def convert2markdown(generated_slides):
-    mdFile = MdUtils(file_name=f"summary/{FILENAME}", title=f'{FILENAME} Presentation')
     for k, v in generated_slides.items():
         mdFile.new_line('---\n')
         for section in v:
@@ -61,8 +61,7 @@ def inference(document):
     slides = preprocess.get_slides(texts)
     generated_slides = summarize(slides)
     markdown_path = convert2markdown(generated_slides)
-    # with open(markdown_path, 'rt') as f:
-    #     markdown_str = f.read()
     return markdown_path

+from src.text_extractor import TextExtractor
 from tqdm import tqdm
 from transformers import PegasusForConditionalGeneration, PegasusTokenizer
 from transformers import pipeline
     return generated_slides
 def convert2markdown(generated_slides):
+    mdFile = MdUtils(file_name=FILENAME, title=f'{FILENAME} Presentation')
     for k, v in generated_slides.items():
         mdFile.new_line('---\n')
         for section in v:
     slides = preprocess.get_slides(texts)
     generated_slides = summarize(slides)
     markdown_path = convert2markdown(generated_slides)
+    print(f"Markdown Path: {markdown_path}")
     return markdown_path