import streamlit as st
import torch
from transformers import pipeline
# from transformers import T5Tokenizer, T5ForConditionalGeneration
# from transformers import BartTokenizer, BartForConditionalGeneration
# from transformers import AutoTokenizer, EncoderDecoderModel
# from transformers import AutoTokenizer, LEDForConditionalGeneration
# from transformers import AutoTokenizer, FlaxLongT5ForConditionalGeneration

# Title
st.title("SummarizeEasy")

# Input text
text = """
Kathmandu, Nepal's capital, is set in a valley surrounded by the Himalayan mountains. At the heart of the old city’s mazelike alleys is Durbar Square, which becomes frenetic during Indra Jatra, a religious festival featuring masked dances. Many of the city's historic sites were damaged or destroyed by a 2015 earthquake. Durbar Square's palace, Hanuman Dhoka, and Kasthamandap, a wooden Hindu temple, are being rebuilt. Kathmandu and adjacent cities are composed of neighbourhoods, which are utilized quite extensively and more familiar among locals. However, administratively the city is divided into 32 wards, numbered from 1 to 32. Earlier, there were 35 wards which made it the metropolitan city with the largest number of the wards. Balendra Shah (Balen) has been elected as the new mayor of Kathmandu. Kathmandu Municipal Corporation (KMC) is the chief nodal agency for the administration of Kathmandu. The Municipality of Kathmandu was upgraded to a metropolitan city in 1995. Kathmandu's urban cosmopolitan character has made it the most populous city in Nepal. Metropolitan Kathmandu is divided into five sectors: the Central Sector, the East Sector, the North Sector, the City Core and the West Sector. For civic administration, the city is further divided into 35 administrative wards. The Council administers the Metropolitan area of Kathmandu city through its 177 elected representatives and 20 nominated members. It holds biannual meetings to review, process and approve the annual budget and make major policy decisions. The ancient trade route between India and Tibet that passed through Kathmandu enabled a fusion of artistic and architectural traditions from other cultures to be amalgamated with local art and architecture. The monuments of Kathmandu City have been influenced over the centuries by Hindu and Buddhist religious practices. The architectural treasure of the Kathmandu valley has been categorized under the well-known seven groups of heritage monuments and buildings.
In 2006 UNESCO declared these seven groups of monuments as a World Heritage Site (WHS). The seven monuments zones cover an area of 189 hectares (470 acres), with the buffer zone extending to 2,394 hectares (5,920 acres). The Seven Monument Zones inscribed originally in 1979 and with a minor modification in 2006 are the Durbar squares of Hanuman Dhoka, Patan and Bhaktapur, the Hindu temples of Pashupatinath and Changunarayan, the Buddhist stupas of Swayambhunath and Boudhanath.
"""
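
## Initializing models

# Transformers pipeline approach
def transform_summarize(text):
    # Load the default summarization pipeline (the model is chosen by transformers).
    summary = pipeline("summarization")
    # Greedy decoding, summary capped at 100 tokens. truncation=True clips
    # inputs that exceed the model's maximum input length.
    k = summary(text, max_length=100, do_sample=False, truncation=True)
    # k is a list of dicts of the form [{"summary_text": "..."}].
    return k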
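
# Sketch, not part of the original app: summarization models accept only a
# limited number of input tokens, so longer documents may need to be split
# before summarizing. chunk_text is a hypothetical helper that splits on
# whitespace into pieces of at most max_words words. The 400-word default is
# an assumed, untuned value, and word count is only a rough proxy for the
# model's token count.
def chunk_text(text, max_words=400):
    words = text.split()
    return [
        " ".join(words[i:i + max_words])
        for i in range(0, len(words), max_words)
    ]

# Possible use (assumes transform_summarize returns [{"summary_text": ...}]):
# parts = [transform_summarize(c)[0]["summary_text"] for c in chunk_text(text)]
# st.write(" ".join(parts))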

# T5
# def t5_summarize(text):
#     tokenizer = T5Tokenizer.from_pretrained("t5-small")
#     model = T5ForConditionalGeneration.from_pretrained("t5-small")
#     input_text = "summarize: " + text
#     inputs = tokenizer.encode(input_text, return_tensors="pt", max_length=1024, truncation=True)
#     outputs = model.generate(inputs, max_length=200, min_length=50, length_penalty=2.0, num_beams=4, early_stopping=True)
#     summary = tokenizer.decode(outputs[0], skip_special_tokens=True)
#     return summary

# BART
# def bart_summarize(text):
#     tokenizer = BartTokenizer.from_pretrained("facebook/bart-large-cnn")
#     model = BartForConditionalGeneration.from_pretrained("facebook/bart-large-cnn")
#     inputs = tokenizer([text], max_length=1024, return_tensors="pt", truncation=True)
#     summary_ids = model.generate(inputs["input_ids"], num_beams=4, max_length=150, early_stopping=True)
#     summary = tokenizer.decode(summary_ids[0], skip_special_tokens=True)
#     return summary

# Encoder-Decoder
# def encoder_decoder(text):
#     model = EncoderDecoderModel.from_pretrained("patrickvonplaten/bert2bert_cnn_daily_mail")
#     tokenizer = AutoTokenizer.from_pretrained("patrickvonplaten/bert2bert_cnn_daily_mail")
#     # let's perform inference on a long piece of text
#     input_ids = tokenizer(text, return_tensors="pt").input_ids
#     # autoregressively generate summary (uses greedy decoding by default)
#     generated_ids = model.generate(input_ids)
#     generated_text = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
#     return generated_text

# st.write("Generated Summaries are: ")
summary = transform_summarize(text)
st.write(summary)
# print(t5_summarize(text))
# print(bart_summarize(text))
# print(encoder_decoder(text))