Model Card for Model ID

This is an RNN model for text generation tasks. This model is having more contextual understanding than traditional RNN

Model Details

The model uses bigrams as tokens, thus providing more contextual relevence It also uses a different ouput layer consisting of sigmoid activated neurons to handle larger vocabulary sizes

Model Description

Developed by: ArchBase
Model type: Reccurrent Neural Network
Language(s) (NLP): Probably english (it depends heavily on dataset)
License: Apache license 2.0

Uses

This can be used for text generation tasks where running large computationally intensive architectures are not applicable

Direct Use

For simpler text generation tasks where long range contextual understanding is not must

Out-of-Scope Use

Not applicable for production/commercial use May generate illegal/bad/meaningless responses thay maybe harmful

Bias, Risks, and Limitations

May generate illegal/bad/meaningless responses thay maybe harmful. The model can't handle longer sequences larger than 50 words with contextual relevence

Recommendations

May generate illegal/bad/meaningless responses thay maybe harmful

How to Get Started with the Model

Just run the main.py file

almost basic documentation will be in program itself detailed manual will be in manual.txt file

Training Details

Training Data

[More Information Needed]

Training Procedure

Final training loss: 0.0322 Final validation loss: 5.6888

Training Hyperparameters

Training regime: [More Information Needed]

Speeds, Sizes, Times [optional]

[More Information Needed]

Evaluation

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Factors

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Model Examination [optional]

[More Information Needed]

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

Hardware Type: Trained using Nvidia rtx 2050, using cudnn and cuda dependencies
Hours used: [More Information Needed]
Cloud Provider: [More Information Needed]
Compute Region: [More Information Needed]
Carbon Emitted: [More Information Needed]

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

Nvidia Geforce rtx 2050

Software

cudnn, cuda, tensorflow

Citation [optional]

BibTeX:

[More Information Needed]

APA:

[More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]

Downloads last month: -; Downloads are not tracked for this model. How to track