GemSUraV: GemSUra x Vision

Model Details

Model Description

GemSUraV is the result of a collaboration between Vietnamese researchers from Ho Chi Minh City University of Technology (HCMUT) - Vietnam National University HCMC and Stanford University, aimed at improving the quality of large language models for the Vietnamese language. To support community progress, we offer our models free of charge for research purposes. Further details about our research are provided below.

  • Model type: Text generation
  • Languages: Vietnamese, English
  • License: Apache 2.0
  • Finetuned from model: GemSUra 7B

Model Sources

We publicly provide starter source code for fine-tuning, evaluation, and deployment of our models.

  • Framework: LLaVA
  • Paper: Coming soon

Uses

Direct Use

You can use our models to perform various tasks, including question answering (with context), summarization, language modeling, text classification, translation, code generation, and reasoning.

Downstream Use

This model can serve as an encoder for a wide range of downstream tasks, spanning from pure natural language processing to combinations of natural language processing with computer vision or speech processing.

Out-of-Scope Use

While our models have been fine-tuned on extensive Vietnamese datasets, they may not perform optimally in specialized domains that require deep domain expertise, such as medicine, politics, and chemistry. We kindly request that you refrain from using our models for political purposes or for any activity that may cause harm to individuals or compromise the sovereignty and territorial integrity of Vietnam.

Bias, Risks, and Limitations

Unless required by applicable law, the GemSUra materials and any output and results therefrom are provided on an "as is" basis, without warranties of any kind, either express or implied, including, without limitation, any warranties of title, non-infringement, merchantability, or fitness for a particular purpose. You are solely responsible for determining the appropriateness of using or redistributing the GemSUra materials and assume any risks associated with your use of the GemSUra materials and any output and results.

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases, and limitations of the model. For the model to work well, you may need to perform prompt engineering to create appropriate prompts before inference.
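As a rough illustration of the prompt engineering mentioned above, the sketch below assembles a context-grounded question-answering prompt. The helper name `build_qa_prompt` and the "Context / Question / Answer" wording are hypothetical and are not a template documented for GemSUraV; treat them as a starting point to adapt and evaluate.

```python
def build_qa_prompt(context: str, question: str) -> str:
    """Assemble a question-answering prompt with optional context.

    Hypothetical helper: the section labels below are illustrative,
    not an official GemSUraV prompt format.
    """
    context = context.strip()
    question = question.strip()
    if not question:
        raise ValueError("question must not be empty")

    parts = []
    if context:
        # Only include the context section when context is provided.
        parts.append(f"Context:\n{context}")
    parts.append(f"Question: {question}")
    parts.append("Answer:")
    return "\n\n".join(parts)


prompt = build_qa_prompt(
    context="Hà Nội là thủ đô của Việt Nam.",
    question="Thủ đô của Việt Nam là gì?",
)
print(prompt)
```

Keeping prompt construction in one small function makes it easier to compare alternative wordings on the same inputs.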

How to Get Started with the Model

Please use this repository (GitHub) to load the model.

python -m llava.serve.cli \
    --model-path ura-hcmut/GemSUraV-7B \
    --image-file "https://llava-vl.github.io/static/images/view.jpg" 
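If you prefer to launch the same command from a script, the sketch below assembles it as an argument list and prints it. It uses only the module name and the two flags shown in the command above; the helper names `build_cli_command` and `run_cli` are hypothetical, and actually running the command assumes the `llava` package from the linked repository is installed in the current environment.

```python
import shlex
import subprocess
import sys
from typing import List

MODEL_PATH = "ura-hcmut/GemSUraV-7B"
IMAGE_FILE = "https://llava-vl.github.io/static/images/view.jpg"


def build_cli_command(model_path: str, image_file: str) -> List[str]:
    """Return the argv list equivalent to the shell command shown above."""
    return [
        sys.executable, "-m", "llava.serve.cli",
        "--model-path", model_path,
        "--image-file", image_file,
    ]


def run_cli(model_path: str = MODEL_PATH, image_file: str = IMAGE_FILE) -> int:
    """Start the interactive CLI and return its exit code.

    Assumption: the llava package is importable by this interpreter.
    """
    return subprocess.run(build_cli_command(model_path, image_file)).returncode


# Print the command instead of running it, so it can be inspected first.
print(shlex.join(build_cli_command(MODEL_PATH, IMAGE_FILE)))
```

Passing the arguments as a list avoids shell quoting issues with the image URL. Call `run_cli()` once the dependencies are installed.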

Finetuning Details

See GitHub.

Evaluation

Our models are tested on various tasks. Details of the evaluation process are coming soon.

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

  • Hardware Type: 4 x A100 40GB
  • Hours used: 400h
  • Carbon Emitted: ~175 kg CO2 eq.
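The estimate above follows the usual pattern of energy consumed multiplied by the carbon intensity of the electricity used. The sketch below works through that arithmetic. The per-GPU power draw (0.25 kW) and the carbon intensity (0.432 kg CO2 eq. per kWh) are assumptions chosen for illustration; the card does not state the values that were actually used. Only the GPU count and the hours come from the list above.

```python
def estimate_co2_kg(
    num_gpus: int,
    gpu_power_kw: float,
    hours: float,
    carbon_intensity_kg_per_kwh: float,
) -> float:
    """Estimate emissions as energy (kWh) times carbon intensity (kg CO2 eq./kWh)."""
    energy_kwh = num_gpus * gpu_power_kw * hours
    return energy_kwh * carbon_intensity_kg_per_kwh


estimate = estimate_co2_kg(
    num_gpus=4,                         # from the card: 4 x A100 40GB
    gpu_power_kw=0.25,                  # assumption: 250 W per GPU
    hours=400,                          # from the card: 400h
    carbon_intensity_kg_per_kwh=0.432,  # assumption: illustrative grid intensity
)
print(f"Estimated emissions: {estimate:.1f} kg CO2 eq.")
```

Under these assumed inputs the result is in the same range as the reported figure. A different power draw or regional carbon intensity would shift it proportionally.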

Citation

If you use GemSUra materials in your research, please cite our model(s) as below.

BibTeX:

@inproceedings{crossing2024,
    title = "Crossing Linguistic Horizons: Finetuning and Comprehensive Evaluation of Vietnamese Large Language Models",
    author = "Truong, Sang T.  and Nguyen, Duc Q.  and Nguyen, Toan D. V.  and Le, Dong D.  and Truong, Nhi N.  and Quan, Tho  and Koyejo, Sanmi",
    booktitle = "Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
    month = jun,
    year = "2024",
    address = "Seattle, Washington",
    publisher = "Association for Computational Linguistics",
    url = "",
    pages = "",
}

Contact

Safetensors

  • Model size: 8.85B params
  • Tensor type: FP16