---
title: README
emoji: 🌍
colorFrom: pink
colorTo: red
sdk: static
pinned: false
---
![Hugging Face x Google Cloud](https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/google-cloud/thumbnail.png)

*Welcome to the official Google organization on Hugging Face\!*

[Google collaborates with Hugging Face](https://huggingface.co/blog/gcp-partnership) across open science, open source, cloud, and hardware to **enable companies to innovate with AI** [on Google Cloud AI services and infrastructure with the Hugging Face ecosystem](https://huggingface.co/docs/google-cloud/main/en/index).

## Featured Models and Tools

* **Gemma Family of Open Multimodal Models**  
  * **Gemma** is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models  
  * **PaliGemma** is a versatile and lightweight vision-language model (VLM)   
  * **CodeGemma** is a collection of lightweight open code models built on top of Gemma  
  * **RecurrentGemma** is a family of open language models built on a novel recurrent architecture developed at Google  
  * **ShieldGemma** is a series of safety content moderation models built upon Gemma 2 that target four harm categories  
* **[**BERT**](https://huggingface.co/collections/google/bert-release-64ff5e7a4be99045d1896dbc), [**T5**](https://huggingface.co/collections/google/t5-release-65005e7c520f8d7b4d037918), and [**TimesFM**](https://github.com/google-research/timesfm) Model Families**
* **Author ML models with [**MaxText**](https://github.com/google/maxtext), [**JAX**](https://github.com/google/jax), [**Keras**](https://github.com/keras-team/keras), [**Tensorflow**](https://github.com/tensorflow/tensorflow), and [**PyTorch/XLA**](https://github.com/pytorch/xla)**

## Open Research and Community Resources

* **Google Blogs**:  
  * [https://blog.google/](https://blog.google/)  
  * [https://cloud.google.com/blog/](https://cloud.google.com/blog/)  
  * [https://deepmind.google/discover/blog/](https://deepmind.google/discover/blog/)  
  * [https://developers.google.com/learn?category=aiandmachinelearning](https://developers.google.com/learn?category=aiandmachinelearning)   
* **Notable GitHub Repositories**:  
  * [https://github.com/google/jax](https://github.com/google/jax) is a Python library for high-performance numerical computing and machine learning  
  * [https://github.com/huggingface/Google-Cloud-Containers](https://github.com/huggingface/Google-Cloud-Containers) facilitate the training and deployment of Hugging Face models on Google Cloud  
  * [https://github.com/pytorch/xla](https://github.com/pytorch/xla) enables PyTorch on XLA Devices (e.g. Google TPU)  
  * [https://github.com/huggingface/optimum-tpu](https://github.com/huggingface/optimum-tpu) brings the power of TPUs to your training and inference stack  
  * [https://github.com/openxla/xla](https://github.com/openxla/xla) is a machine learning compiler for GPUs, CPUs, and ML accelerators  
  * [https://github.com/google/JetStream](https://github.com/google/JetStream) (and [https://github.com/google/jetstream-pytorch](https://github.com/google/jetstream-pytorch)) is a throughput and memory optimized engine for large language model (LLM) inference on XLA devices  
  * [https://github.com/google/flax](https://github.com/google/flax) is a neural network library for JAX that is designed for flexibility  
  * [https://github.com/kubernetes-sigs/lws](https://github.com/kubernetes-sigs/lws) facilitates Kubernetes deployment patterns for AI/ML inference workloads, especially multi-host inference workloads  
  * [https://github.com/GoogleCloudPlatform/ai-on-gke](https://github.com/GoogleCloudPlatform/ai-on-gke) is a collection of AI examples, best-practices, and prebuilt solutions  
* **Google AI Research Papers**: [https://research.google/](https://research.google/) 

## On-device ML using [Google AI Edge](http://ai.google.dev/edge)

* Customize and run common ML Tasks with low-code [MediaPipe Solutions](https://ai.google.dev/edge/mediapipe/solutions/guide)  
* Run [pretrained](https://ai.google.dev/edge/litert/models/trained) or custom models on-device with [Lite RT (previously known as TensorFlow Lite)](https://ai.google.dev/edge/lite)  
* Convert [TensorFlow](https://ai.google.dev/edge/lite/models/convert_tf) and [JAX](https://ai.google.dev/edge/lite/models/convert_jax) models to LiteRT  
* Convert PyTorch models to LiteRT and author high performance on-device LLMs with [AI Edge Torch](https://github.com/google-ai-edge/ai-edge-torch)  
* Visualize and debug models with [Model Explorer](https://ai.google.dev/edge/model-explorer) ([🤗 Space](https://huggingface.co/spaces/google/model-explorer))

## Partnership Highlights and Resources

* Select Google Cloud CPU, GPU, or TPU options when setting up your **Hugging Face [**Inference Endpoints**](https://huggingface.co/blog/tpu-inference-endpoints-spaces) and Spaces**  
* **Train and Deploy Hugging Face models** on Google Kubernetes Engine (GKE) and Vertex AI **directly from Hugging Face model landing pages or from Google Cloud Model Garden**  
* **Integrate [**Colab**](https://colab.research.google.com/) notebooks with Hugging Face Hub** via the [HF\_TOKEN secret manager integration](https://huggingface.co/docs/huggingface_hub/v0.23.3/en/quick-start#environment-variable) and transformers/huggingface\_hub pre-installs  
* Leverage [**Hugging Face Deep Learning Containers (DLCs)**](https://cloud.google.com/deep-learning-containers/docs/choosing-container#hugging-face) for easy training and deployment of Hugging Face models on Google Cloud infrastructure

Read about our principles for responsible AI at [https://ai.google/responsibility/principles](https://ai.google/responsibility/principles/)