pankaj-munde (Pankaj Munde)

liked a model 2 months ago

ai4bharat/indic-parler-tts

Text-to-Speech • Updated Dec 9, 2024 • 19k • 101

liked 2 Spaces 2 months ago

279

MMS

🌍

Transform and identify speech with MMS

693

OminiControl

🌍

Generate detailed images from a prompt and an image

liked a model 3 months ago

OuteAI/OuteTTS-0.1-350M

Text-to-Speech • Updated Nov 27, 2024 • 5.09k • 299

updated a collection 8 months ago

Public Repos

Collection

2 items • Updated Jun 23, 2024

updated a Space 8 months ago

FScout 2.0

🌖

updated a collection 9 months ago

Public Repos

Collection

2 items • Updated Jun 23, 2024

liked a model 10 months ago

mistralai/Mixtral-8x22B-Instruct-v0.1

Text Generation • Updated Oct 3, 2024 • 1.29M • • 708

reacted to nisten's post with 🔥 10 months ago

Post

5465

Just had the former chief public health officer of the Netherlands🇳🇱 review a huggingface AI-doctor I made via a simple orpo-zephyr-8x22b-GPT and they think it's really good.

https://hf.co/chat/assistant/661d77310e3aea9ae571e43c

3 replies

·

reacted to Molbap's post with 🔥 10 months ago

Post

5143

🚀🚀 Exciting times for the document AI community!

We're thrilled to announce the release of some of the largest OCR datasets available to the public.
🔥 With over 26 million pages , 18 billion text tokens, and 6TB of data, these resources are a significant leap forward for document AI research.

Here's how to access these datasets quickly:

from datasets import load_dataset

pdfa_dataset = load_dataset('pixparse/pdfa-eng-wds', streaming=True)
IDL_dataset = load_dataset('pixparse/idl-wds', streaming=True)

This enables you to stream them directly, integrating seamlessly with your projects using the Hugging Face datasets library. On the hub, you can find them here:

pixparse/pdfa-eng-wds
pixparse/idl-wds

For lean data loading, the new [chug](https://github.com/huggingface/chug) library offers a solution with pdf decoding:

import chug

task_cfg = chug.DataTaskDocReadCfg(
    page_sampling='all',
)
data_cfg = chug.DataCfg(
    source='pixparse/pdfa-eng-wds',
    split='train',
    batch_size=None,
    format='hfids',
    num_workers=0,
)
data_loader = chug.create_loader(
    data_cfg,
    task_cfg,
)
sample = next(iter(data_loader))

We owe a huge thank you to Peter Wyatt, Kate Tasker, Rachel Taketa, Ali Furkan Biten, Ruben Tito, and their colleagues for their contributions. Their work putting these datasets together has been invaluable. 🤗

Looking Ahead:

We're on a mission to enhance document AI capabilities, and these datasets are just the beginning. With your engagement and innovation, we're confident in the community's ability to develop robust OCR solutions. We encourage you to explore these datasets, experiment with the code, and contribute to the collective progress in document AI.

For detailed information on usage and licensing, please refer to the dataset cards on the Hugging Face hub.

4 replies

·

reacted to andrewyng's post with 👍 11 months ago

Post

DeepLearning.AI just announced a new short course: Open Source Models with Hugging Face 🤗, taught by Hugging Face's own Maria Khalusova, Marc Sun and Younes Belkada!

As many of you already know, Hugging Face has been a game changer by letting developers quickly grab any of hundreds of thousands of already-trained open source models to assemble into new applications. This course teaches you best practices for building this way, including how to search and choose among models.

You'll learn to use the Transformers library and walk through multiple models for text, audio, and image processing, including zero-shot image segmentation, zero-shot audio classification, and speech recognition. You'll also learn to use multimodal models for visual question answering, image search, and image captioning. Finally, you’ll learn how to demo what you build locally, on the cloud, or via an API using Gradio and Hugging Face Spaces.

Thank you very much to Hugging Face's wonderful team for working with us on this.

You can sign up for the course here: https://www.deeplearning.ai/short-courses/open-source-models-hugging-face/