nyuuzyou

nyuuzyou

AI & ML interests

None yet

Recent Activity

Organizations

Social Post Explorers's profile picture

nyuuzyou's activity

posted an update about 15 hours ago
view post
Post
440
✈️ Aircraft Dataset & Generation Model nyuuzyou/aircraft-images & nyuuzyou/AircraftFLUX-LoRA

Dataset Features:
• 165,340 high-res aircraft images with metadata
• Machine-generated English captions
• Detailed aircraft specs, registration & flight info
• Environmental context descriptions

LoRA model specializes in:
• Realistic aircraft generation
• Accurate technical details for unpopular airplanes compared to black-forest-labs/FLUX.1-schnell
• Proper airline liveries
• Contextual aviation scenes
reacted to reach-vb's post with 🔥 2 days ago
view post
Post
2324
VLMs are going through quite an open revolution AND on-device friendly sizes:

1. Google DeepMind w/ PaliGemma2 - 3B, 10B & 28B: google/paligemma-2-release-67500e1e1dbfdd4dee27ba48

2. OpenGVLabs w/ InternVL 2.5 - 1B, 2B, 4B, 8B, 26B, 38B & 78B: OpenGVLab/internvl-25-673e1019b66e2218f68d7c1c

3. Qwen w/ Qwen 2 VL - 2B, 7B & 72B: Qwen/qwen2-vl-66cee7455501d7126940800d

4. Microsoft w/ FlorenceVL - 3B & 8B: https://huggingface.co/jiuhai

5. Moondream2 w/ 0.5B: https://huggingface.co/vikhyatk/

What a time to be alive! 🔥
replied to Duskfallcrew's post 4 days ago
replied to Duskfallcrew's post 4 days ago
view reply

I'm waiting for at least an announcement explaining what's going on and at most how this quota is calculated

replied to nroggendorff's post 5 days ago
posted an update 5 days ago
view post
Post
741
what a shame
·
reacted to davidberenstein1957's post with 👍 6 days ago
view post
Post
3326
The Data Is Better Together community is set to release the first Apache 2 licensed image preference dataset!

Great work and let's give this a final push :)

@aashish1904 congrats on your month of HF pro. There is more to win during this sprint!

@aashish1904 @AnyaDesdein @davidberenstein1957 @Malalatiana @beta3 @fffiloni @munish0838 @Reza2kn @bbunzeck @Creazycreator @andrei-saceleanu @jafhaponiuk @rca-etl @kf120 @burtenshaw @mmhamdy @grib0ed0v @Doopus @AnyaDes @ttkap @Xceron @Lewox @davanstrien @Azazelle @adirik @Ashish08 @AntonVic @kenantang @sdiazlor @g-ronimo @dennis-rall @prithivMLmods @girtss3 @flozi00 @WaveCut @Taylor658 @Wildminder @Sara9999 @phaelishall @sararob @dvilasuero @pgabrys @plaguss @CDS899 @timajwilliams @rudzinskimaciej @pavel-ai @aggr8 @ignacioct @MouseAI @Leeps @MaksKul @NicolasDmln @Muinez @kusht55 @caiolang @Jakub-Brand24 @loamy @Demijan @eliab96 @Viewegger @JosephCatrambone @p1atdev @mrshu @o639 @Targezed @Aviv-anthonnyolime @thliang01 @Ahmed-Amine @glards @pranaykoppula @nataliaElv @MaPirlet @alvarobartt @gabrielmbmb @zlicastro @Jaydip @Chouettecheveche @lilcheaty @ruyrdiaz @robintema @fdaudens @ggcristian @a-r-r-o-w @pates @joheras @stopsatgreen @bezo97 @chachi902 @iamyann @liamcripwell @dmb23 @korbih @anonymous7743 @akbdx18 @OVAWARE @severo @akontra @lichorosario @lhoestq @SebastianBodza @Vishnou @ameerazam08 @appoose @Mukei @mearco @joaquincabezas @Fizzarolli @thomastraum @igortopolski @OxxoCodes @patrickfleith @asoria @bn22 @sitammeur @Krodolf @bergr7f @Sbxxn @wietsevenema @sugatoray @Iamladi @MikeTrizna @feveromo @mokady @Bolero @prath @Dowwie @kfahn @decodingchris @alili2050 @RahulRaman @yzimmermann @Ameeeee @ecyht2 @MattMC001 @hemanthkumarak @Thegorgibus @akos2 @LawRun @ramithuh @SuperMuel @sjans @peterizsak @mosama @Eyel @mtr3 @cfahlgren1 @legentil @clem @Citaman @Aurelien-Morgan @AntoineBourgois @TotoB12 @Stanmey @osanseviero @multimodalart @maxiw @ariG23498 @ngk89 @femboysLover @dvs @tacohiddink @blanchon @DavidJimenez
  • 1 reply
·
reacted to nroggendorff's post with 🤯 6 days ago
replied to their post 6 days ago
view reply

You don't have to ask. You just continue to upload and as long your models provide value to the AI community, they will be fine with it.

That's not entirely true. For example, I had to request a limit increase for my anime art dataset containing NSFW content (which fully complies with HF rules). Here is the response I received:
изображение.png
While HF's policies don't explicitly state they won't provide storage and computational quota for NSFW content, my experience suggests otherwise. Does this dataset and potential models trained on it provide value to the AI community? I believe so. But would they grant a quota increase for it? You can see their answer above.

replied to their post 7 days ago
view reply

I'm glad to hear that. But asking for a quota to publish a 100MB dataset is a bit strange.

replied to their post 7 days ago
view reply

Anyway, until an official announcement is made, I'm going to act as normal. I have started backing up locally just in case...

I only hope that no model disappears in surprise when it really doesn't need to disappear.😓

So far, no restrictions have been implemented, and I can still upload new files, so the situation isn't critical yet.

replied to their post 7 days ago
view reply

I hope Hugging Face will find ways to expand storage for free or provide unlimited storage for PRO users instead of the current 1 TB limit. Kaggle offers unlimited storage for public repositories per account, with a limit of 200 GB per repository (which is less generous than Hugging Face's 300 GB). However, Kaggle won't come close to replacing Hugging Face's functionality, so we have nowhere else to turn.

We might need someone from hugging face to actually weigh in. Maybe someone can @ a member in here, if anyone knows.

I'm sure the team was aware of the community's reaction to these changes before they were implemented. Now we'll just have to wait for the full details to come out.

replied to their post 7 days ago
view reply

I hope that there will be some form of a contingency plans, otherwise most of the finetunes and quants on large models might need to be removed and I hope that won't happen. I would hate to have to go Q4 only from now on, just because no one can upload multiple quants because no space.

I hope Hugging Face will find ways to expand storage for free or provide unlimited storage for PRO users instead of the current 1 TB limit. Kaggle offers unlimited storage for public repositories per account, with a limit of 200 GB per repository (which is less generous than Hugging Face's 300 GB). However, Kaggle won't come close to replacing Hugging Face's functionality, so we have nowhere else to turn.

posted an update 7 days ago
view post
Post
2637
its over
·
posted an update 11 days ago
posted an update 12 days ago
reacted to davanstrien's post with ❤️ 12 days ago
view post
Post
2353
First dataset for the new Hugging Face Bluesky community organisation: bluesky-community/one-million-bluesky-posts 🦋

📊 1M public posts from Bluesky's firehose API
🔍 Includes text, metadata, and language predictions
🔬 Perfect to experiment with using ML for Bluesky 🤗

Excited to see people build more open tools for a more open social media platform!
posted an update 13 days ago
view post
Post
930
Hugging Face recently added Bluesky to profile links, which is cool. It would be great to also support links to alternative Git services like Codeberg, GitLab, and Gitea. Many developers use platforms beyond GitHub, and showcasing repositories from these sites would be a great feature
replied to LukeNeumann's post 13 days ago
view reply

I've published almost 70 datasets, and from what I've seen, a combination of downloads and likes seems to be the way to go. My dataset nyuuzyou/subdomains has a few likes, but at its peak it had over 4,000 downloads in a month, and it wasn't in trending at all.