Nicolai Rosenberg

SanRosenberg

AI & ML interests

None yet

Recent Activity

liked a Space 3 months ago

Qwen/Qwen2.5-Turbo-1M-Demo

upvoted an article 5 months ago

HTRflow - A tool for HTR and OCR

liked a model 11 months ago

AI-aktindsigt/kommunal_semantisk_grundmodel_1_og_2

View all activity

Organizations

SanRosenberg's activity

liked a Space 3 months ago

345

Qwen2.5 Turbo 1M Demo

💻

Upload documents for Q&A

upvoted an article 5 months ago

Article

HTRflow - A tool for HTR and OCR

and 3 others •

Oct 1, 2024

• 15

liked a model 11 months ago

AI-aktindsigt/kommunal_semantisk_grundmodel_1_og_2

Updated Mar 14, 2024 • 4

liked a Space 12 months ago

Google Gemma

🔥

reacted to trisfromgoogle's post with ❤️ 12 months ago

Post

I am thrilled to announce Gemma, new 2B and 7B models from Google, based on the same research and technology used to train the Gemini models! These models achieve state-of-the-art performance for their size, and are launched across Transformers, Google Cloud, and many other surfaces worldwide starting today.

Get started using and adapting Gemma in the model Collection: google/gemma-release-65d5efbccdbb8c4202ec078b

These launches are the product of an outstanding collaboration between the Google DeepMind and Hugging Face teams over the last few months -- very proud of the work both teams have done, from integration with Vertex AI to optimization across the stack. Read more about the partnership in the main launch by @philschmid @osanseviero @pcuenq on the launch blog: https://huggingface.co/blog/gemma

More information below if you are curious about training details, eval results, and safety characteristics!

Gemma Tech Report: https://goo.gle/GemmaReport
Launch announcement: https://blog.google/technology/developers/gemma-open-models/

6 replies

liked 2 models 12 months ago

google/gemma-7b-it

Text Generation • Updated Aug 14, 2024 • 138k • 1.15k

google/gemma-7b

Text Generation • Updated Jun 27, 2024 • 64.8k • • 3.12k

reacted to akhaliq's post with ❤️ 12 months ago

Post

Neural Network Diffusion

Neural Network Diffusion (2402.13144)

Diffusion models have achieved remarkable success in image and video generation. In this work, we demonstrate that diffusion models can also generate high-performing neural network parameters. Our approach is simple, utilizing an autoencoder and a standard latent diffusion model. The autoencoder extracts latent representations of a subset of the trained network parameters. A diffusion model is then trained to synthesize these latent parameter representations from random noise. It then generates new representations that are passed through the autoencoder's decoder, whose outputs are ready to use as new subsets of network parameters. Across various architectures and datasets, our diffusion process consistently generates models of comparable or improved performance over trained networks, with minimal additional cost. Notably, we empirically find that the generated models perform differently with the trained networks. Our results encourage more exploration on the versatile use of diffusion models.

liked a model 12 months ago

mhenrichsen/danskgpt-tiny-chat

Text Generation • Updated Jan 27, 2024 • 202 • 12

liked 2 models about 1 year ago

cerebras/btlm-3b-8k-chat

Text Generation • Updated Dec 8, 2023 • 123 • 12

mhenrichsen/danskgpt-tiny

Text Generation • Updated Jan 13, 2024 • 223 • 18

liked a model over 1 year ago

mhenrichsen/hestenettetLM

Text Generation • Updated Jan 16, 2024 • 1.98k • 3