Hugging Face
David Corvoysier
dacorvo
29 followers · 20 following
https://www.kaizou.org
AI & ML interests
Quantization
Articles
quanto: a pytorch quantization toolkit
Mar 18
•
11
Hugging Face Text Generation Inference available for AWS Inferentia2
Feb 1
•
1
Make your llama generation time fly with AWS Inferentia2
Nov 7, 2023
•
1
dacorvo's activity
New activity in
aws-neuron/optimum-neuron-cache
7 days ago
[Cache Request] meta-llama/Meta-Llama-3-8B
1
#65 opened 10 days ago by
huntingcarlisle
New activity in
aws-neuron/optimum-neuron-cache
11 days ago
[Cache Request] quilr-ai/semantic-dlp
3
#63 opened 14 days ago by
ksquarekumar
New activity in
aws-neuron/optimum-neuron-cache
21 days ago
[Cache Request] meta-llama/Meta-Llama-3-70B-Instruct
1
#56 opened 22 days ago by
CodeVinayak
models for inf2.
5
#33 opened about 2 months ago by
AC2132
[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-latency
1
#58 opened 22 days ago by
Gerald001
[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-throughput
1
#57 opened 22 days ago by
Gerald001
[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-budget
1
#59 opened 22 days ago by
Gerald001
New activity in
aws-neuron/optimum-neuron-cache
23 days ago
Can't find zephyr-7b-beta cache using optimum cli list command.
5
#21 opened about 2 months ago by
Anurag2132
[Cache Request] mistralai/Mistral-7B-Instruct-v0.2
1
#18 opened about 2 months ago by
krish1124
[Cache Request] mistralai/Mistral-7B-Instruct-v0.2
1
#39 opened about 1 month ago by
jburtoft
[Cache Request] TheBloke/Wizard-Vicuna-7B-Uncensored-GPTQ
1
#54 opened 24 days ago by
nadilio
New activity in
aws-neuron/optimum-neuron-cache
24 days ago
[Cache Request] TheBloke/em_german_leo_mistral-GGUF
1
#50 opened 25 days ago by
OnurSarikaya2000
[Cache Request] meta-llama/Llama-2-7b-chat-hf
1
#51 opened 24 days ago by
naveen1601datalyticsfoundry
New activity in
aws-neuron/optimum-neuron-cache
about 2 months ago
[Cache Request] TheBloke/Llama-2-7B-Chat-GGML
1
#36 opened about 2 months ago by
lou987
[Cache Request] aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-2
1
#9 opened 2 months ago by
RamiroRamirez
[Cache Request] aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-1
1
#6 opened 2 months ago by
RamiroRamirez
Optimum-neuron-cache for inference?
2
#1 opened 4 months ago by
jburtoft
Request for adding mistral with batch_size=8 or batch_size=4
4
#3 opened 3 months ago by
michaelfeil
[Cache Request] TheBloke/OpenHermes-2.5-Mistral-7B-GGUF
1
#7 opened 2 months ago by
boose101
[Cache Request] abacusai/Smaug-72B-v0.1
1
#12 opened 2 months ago by
saqlainraza
[Cache Request] defog/sqlcoder-7b-2
1
#20 opened about 2 months ago by
marinap
New activity in
aws-neuron/optimum-neuron-cache
2 months ago
[Cache Request] google/gemma-7b
1
#14 opened 2 months ago by
mihirjadhav
[Cache request] zephyr-7b-beta-neuron with sequence_length more than 4096
1
#11 opened 2 months ago by
Anurag2132
[Cache Request] Helsinki-NLP/opus-mt-en-de
2
#10 opened 2 months ago by
k10
[Cache Request] facebook/seamless-m4t-v2-large
2
#13 opened 2 months ago by
aitransync
New activity in
aws-neuron/optimum-neuron-cache
3 months ago
Issue running v-alpha-tross after cache update
4
#2 opened 3 months ago by
michaelfeil
New activity in
aws-neuron/Mistral-neuron
4 months ago
Deploy with Sagemaker LMI
9
#2 opened 4 months ago by
josete89
New activity in
aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-1
4 months ago
Could not find a matching NEFF for your HLO in this directory. When trying to load precompiled neuron artifacts
4
#2 opened 4 months ago by
luuksuurmeijer
New activity in
Jingya/tiny-random-t5-neuronx
4 months ago
Upload folder using huggingface_hub
1
#1 opened 4 months ago by
dacorvo
New activity in
aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-1
4 months ago
Unable to successfully compile the model meta-llama/Llama-2-7b-chat-hf on Inf2 instance
6
#1 opened 4 months ago by
WaelDataReply
New activity in
huggingface/documentation-images
7 months ago
Add pictures for llama2 on Inferentia2 blog post
2
#212 opened 7 months ago by
dacorvo
upload assets for llama2 on inferentia2 blogpost
#196 opened 7 months ago by
dacorvo
Create inferentia-llama2
#195 opened 7 months ago by
dacorvo