David Corvoysier
dacorvo
30 followers · 20 following
https://www.kaizou.org
dacorvo
AI & ML interests
Quantization
Articles
quanto: a pytorch quantization toolkit
Mar 18 • 15
Hugging Face Text Generation Inference available for AWS Inferentia2
Feb 1 • 1
Make your llama generation time fly with AWS Inferentia2
Nov 7, 2023 • 1
Organizations
dacorvo's activity
New activity in NousResearch/Llama-2-7b-chat-hf · 1 day ago
Fix invalid generation config
#9 opened 1 day ago by dacorvo
New activity in NousResearch/Llama-2-7b-hf · 1 day ago
Fix invalid generation config
#8 opened 1 day ago by dacorvo
New activity in aws-neuron/optimum-neuron-cache · 2 days ago
[Cache Request] meta-llama/Meta-Llama-3-8B
1
#88 opened 10 days ago by sanctuaire21
[Cache Request] meta-llama/Meta-Llama-3-8B
1
#87 opened 10 days ago by sanctuaire21
[Cache Request] mistralai/Mistral-7B-Instruct-v0.3
2
#86 opened 10 days ago by xapss
[Cache Request] mistralai/Mistral-7B-Instruct-v0.3
1
#93 opened 5 days ago by ajay1710
New activity in aws-neuron/optimum-neuron-cache · 13 days ago
Update README.md
1
#82 opened 13 days ago by JordanRichardson
New activity in aws-neuron/optimum-neuron-cache · about 1 month ago
[Cache Request] meta-llama/Meta-Llama-3-8B
1
#65 opened about 1 month ago by huntingcarlisle
[Cache Request] quilr-ai/semantic-dlp
3
#63 opened about 1 month ago by ksquarekumar
[Cache Request] meta-llama/Meta-Llama-3-70B-Instruct
1
#56 opened about 2 months ago by CodeVinayak
models for inf2.
5
#33 opened 2 months ago by AC2132
[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-latency
1
#58 opened about 2 months ago by Gerald001
[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-throughput
1
#57 opened about 2 months ago by Gerald001
[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-budget
1
#59 opened about 2 months ago by Gerald001
New activity in aws-neuron/optimum-neuron-cache · about 2 months ago
Can't find zephyr-7b-beta cache using optimum cli list command.
5
#21 opened 3 months ago by Anurag2132
[Cache Request] mistralai/Mistral-7B-Instruct-v0.2
1
#18 opened 3 months ago by krish1124
[Cache Request] mistralai/Mistral-7B-Instruct-v0.2
1
#39 opened 2 months ago by jburtoft
[Cache Request] TheBloke/Wizard-Vicuna-7B-Uncensored-GPTQ
1
#54 opened about 2 months ago by nadilio
[Cache Request] TheBloke/em_german_leo_mistral-GGUF
1
#50 opened about 2 months ago by OnurSarikaya2000
[Cache Request] meta-llama/Llama-2-7b-chat-hf
1
#51 opened about 2 months ago by naveen1601datalyticsfoundry
New activity in aws-neuron/optimum-neuron-cache · 2 months ago
[Cache Request] TheBloke/Llama-2-7B-Chat-GGML
1
#36 opened 2 months ago by lou987
[Cache Request] aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-2
1
#9 opened 3 months ago by RamiroRamirez
[Cache Request] aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-1
1
#6 opened 3 months ago by RamiroRamirez
Optimum-neuron-cache for inference?
2
#1 opened 5 months ago by jburtoft
Request for adding mistral with batch_size=8 or batch_size=4
4
#3 opened 4 months ago by michaelfeil
[Cache Request] TheBloke/OpenHermes-2.5-Mistral-7B-GGUF
1
#7 opened 3 months ago by boose101
[Cache Request] abacusai/Smaug-72B-v0.1
1
#12 opened 3 months ago by saqlainraza
[Cache Request] defog/sqlcoder-7b-2
1
#20 opened 3 months ago by marinap
New activity in aws-neuron/optimum-neuron-cache · 3 months ago
[Cache Request] google/gemma-7b
1
#14 opened 3 months ago by mihirjadhav
[Cache request] zephyr-7b-beta-neuron with sequence_length more than 4096
1
#11 opened 3 months ago by Anurag2132
[Cache Request] Helsinki-NLP/opus-mt-en-de
2
#10 opened 3 months ago by k10
[Cache Request] facebook/seamless-m4t-v2-large
2
#13 opened 3 months ago by aitransync
New activity in aws-neuron/optimum-neuron-cache · 4 months ago
Issue running v-alpha-tross after cache update
4
#2 opened 4 months ago by michaelfeil
New activity in aws-neuron/Mistral-neuron · 4 months ago
Deploy with Sagemaker LMI
9
#2 opened 4 months ago by josete89
New activity in aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-1 · 4 months ago
Could not find a matching NEFF for your HLO in this directory. When trying to load precompiled neuron artifacts
4
#2 opened 4 months ago by luuksuurmeijer
New activity in Jingya/tiny-random-t5-neuronx · 5 months ago
Upload folder using huggingface_hub
1
#1 opened 5 months ago by dacorvo
New activity in aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-1 · 5 months ago
Unable to successfully compile the model meta-llama/Llama-2-7b-chat-hf on Inf2 instance
6
#1 opened 5 months ago by WaelDataReply
New activity in huggingface/documentation-images · 7 months ago
Add pictures for llama2 on Inferentia2 blog post
2
#212 opened 7 months ago by dacorvo
New activity in huggingface/documentation-images · 8 months ago
upload assets for llama2 on inferentia2 blogpost
#196 opened 8 months ago by dacorvo
Create inferentia-llama2
#195 opened 8 months ago by dacorvo