40 1

David Corvoysier

dacorvo

https://www.kaizou.org

dacorvo

AI & ML interests

Quantization

Articles

Organizations

dacorvo's activity

New activity in NousResearch/Llama-2-7b-chat-hf 1 day ago

Fix invalid generation config

#9 opened 1 day ago by

dacorvo

New activity in NousResearch/Llama-2-7b-hf 1 day ago

Fix invalid generation config

#8 opened 1 day ago by

dacorvo

New activity in aws-neuron/optimum-neuron-cache 2 days ago

[Cache Request] meta-llama/Meta-Llama-3-8B

#88 opened 10 days ago by

sanctuaire21

[Cache Request] meta-llama/Meta-Llama-3-8B

#87 opened 10 days ago by

sanctuaire21

[Cache Request] mistralai/Mistral-7B-Instruct-v0.3

#86 opened 10 days ago by

xapss

[Cache Request] mistralai/Mistral-7B-Instruct-v0.3

#93 opened 5 days ago by

ajay1710

New activity in aws-neuron/optimum-neuron-cache 13 days ago

Update README.md

#82 opened 13 days ago by

JordanRichardson

New activity in aws-neuron/optimum-neuron-cache about 1 month ago

[Cache Request] meta-llama/Meta-Llama-3-8B

#65 opened about 1 month ago by

huntingcarlisle

[Cache Request] quilr-ai/semantic-dlp

#63 opened about 1 month ago by

ksquarekumar

[Cache Request] meta-llama/Meta-Llama-3-70B-Instruct

#56 opened about 2 months ago by

CodeVinayak

models for inf2.

#33 opened 2 months ago by

AC2132

[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-latency

#58 opened about 2 months ago by

Gerald001

[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-throughput

#57 opened about 2 months ago by

Gerald001

[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-budget

#59 opened about 2 months ago by

Gerald001

New activity in aws-neuron/optimum-neuron-cache about 2 months ago

Can't find zephyr-7b-beta cache using optimum cli list command.

#21 opened 3 months ago by

Anurag2132

[Cache Request] mistralai/Mistral-7B-Instruct-v0.2

#18 opened 3 months ago by

krish1124

[Cache Request] mistralai/Mistral-7B-Instruct-v0.2

#39 opened 2 months ago by

jburtoft

[Cache Request] TheBloke/Wizard-Vicuna-7B-Uncensored-GPTQ

#54 opened about 2 months ago by

nadilio

[Cache Request] TheBloke/em_german_leo_mistral-GGUF

#50 opened about 2 months ago by

OnurSarikaya2000

[Cache Request] meta-llama/Llama-2-7b-chat-hf

#51 opened about 2 months ago by

naveen1601datalyticsfoundry

New activity in aws-neuron/optimum-neuron-cache 2 months ago

[Cache Request] TheBloke/Llama-2-7B-Chat-GGML

#36 opened 2 months ago by

lou987

[Cache Request] aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-2

#9 opened 3 months ago by

RamiroRamirez

[Cache Request] aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-1

#6 opened 3 months ago by

RamiroRamirez

Optimum-neuron-cache for inference?

#1 opened 5 months ago by

jburtoft

Request for adding mistral with batch_size=8 or batch_size=4

#3 opened 4 months ago by

michaelfeil

[Cache Request] TheBloke/OpenHermes-2.5-Mistral-7B-GGUF

#7 opened 3 months ago by

boose101

[Cache Request] abacusai/Smaug-72B-v0.1

#12 opened 3 months ago by

saqlainraza

[Cache Request] defog/sqlcoder-7b-2

#20 opened 3 months ago by

marinap

New activity in aws-neuron/optimum-neuron-cache 3 months ago

[Cache Request] google/gemma-7b

#14 opened 3 months ago by

mihirjadhav

[Cache request] zephyr-7b-beta-neuron with sequence_length more than 4096

#11 opened 3 months ago by

Anurag2132

[Cache Request] Helsinki-NLP/opus-mt-en-de

#10 opened 3 months ago by

k10

[Cache Request] facebook/seamless-m4t-v2-large

#13 opened 3 months ago by

aitransync

New activity in aws-neuron/optimum-neuron-cache 4 months ago

Issue running v-alpha-tross after cache update

#2 opened 4 months ago by

michaelfeil

New activity in aws-neuron/Mistral-neuron 4 months ago

Deploy with Sagemaker LMI

#2 opened 4 months ago by

josete89

New activity in aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-1 4 months ago

Could not find a matching NEFF for your HLO in this directory. When trying to load precompiled neuron artifacts

#2 opened 4 months ago by

luuksuurmeijer

New activity in Jingya/tiny-random-t5-neuronx 5 months ago

Upload folder using huggingface_hub

#1 opened 5 months ago by

dacorvo

New activity in aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-1 5 months ago

Unable to successfully compile the model meta-llama/Llama-2-7b-chat-hf on Inf2 instance

#1 opened 5 months ago by

WaelDataReply

New activity in huggingface/documentation-images 7 months ago

Add pictures for llama2 on Inferentia2 blog post

#212 opened 7 months ago by

dacorvo

New activity in huggingface/documentation-images 8 months ago

upload assets for llama2 on inferentia2 blogpost

#196 opened 8 months ago by

dacorvo

Create inferentia-llama2

#195 opened 8 months ago by

dacorvo

David Corvoysier

AI & ML interests

Articles

quanto: a pytorch quantization toolkit

Hugging Face Text Generation Inference available for AWS Inferentia2

Make your llama generation time fly with AWS Inferentia2

Organizations

dacorvo's activity

Fix invalid generation config

Fix invalid generation config

[Cache Request] meta-llama/Meta-Llama-3-8B

[Cache Request] meta-llama/Meta-Llama-3-8B

[Cache Request] mistralai/Mistral-7B-Instruct-v0.3

[Cache Request] mistralai/Mistral-7B-Instruct-v0.3

Update README.md

[Cache Request] meta-llama/Meta-Llama-3-8B

[Cache Request] quilr-ai/semantic-dlp

[Cache Request] meta-llama/Meta-Llama-3-70B-Instruct

models for inf2.

[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-latency

[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-throughput

[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-budget

Can't find zephyr-7b-beta cache using optimum cli list command.

[Cache Request] mistralai/Mistral-7B-Instruct-v0.2

[Cache Request] mistralai/Mistral-7B-Instruct-v0.2

[Cache Request] TheBloke/Wizard-Vicuna-7B-Uncensored-GPTQ

[Cache Request] TheBloke/em_german_leo_mistral-GGUF

[Cache Request] meta-llama/Llama-2-7b-chat-hf

[Cache Request] TheBloke/Llama-2-7B-Chat-GGML

[Cache Request] aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-2

[Cache Request] aws-neuron/Llama-2-7b-chat-hf-seqlen-2048-bs-1

Optimum-neuron-cache for inference?

Request for adding mistral with batch_size=8 or batch_size=4

[Cache Request] TheBloke/OpenHermes-2.5-Mistral-7B-GGUF

[Cache Request] abacusai/Smaug-72B-v0.1

[Cache Request] defog/sqlcoder-7b-2

[Cache Request] google/gemma-7b

[Cache request] zephyr-7b-beta-neuron with sequence_length more than 4096

[Cache Request] Helsinki-NLP/opus-mt-en-de

[Cache Request] facebook/seamless-m4t-v2-large

Issue running v-alpha-tross after cache update

Deploy with Sagemaker LMI

Could not find a matching NEFF for your HLO in this directory. When trying to load precompiled neuron artifacts

Upload folder using huggingface_hub

Unable to successfully compile the model meta-llama/Llama-2-7b-chat-hf on Inf2 instance

Add pictures for llama2 on Inferentia2 blog post

upload assets for llama2 on inferentia2 blogpost

Create inferentia-llama2