meta-llama
/

Llama-3.1-70B-Instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Resources

View closed (15)

Access request FAQ

#13 opened 3 months ago by

How to run llama3.1-70B-Instruct inference with mutil-gpu？

#38 opened 15 days ago by

Slow response : Text validation

#37 opened 19 days ago by

Error

#36 opened 26 days ago by

Independent evaluation results

#35 opened about 1 month ago by

Request: DOI

#34 opened about 2 months ago by

Inference client to be added to the pipeline

#33 opened about 2 months ago by

Llama 3.1 models continuously unavailable

#32 opened about 2 months ago by

Use of "parameters" or "arguments" in chat template

#31 opened about 2 months ago by

Update tokenizer_config.json

#30 opened 2 months ago by

Deploying to dedicated Inference Endpoints

#29 opened 2 months ago by

Compute Instance Requirement

#28 opened 3 months ago by

Slow inference/low GPU utilization.

#27 opened 3 months ago by

Pruning

#24 opened 3 months ago by

Context window size?

#23 opened 3 months ago by

Fix chat_template for tool-calling

#22 opened 3 months ago by

[ToolCalling] Fix chat_template error

#21 opened 3 months ago by

what is a way to verify the model I am running is performing as expected?

#18 opened 3 months ago by

GGuf

#17 opened 3 months ago by

What's up with the MATH Lvl 5 score on HF Open LLM Leaderboard 2?

#16 opened 3 months ago by

🚀 LMDeploy support Llama3.1 and its Tool Calling. An example of calling "Wolfram Alpha" to perform complex mathematical calculations can be found from here!

#14 opened 3 months ago by

Issue with Tokenizer when deploying with TGI

#10 opened 3 months ago by

Bug in config.json?

#7 opened 3 months ago by