Pradeep T (Pradeep1995)
AI & ML interests
None yet
Recent Activity
New activity about 1 month ago in deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B: Step-by-step guide for distillation
New activity about 2 months ago in deepseek-ai/DeepSeek-R1: Transformer version required?
New activity 12 months ago in text-generation-inference/Mixtral-8x7B-Instruct-v0.1-medusa: How to use this model on SageMaker endpoints
Organizations
None yet
Pradeep1995's activity
Step-by-step guide for distillation
#23 opened about 1 month ago by Pradeep1995

Transformer version required?
#24 opened about 2 months ago by Pradeep1995

How to use this model on SageMaker endpoints
2 replies · #1 opened about 1 year ago by LorenzoCevolaniAXA

What is the actual context size of the google/gemma-7b model?
1 reply · #81 opened 12 months ago by Pradeep1995

What is the actual context size of the mistralai/Mixtral-8x7B-Instruct-v0.1 model?
3 replies · #186 opened 12 months ago by Pradeep1995

PEFT-based fine-tuned model hallucinates values from the fine-tuning training data during inference
7 replies · #111 opened about 1 year ago by Pradeep1995

Special token (</s>) not generated by the model.generate() method
7 replies · #47 opened about 1 year ago by Pradeep1995

Can we save the fine-tuned Mistral model by exporting it to TorchScript?
1 reply · #46 opened about 1 year ago by Pradeep1995

What is the best way to run inference with LoRA in the PEFT approach?
8 replies · #70 opened about 1 year ago by Pradeep1995

What is the best way to run inference with LoRA in the PEFT approach?
#3 opened about 1 year ago by Pradeep1995

What is the best way to run inference with LoRA in the PEFT approach?
#53 opened about 1 year ago by Pradeep1995

What is the best way to run inference with LoRA in the PEFT approach?
#43 opened about 1 year ago by Pradeep1995

What is the best way to run inference with LoRA in the PEFT approach?
#96 opened about 1 year ago by Pradeep1995

What is the correct way to store the adapters after PEFT fine-tuning?
4 replies · #67 opened about 1 year ago by Pradeep1995

What is the correct way to store the adapter after PEFT fine-tuning?
#42 opened about 1 year ago by Pradeep1995

Should we follow the same OpenChat prompt structure during fine-tuning?
3 replies · #38 opened about 1 year ago by Pradeep1995

PEFT-based fine-tuned model hallucinates values from the fine-tuning training data during inference
1 reply · #39 opened about 1 year ago by Pradeep1995

Incomplete output even with max_new_tokens
12 replies · #107 opened about 1 year ago by Pradeep1995

Should we follow the same Mistral prompt structure during fine-tuning?
#110 opened about 1 year ago by Pradeep1995

Incomplete output even with max_new_tokens
1 reply · #37 opened about 1 year ago by Pradeep1995
