Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
sravanthi pulijala
sravanthib
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
18 minutes ago
sravanthib/Qwen-2.5-7B-Simple-RL
published
a model
about 19 hours ago
sravanthib/Qwen-math-open-RL
published
a model
about 21 hours ago
sravanthib/Qwen-math-Simple-RL
View all activity
Organizations
None yet
sravanthib
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a model
18 minutes ago
sravanthib/Qwen-2.5-7B-Simple-RL
Text Generation
•
Updated
18 minutes ago
•
157
published
a model
about 19 hours ago
sravanthib/Qwen-math-open-RL
Updated
about 19 hours ago
published
a model
about 21 hours ago
sravanthib/Qwen-math-Simple-RL
Updated
about 21 hours ago
updated
a model
2 days ago
sravanthib/qwen-32b-multinode-try
Updated
2 days ago
published
2 models
2 days ago
sravanthib/qwen-32b-multinode-try
Updated
2 days ago
sravanthib/new-multinode-try
Updated
2 days ago
updated
a model
2 days ago
sravanthib/multinode-try
Updated
2 days ago
published
a model
2 days ago
sravanthib/multinode-try
Updated
2 days ago
updated
2 models
2 days ago
sravanthib/with_accelerate_output_Qwen2-0.5B-GRPO-test
Updated
2 days ago
sravanthib/tokenizer-aded-Llama3.1-8b-instruct-RL
Updated
2 days ago
published
a model
2 days ago
sravanthib/tokenizer-aded-Llama3.1-8b-instruct-RL
Updated
2 days ago
published
a model
3 days ago
sravanthib/with_accelerate_output_Qwen2-0.5B-GRPO-test
Updated
2 days ago
updated
a model
3 days ago
sravanthib/with_accelarate_output_Qwen2-0.5B-GRPO-test
Updated
3 days ago
published
a model
3 days ago
sravanthib/single_node_llama_custom-code-test
Updated
3 days ago
updated
a model
4 days ago
sravanthib/Final-try-Llama3.1-8b-instruct-RL
Text Generation
•
Updated
4 days ago
•
63
published
4 models
4 days ago
sravanthib/grpo-output
Updated
4 days ago
sravanthib/Final-try-Llama3.1-8b-instruct-RL
Text Generation
•
Updated
4 days ago
•
63
sravanthib/Llama-Simple-RL
Updated
4 days ago
sravanthib/Simple-RL
Updated
4 days ago
published
a model
5 days ago
sravanthib/SFT_and_RL_final-Simple-RL
Updated
5 days ago
Load more