Sayantan Das's picture

Sayantan Das

ucalyptus

·

https://ucalyptus.me/

AI & ML interests

Generative Modeling

Recent Activity

liked a Space about 22 hours ago

data-agents/jupyter-agent

liked a model 3 days ago

thanhkt/codegemma-7B-ManimGen

liked a model 3 days ago

TWO/sutra-mlt256-v2

View all activity

Organizations

ucalyptus's activity

New activity in asif00/bangla-llama 7 months ago

I'm interested in improving these

#1 opened 7 months ago by

New activity in defog/llama-3-sqlcoder-8b 7 months ago

Are there plans to distribute this model on Ollama.ai?

#4 opened 7 months ago by

New activity in ucalyptus/prem-1B-chat-webgpu 7 months ago

Currently experiences difficulties with generation

#1 opened 7 months ago by

New activity in Xenova/experimental-moondream-webgpu 7 months ago

How to create such hf spaces?

#3 opened 7 months ago by

New activity in defog/llama-3-sqlcoder-8b 7 months ago

May I ask if this model is a chat model or a base model?

#3 opened 7 months ago by

New activity in Xenova/experimental-phi3-webgpu 7 months ago

How do I run npm / transformers.js to create the .js files ?

#3 opened 7 months ago by

New activity in ucalyptus/prem-1B-chat-ONNX 7 months ago

TODO: add model_quantized.onnx_data

#1 opened 7 months ago by

New activity in Xenova/Phi-3-mini-4k-instruct_fp16 7 months ago

how is this fp16 when filename has q4?

#1 opened 7 months ago by

New activity in mlc-ai/Llama-3-8B-Instruct-q3f16_2-MLC 7 months ago

How do u make these?

#1 opened 7 months ago by

New activity in ucalyptus/prem-7B-chat 7 months ago

Gotta eval this mf

#1 opened 7 months ago by

New activity in ucalyptus/prem-615M-chat 8 months ago

How to up-merge?

#1 opened 8 months ago by

New activity in FL33TW00D-HF/ratchet-phi 8 months ago

How do I run this space locally?

#1 opened 8 months ago by

New activity in stabilityai/stablelm-2-12b-chat 8 months ago

Suggest some datasets and techniques for self-knowledge

#3 opened 8 months ago by

New activity in winglian/llama-3-8b-1m-PoSE 8 months ago

How would this compare training time wise with gradientai/Llama-3-8B-Instruct-Gradient-1048k ?

#1 opened 8 months ago by

New activity in microsoft/Phi-3-mini-128k-instruct 8 months ago

No base model?

#45 opened 8 months ago by

commented a paper 8 months ago

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Paper • 2309.10400 • Published Sep 19, 2023 • 26 •

New activity in damerajee/hathi-moe-test 8 months ago

Can you add a README to it?

#1 opened 8 months ago by

New activity in Crystalcareai/GemMoE-Medium-v0.4 9 months ago

can you share the code?

#2 opened 9 months ago by

commented 2 papers 10 months ago

WebArena: A Realistic Web Environment for Building Autonomous Agents

Paper • 2307.13854 • Published Jul 25, 2023 • 23 •

IndicVoices: Towards building an Inclusive Multilingual Speech Dataset for Indian Languages

Paper • 2403.01926 • Published Mar 4 • 1 •