Csaba Kecskemeti PRO

csabakecskemeti

Recent Activity

updated a model 13 minutes ago
DevQuasar/allura-org.Teleut-7b-GGUF
updated a model about 2 hours ago
DevQuasar/MTSAIR.Cotype-Nano-GGUF
updated a model about 4 hours ago
DevQuasar/Nexusflow.Athene-V2-Agent-GGUF

csabakecskemeti's activity

posted an update 1 day ago
I have a small utility: no_more_typo
It runs in the background and can call an LLM to update the text on the clipboard. I think it is ideal for fixing typos and syntax.
I have just added the option to use custom prompt templates to perform different tasks.

Details, code and executable:
https://github.com/csabakecskemeti/no_more_typo

https://devquasar.com/no-more-typo/
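
For readers curious how a custom prompt template could drive a clipboard tool like this, here is a minimal sketch. It is not the actual no_more_typo code: the `{text}` placeholder, the function names, and the `llm` callable are all assumptions made for illustration, and clipboard access is left out so the example stays self-contained.

```python
from typing import Callable

# Hypothetical default template; the real tool's template format may differ.
DEFAULT_TEMPLATE = (
    "Fix the typos and syntax in the following text. "
    "Return only the corrected text:\n\n{text}"
)


def build_prompt(template: str, clipboard_text: str) -> str:
    """Insert the clipboard text into the template's {text} placeholder."""
    if "{text}" not in template:
        raise ValueError("template must contain a {text} placeholder")
    # str.replace is used instead of str.format so that braces inside the
    # clipboard text (for example code snippets) are left untouched.
    return template.replace("{text}", clipboard_text)


def process_clipboard(
    clipboard_text: str,
    llm: Callable[[str], str],
    template: str = DEFAULT_TEMPLATE,
) -> str:
    """Send the filled-in prompt to any LLM callable and tidy its answer."""
    return llm(build_prompt(template, clipboard_text)).strip()
```

Swapping the template is what changes the task: passing something like `"Translate to English:\n\n{text}"` would turn the same loop into a translator, with the model call itself unchanged.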
replied to their post 3 days ago

Exactly :D
Overflow fridge will be replaced with a rack :)

Phi3 or Mistral?

2
#3 opened 3 days ago by csabakecskemeti
Reacted to cappuch's post with 😎 3 days ago
posted an update 3 days ago
Repurposed my older AI workstation as a homelab server; it has received 2x V100 + 1x P40.
I can reach a huge 210k-token context size with MegaBeam-Mistral-7B-512k-GGUF at ~70+ tok/s, or run Llama-3.1-Nemotron-70B-Instruct-HF-GGUF with a 50k context at ~10 tok/s (on the V100s only: 40k ctx and 15 tok/s).
I can also LoRA fine-tune with performance similar to an RTX 3090.
It moved to the garage, so there are no complaints from the family about the noise. Will move to a rack soon :D
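
As a rough back-of-the-envelope check on why a 210k-token context needs memory spread across several GPUs, the sketch below estimates the KV cache size. The architecture values (32 layers, 8 KV heads, head dimension 128) are the standard Mistral-7B configuration and are assumed here, not stated in the post; the fp16 cache is also an assumption, and a quantized KV cache would be smaller.

```python
def kv_cache_bytes(
    n_tokens: int,
    n_layers: int = 32,       # assumed: standard Mistral-7B depth
    n_kv_heads: int = 8,      # assumed: grouped-query attention KV heads
    head_dim: int = 128,      # assumed: per-head dimension
    bytes_per_value: int = 2, # assumed: fp16 cache entries
) -> int:
    """Estimate KV cache size: one key and one value vector per layer per token."""
    per_token = 2 * n_layers * n_kv_heads * head_dim * bytes_per_value
    return n_tokens * per_token


if __name__ == "__main__":
    size_gib = kv_cache_bytes(210_000) / 2**30
    print(f"Estimated KV cache at 210k tokens: {size_gib:.2f} GiB")
```

Under these assumptions the cache costs 128 KiB per token, so the cache alone at 210k tokens comes to roughly 25.6 GiB before counting the model weights.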
  • 2 replies