Falcon LLM TII UAE
FalconLLM
AI & ML interests
Large language models
Organizations
FalconLLM's activity
Update README.md
#1 opened 11 months ago
by
philschmid
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1624629516652-5ff5d596f244529b3ec0fb89.png)
Move to in-library checkpoint
#2 opened about 1 year ago
by
Rocketknight1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1660312628256-60ba519750effef3a58beac3.png)
Move to in-library checkpoint
#4 opened about 1 year ago
by
Rocketknight1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1660312628256-60ba519750effef3a58beac3.png)
Move to in-library checkpoint
#56 opened about 1 year ago
by
Rocketknight1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1660312628256-60ba519750effef3a58beac3.png)
Move to in-library checkpoint
1
#57 opened about 1 year ago
by
Rocketknight1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1660312628256-60ba519750effef3a58beac3.png)
Move to in-library checkpoint
#60 opened about 1 year ago
by
Rocketknight1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1660312628256-60ba519750effef3a58beac3.png)
Move to in-library checkpoint
1
#81 opened about 1 year ago
by
Rocketknight1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1660312628256-60ba519750effef3a58beac3.png)
Upload Отчет_Системный_Индекс_счастья_Medical_Case.pdf
#34 opened about 1 year ago
by
Romanzar
a100-80g memory but still call error
6
#32 opened about 1 year ago
by
leocheung
how to implement multiquery, FlashAttention and alibi.
1
#29 opened about 1 year ago
by
NickyNicky
![](https://cdn-avatars.huggingface.co/v1/production/uploads/641b435ba5f876fe30c5ae0a/OknUuweWxX3IzUZIKZ6CF.png)
Why not add system requirements on the model card?
9
#28 opened about 1 year ago
by
johnjohndoedoe
Getting "trust_remote_code" Error when Running SageMaker Deploy Code Sample
3
#27 opened about 1 year ago
by
garystafford
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/1AltROWohmoRop3-QeimM.png)
Fix "Finetuned from model" link
#26 opened about 1 year ago
by
rocca
Finetuned from model: Falcon-7B???
1
#25 opened about 1 year ago
by
DrNicefellow
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5f2220612128ba01bdf08c26/PTAD_Zcdq_O91DfYaDpNW.png)
Caching doesn't work on multi gpu
4
#23 opened about 1 year ago
by
eastwind
How to finetune the falcon-40b
2
#21 opened about 1 year ago
by
jiangix
Slow and Gibberish when inferencing
11
#20 opened about 1 year ago
by
eastwind
Is there a way to control the temperature of the model?
1
#19 opened about 1 year ago
by
zkdtckk
Update README.md: Update Model Description to reference Falcon-40B as the base model for falcon-40b-instruct
#17 opened about 1 year ago
by
AliSab
SageMaker Endpoint error during inference
12
#16 opened about 1 year ago
by
Shridharalve
Response language issue with fastchat
1
#14 opened about 1 year ago
by
manishl127
Custom 4-bit Finetuning 5-7 times faster inference than QLora
1
#9 opened about 1 year ago
by
rmihaylov
How to try Flacon in HuggingChat?
5
#6 opened about 1 year ago
by
promptgai
Might be interesting to have a thread on people with Successful Implementations, and on what kind of hardware..
1
#53 opened about 1 year ago
by
LinuxMagic
What is the inference time? Any ideas how to make it faster?
1
#52 opened about 1 year ago
by
leoapolonio
Is it really Good ?
1
#51 opened about 1 year ago
by
a749734
multiquery attention
1
#46 opened about 1 year ago
by
ZhongYingMatrix
Could you share the full pretraining data of Falcon-40B
1
#45 opened about 1 year ago
by
ChangranHuuu
how much Vram does it take to run Falcon 40b
7
#44 opened about 1 year ago
by
Toaster496
Question: Not support Arabic.
4
#43 opened about 1 year ago
by
awyshen
Add hf endpoint handler
1
#42 opened about 1 year ago
by
olivierdehaene
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62a093d63e7d1dda047039fc/QGpVSKuJLwl2EsiffCYML.jpeg)
Update README.md
2
#40 opened about 1 year ago
by
roboojack
Falcon 40B Inference at 4bit in Google Colab
27
#38 opened about 1 year ago
by
serin32
In addition to task 'text-generation', can falcon be used for other tasks like summarization, QA etc?
3
#37 opened about 1 year ago
by
VS9205
Fine-tuning on a new language
4
#35 opened about 1 year ago
by
AliMirlou
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63fdf13f8b3c5087ff836f58/se1eTd-kKUyWqMliEDE6T.jpeg)
Plans for other versions (outside of 7B and 40B)?
1
#26 opened about 1 year ago
by
flashvenom
![](https://cdn-avatars.huggingface.co/v1/production/uploads/637c621facc078d5bec14073/MOKvlABZuesOL3rVmxalE.png)
Custom 4-bit Finetuning 5-7 times faster inference than QLora
6
#25 opened about 1 year ago
by
rmihaylov
Working code with full server requirements
2
#24 opened about 1 year ago
by
gmjolt
Fine Tuning examples
4
#21 opened about 1 year ago
by
skeenan947
![](https://cdn-avatars.huggingface.co/v1/production/uploads/645922a08aa54fb020f87992/h8ERHSS6UMw88DiHLLUGY.jpeg)
Finetune wtih QLoRA please
7
#14 opened about 1 year ago
by
supercharge19
How to set trust_remote_code to true?
13
#9 opened about 1 year ago
by
gmjolt
[Bug] Does not work
58
#3 opened about 1 year ago
by
catid
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1661210778957-noauth.jpeg)
Tried to allocate 564.00 MiB (GPU 0; 7.98 GiB total capacity; 7.52 GiB already allocated; 446.00 MiB free; 7.55 GiB reserved in total by PyTorch)
1
#25 opened about 1 year ago
by
davisitoo
memory needed
2
#23 opened about 1 year ago
by
koshinryuu
CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling `cublasGemmStridedBatchedFx...
1
#20 opened about 1 year ago
by
CalumPlays
ValueError: The following `model_kwargs` are not used by the model: ['token_type_ids'] (note: typos in the generate arguments will also show up in this list)
4
#2 opened about 1 year ago
by
Imran1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62846faa99bff5076f0a93b4/QO7sgRWOXS6nlQ-GcEg94.jpeg)
Sample code error - AttributeError: module 'torch.nn.functional' has no attribute 'scaled_dot_product_attention'
1
#16 opened about 1 year ago
by
bernardogmorais
Ambiguous License
1
#15 opened about 1 year ago
by
jdpressman
Custom 4-bit Finetuning 5-7 times faster inference than QLora
#13 opened about 1 year ago
by
rmihaylov
Is it possible to generate semantic embeddings?
1
#12 opened about 1 year ago
by
michael-newsrx-com
Deployment to Amazon SageMaker - `trust_remote_code` issue
3
#10 opened about 1 year ago
by
dgallitelli
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60dc3f9f86932230e632cded/pW716zj8Pr5LLPJrruNq_.png)
Getting an error with the example code
16
#7 opened about 1 year ago
by
aviadatlas
8bit and sharded weights
3
#5 opened about 1 year ago
by
ThreeBlessings
How to quantize this model using QLoRA ?
1
#7 opened about 1 year ago
by
mrhimanshu
Error when using falcon-7b model for embeddings
1
#25 opened about 1 year ago
by
Shilpil
How to make it work for less experienced AI whisperers
17
#4 opened about 1 year ago
by
Sloba
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6432f4e34521083b9d286a48/v9bX1bMorcB7XWlmG2aUi.jpeg)
Support for LoRA?
17
#3 opened about 1 year ago
by
cekal
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63ebfd71ca08a72ba9ce6fe0/WEXOVko_Lgvq_Y8_Zlb4o.png)
Spell correction
1
#22 opened about 1 year ago
by
surajp
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5e9540e04957053f60648a0b/NWKSV4dOV1NN9Hxux-u0j.jpeg)
Is it possible to add TF Weights
3
#21 opened about 1 year ago
by
mb-data96
Add hf endpoint handler.py
1
#20 opened about 1 year ago
by
olivierdehaene
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62a093d63e7d1dda047039fc/QGpVSKuJLwl2EsiffCYML.jpeg)