tiiuae/falcon-40b · Discussions

Falcon 40B Inference at 4bit in Google Colab

pinned

27

#38 opened almost 2 years ago by

serin32

Custom 4-bit Finetuning 5-7 times faster inference than QLora

pinned

6

#25 opened almost 2 years ago by

rmihaylov

remove-extra-parentheses

#115 opened 9 months ago by

ZennyKenny

Could not locate the configuration_RW.py inside tiiuae/falcon-40b-instruct.

#114 opened 11 months ago by

cosmino

[AUTOMATED] Model Memory Requirements

#113 opened 12 months ago by

model-sizer-bot

Adding Evaluation Results

#111 opened about 1 year ago by

leaderboard-pr-bot

Could someone upload a tokenizer.model file? to allow for making ggufs

#110 opened over 1 year ago by

RonanMcGovern

Add chat_template so that it can be used for chat out-of-box

#109 opened over 1 year ago by

chujiezheng

pb when testing the model

#108 opened over 1 year ago by

louvivien

Update generation_config.json

1

#106 opened over 1 year ago by

nkasmanoff

Gradio interface

#105 opened over 1 year ago by

sequentialsystems

Optimizing Inference Time for Chat Conversations on Falcon

2

#104 opened over 1 year ago by

humza-sami

Finetuned Falcon40 is not working with pipeline (text-generation)

#103 opened over 1 year ago by

chelouche9

Advice on inference over a large-ish dataset in Databricks?

#102 opened over 1 year ago by

archonlith

Use input attention mask instead of casual mask in attention

#101 opened over 1 year ago by

CyberZHG

Inference

4

#99 opened over 1 year ago by

davidhung

Best Practice for Handling Variable-Length Sequences in Training an LLM Model on a Chatbot Dataset

#98 opened over 1 year ago by

humza-sami

Request: DOI

#97 opened over 1 year ago by

waelTalan

Getting HTTP Error Code: 422 when using Inference API

2

#96 opened over 1 year ago by

reetkat

Run falcon on Mac

2

#95 opened over 1 year ago by

corin9122

Unable to use all cores.

2

#94 opened over 1 year ago by

armx40

Bug: the model's head dimensionality is hardcoded

#93 opened over 1 year ago by deleted

Fine-tune on model response only?

1

#92 opened over 1 year ago by

mkserge

Finetuning Base Falcon on Unseen Language/New data (non instruct/RLHF)

2

#91 opened over 1 year ago by

AshBam

Slow response time for 7b and 40b

6

#89 opened over 1 year ago by

kartik99

configuration_RW.py Missing in the latest commit

#88 opened over 1 year ago by

ravikiran3690

Update README.md

2

#87 opened over 1 year ago by

FelixMildon

Falcon breaks after the second prompt of code.

#86 opened over 1 year ago by

thecowmilk

Changes in modelling_RW.py to be able to handle past_key_values for faster model generations

8

#85 opened over 1 year ago by

puru22

@TII Falcon is stunning but will you continue or is the majestic bird destined to starve ?

#84 opened over 1 year ago by

cmp-nct

Finetune Error using the notebook referred on the model page

#83 opened over 1 year ago by

hamad

Nvidia H100 Finetuning Error on BitsandBytes

2

#82 opened over 1 year ago by

ashmitbhattarai

new here, confused which .bin file to download?

#80 opened over 1 year ago by

kingofdelphi

Update generation_config.json

#77 opened over 1 year ago by

psinger

Request: DOI

#76 opened over 1 year ago by

winter6below618

Seeking insights on integrating RAG with Falcon for Domain Specific requirements

#75 opened over 1 year ago by

rahul2008d

Prevent Hallucinations

1

#74 opened over 1 year ago by

Zhaoqiong

Deployment on Azure ML

1

#73 opened over 1 year ago by

Eliahu551818

Access To Hidden States

#72 opened over 1 year ago by

DJT777

Were special tokens trained?

#71 opened over 1 year ago by

Tron2060

Example code from README output is nonsense

1

#70 opened over 1 year ago by

amitgurintecom

New language

2

#69 opened almost 2 years ago by

mindplay

GPU requirements

7

#68 opened almost 2 years ago by

GuySerk

Cuda out of memory error.

2

#67 opened almost 2 years ago by

ibrim

ValueError: The following model_kwargs are not used by the model: ['token_type_ids'] (note: typos in the generate arguments will also show up in this list)

1

#66 opened almost 2 years ago by

yiz4869

How to fine tune falcon for summarization on xsum?

1

#65 opened almost 2 years ago by

uzumakiusa

Need claritiy about the adjustable model hyperparameters

#64 opened almost 2 years ago by

Someshfengde

Update README.md

#63 opened almost 2 years ago by

Gage888

Borken docs link Use in transformers

1

#62 opened almost 2 years ago by

natika1

Hello, may I know where can I get the embeddings for falcon-40b?

3

#61 opened almost 2 years ago by

kurtgan