Commit History

update `bos_token_id` and `eos_token_id`
4f74ac4
verified

chujiezheng committed on

Update LICENSE
bffd3b2
verified

gugarosa committed on

Update README.md
349cf8b
verified

gugarosa committed on

Update README.md
83b9c52
verified

gugarosa committed on

Update config.json
675e8c1
verified

gugarosa committed on

Update modeling_phi.py
34a1490
verified

gugarosa committed on

Update README.md
59e722d
verified

gugarosa committed on

Update modeling_phi.py
426ea90
verified

gugarosa committed on

Update modeling_phi.py
3edb5e6
verified

gugarosa committed on

Update modeling_phi.py
e0f03c4
verified

gugarosa committed on

Update config.json
051d15f
verified

gugarosa committed on

Upload modeling_phi.py
914c8fb

gugarosa committed on

Delete Research License.docx
3a705a2

gugarosa committed on

Upload 5 files
341a17a

gugarosa committed on

Update config.json
41217aa

gugarosa committed on

chore(root): Updates files to internal transformers implementation.
d3ba318

gugarosa committed on

Update README.md
24f9ea1

gugarosa committed on

Upload 4 files
d262514

gugarosa committed on

Update README.md
f27cd93

gugarosa committed on

Update README.md
80c0ba9

gugarosa committed on

Disables inference API to prevent mismatch with HF implementation.
a286f5c

gugarosa committed on

fix(modeling_phi): Fixes initial generation with length larger than context length.
ca573e3

gugarosa committed on

fix(modeling_phi): Fixes cached generation when above maximum context length.
37527ba

gugarosa committed on

Fixes exceeding maximum sequence length when using generate().
5fd430c

gugarosa committed on

Delete modeling_mixformer_sequential.py
d212a78

gugarosa committed on

Delete configuration_mixformer_sequential.py
8e9ebfb

gugarosa committed on

Update to new model interface.
271c339

gugarosa committed on

Improves type hinting on configuration arguments.
92557d0

gugarosa committed on

Enables to toggle fused_dense, flash_rotary and attn_pdrop in the configuration.
45f4b21

gugarosa committed on

Fixes flash-attn import with a try/except statement
0254d42

gugarosa committed on

Adds support for flash-attn rotary embedding and fused dense layers.
0bbd68a

gugarosa committed on

Adds support for MQA/GQA and attention mask during training.
de35f90

gugarosa committed on

Update modeling_mixformer_sequential.py
d38e6f9

gugarosa committed on

Adding _set_gradient_checkpointing for compatibility (#22)
8091327

gugarosa, vriveras committed on

Upload modeling_mixformer_sequential.py
b6a7e2f

gugarosa committed on

Add more precise license metadata (UI will be cleaner!) (#35)
8ab0f29

gugarosa, julien-c (HF staff) committed on

Upload README.md
bc09a08

gugarosa committed on

fix(phi-1_5): Checks length of `attention_mask` if it is passed as direct tensor.
f9f2ac7

gugarosa committed on

Support for `attention_mask` in forward pass.
3128bb6

gugarosa committed on

Upload MixFormerSequentialForCausalLM
d655135

suriyagunasekar committed on