Commit History
fix(config): Removes auto_map since it is not used anymore.
675aa38
verified
gugarosa
commited on
Delete modeling_phi.py
db56137
verified
gugarosa
commited on
Delete configuration_phi.py
de9f725
verified
gugarosa
commited on
Update README.md
467adac
verified
gugarosa
commited on
Delete pytorch_model.bin
474b29e
verified
gugarosa
commited on
Adding `safetensors` variant of this model (#82)
fa2a356
verified
Update LICENSE
bffd3b2
verified
gugarosa
commited on
Update README.md
349cf8b
verified
gugarosa
commited on
Update README.md
83b9c52
verified
gugarosa
commited on
Update config.json
675e8c1
verified
gugarosa
commited on
Update modeling_phi.py
34a1490
verified
gugarosa
commited on
Update README.md
59e722d
verified
gugarosa
commited on
Update modeling_phi.py
426ea90
verified
gugarosa
commited on
Update modeling_phi.py
3edb5e6
verified
gugarosa
commited on
Update modeling_phi.py
e0f03c4
verified
gugarosa
commited on
Update config.json
051d15f
verified
gugarosa
commited on
Upload modeling_phi.py
914c8fb
gugarosa
commited on
Delete Research License.docx
3a705a2
gugarosa
commited on
Upload 5 files
341a17a
gugarosa
commited on
Update config.json
41217aa
gugarosa
commited on
chore(root): Updates files to internal transformers implementation.
d3ba318
gugarosa
commited on
Update README.md
24f9ea1
gugarosa
commited on
Upload 4 files
d262514
gugarosa
commited on
Update README.md
f27cd93
gugarosa
commited on
Update README.md
80c0ba9
gugarosa
commited on
Disables inference API to prevent mismatch with HF implementation.
a286f5c
gugarosa
commited on
fix(modeling_phi): Fixes initial generation with length larger than context length.
ca573e3
gugarosa
commited on
fix(modeling_phi): Fixes cached generation when above maximum context length.
37527ba
gugarosa
commited on
Fixes exceeding maximum sequence length when using generate().
5fd430c
gugarosa
commited on
Delete modeling_mixformer_sequential.py
d212a78
gugarosa
commited on
Delete configuration_mixformer_sequential.py
8e9ebfb
gugarosa
commited on
Update to new model interface.
271c339
gugarosa
commited on
Improves type hinting on configuration arguments.
92557d0
gugarosa
commited on
Enables to toggle fused_dense, flash_rotary and attn_pdrop in the configuration.
45f4b21
gugarosa
commited on
Fixes flash-attn import with a try/except statement
0254d42
gugarosa
commited on
Adds support for flash-attn rotary embedding and fused dense layers.
0bbd68a
gugarosa
commited on
Adds support for MQA/GQA and attention mask during training.
de35f90
gugarosa
commited on
Update modeling_mixformer_sequential.py
d38e6f9
gugarosa
commited on
Upload modeling_mixformer_sequential.py
b6a7e2f
gugarosa
commited on
Upload README.md
bc09a08
gugarosa
commited on
fix(phi-1_5): Checks length of `attention_mask`if it is passed as direct tensor.
f9f2ac7
gugarosa
commited on
Support for `attention_mask` in forward pass.
3128bb6
gugarosa
commited on
Update README.md
7d482dd
suriyagunasekar
commited on
Update README.md
c8f6ad8
suriyagunasekar
commited on