## 2.0.0
* Add support for gradient checkpointing and FullyShardedDataParallel (FSDP)
* Model releases
  * (CLIP ViT-L-14 / MPT-1B)
  * (CLIP ViT-L-14 / MPT-1B Dolly)
  * (CLIP ViT-L-14 / RedPajama-3B)
  * (CLIP ViT-L-14 / RedPajama-3B Instruct)
  * (CLIP ViT-L-14 / MPT-7B)
* Remove color jitter when training
* Fix cross-attention bug when calling generate()
## 1.0.0
* Initial code release
* Early model release (CLIP ViT-L-14 / LLaMA-7B)