Pietro Lesci
pietrolesci
AI & ML interests
Parameter- and data-efficient fine-tuning of large language models (keywords: active-learning, data subset selection, meta-learning).
pietrolesci's activity
Bias annotation
#2 opened 3 months ago by pietrolesci
Tokenizer `merges.txt` files
3
#5 opened 3 months ago by pietrolesci
Sequence "packing" logic
1
#2 opened 7 months ago by pietrolesci
Pad-only sequences from mmap'ed dataset after a certain index
#1 opened 7 months ago by pietrolesci
Add full sequences (beyond the first 64 tokens)
3
#1 opened 7 months ago by pietrolesci
Domain and provenance annotation
7
#1 opened 11 months ago by haukur
Fix swapped start and exclusive_end fields
1
#3 opened almost 2 years ago by pietrolesci
App down
#1 opened almost 2 years ago by pietrolesci
`start` and `exclusive_end` seem swapped
1
#1 opened almost 2 years ago by pietrolesci