File size: 1,527 Bytes
a93da10
 
 
 
 
 
 
 
 
69276b8
 
 
 
 
 
 
 
 
 
 
 
e801883
 
 
69276b8
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
---
title: README
emoji: 🌍
colorFrom: red
colorTo: red
sdk: static
pinned: false
---

# ALM: Audio Language and Multimodal

ALM is a collaborative research group focused on deep learning for audio, language, and multimodal data. 

### About Us

- **Alkis Koudounas** - PhD Student at Politecnico di Torino ([Profile](https://huggingface.co/alkiskoudounas) | [polito.it](https://www.polito.it))
- **Lorenzo Vaiani** - PhD Student at Politecnico di Torino ([Profile](https://huggingface.co/VaianiLorenzo) | [polito.it](https://www.polito.it))
- **Moreno La Quatra** - Research Fellow at Kore University of Enna ([Profile](https://huggingface.co/morenolq) | [unikore.it](https://www.unikore.it))

### Projects

- **ARCH** - [Audio Representation Benchmark](https://huggingface.co/spaces/ALM/ARCH) ([Repo](https://github.com/MorenoLaQuatra/ARCH)): A platform dedicated to benchmarking models for audio representations. [Resaerch Paper](https://huggingface.co/papers/2405.00934)
- **CALM** - [Contrastive Alignment of Language and Music](https://github.com/ALM-LAB/CALM): A project from the 1st Sound of AI Hackathon. CALM aligns songs with natural language descriptions, enabling music searches via text, voice, or facial expressions.
- **PACE** - [Podcast AI for Chapters and Episodes](https://github.com/ALM-LAB/PACE): PACE is a semantic search engine for podcasts. It enables users to search for specific parts of a podcast using natural language. The project was created for the AssemblyAI 50K Hackathon - Winter 2022.
---