VocalSound

audio

A dataset for improving recognition of human vocal sounds, containing over 21,000 crowdsourced recordings of laughter, sighs, coughs, throat clearing, sneezes, and sniffs from 3,365 unique subjects. It is used for audio event classification and for recognizing human non-speech vocalizations.

Leaderboard

Qwen2.5-Omni-7B: 93.9%