Published on 01 January 2015 |

Version 2.0

ESC: Dataset for Environmental Sound Classification

View Dataset
J. Piczak, Karol

Description

<p>The <strong>ESC dataset</strong> is a collection of short environmental recordings available in a unified format (5-second-long clips, 44.1 kHz, single channel, Ogg Vorbis compressed @ 192 kbit/s). All clips have been extracted from public field recordings available through the <a href="http://freesound.org">Freesound.org project</a>. Please see the README files for a detailed attribution list. The dataset is available under the terms of the <a href="http://creativecommons.org/licenses/by-nc/3.0/">Creative Commons license - Attribution-NonCommercial</a>.</p><p>The dataset consists of three parts:</p><ul><li><strong><a href="https://github.com/karoldvl/ESC-50">ESC-50</a></strong>: a labeled set of 2 000 environmental recordings (50 classes, 40 clips per class),</li><li><strong><a href="https://github.com/karoldvl/ESC-10">ESC-10</a></strong>: a labeled set of 400 environmental recordings (10 classes, 40 clips per class) (this is a subset of ESC-50 - created initialy as a proof-of-concept/standardized selection of easy recordings),</li><li><strong>ESC-US</strong>: an unlabeled dataset of 250 000 environmental recordings (5-second-long clips), suitable for unsupervised pre-training.</li></ul><p>The ESC-US dataset, although not hand-annotated, includes the labels (tags) submitted by the original uploading users, which could be potentially used for weakly-supervised learning (noisy and/or missing labels). The ESC-10 and ESC-50 datasets have been prearranged into 5 uniformly sized folds so that clips extracted from the same original source recording are always contained in a single fold.</p><p>The labeled datasets are also available as GitHub projects: <a href="https://github.com/karoldvl/ESC-50">ESC-50</a> | <a href="https://github.com/karoldvl/ESC-10">ESC-10</a>.</p><p>For a more thorough description and analysis, please see the <a href="http://karol.piczak.com/papers/Piczak2015-ESC-Dataset.pdf">original paper</a> and the <a href="https://github.com/karoldvl/paper-2015-esc">supplementary IPython notebook</a>.</p><hr><p>The goal of this project is to facilitate open research initiatives in the field of environmental sound classification as publicly available datasets in this domain are still quite scarce.</p><p><strong>Acknowledgments</strong><br/>I would like to thank <a href="http://www.dtic.upf.edu/~ffont/">Frederic Font Corbera</a> for his help in using the Freesound API.</p>

Citations (171)

Mentions (160)

Metrics

Dataset Index

185.3

FAIR Score

15%

Citations

171

Mentions

160

Metrics Over Time

Publication Details

Publisher

Harvard Dataverse

Assigned Domain

Topic Name

Music and Audio Processing

Subfield

Signal Processing

Field

Computer Science

Domain

Physical Sciences

Keywords

Computer and Information Scienceenvironmental soundclassificationdataset

Normalization Factors

FT

13.46

CTw

1.00

MTw

1.00