Published on 01 January 2015 |
<p>The <strong>ESC dataset</strong> is a collection of short environmental recordings available in a unified format (5-second-long clips, 44.1 kHz, single channel, Ogg Vorbis compressed @ 192 kbit/s). All clips have been extracted from public field recordings available through the <a href="http://freesound.org">Freesound.org project</a>. Please see the README files for a detailed attribution list. The dataset is available under the terms of the <a href="http://creativecommons.org/licenses/by-nc/3.0/">Creative Commons license - Attribution-NonCommercial</a>.</p><p>The dataset consists of three parts:</p><ul><li><strong><a href="https://github.com/karoldvl/ESC-50">ESC-50</a></strong>: a labeled set of 2 000 environmental recordings (50 classes, 40 clips per class),</li><li><strong><a href="https://github.com/karoldvl/ESC-10">ESC-10</a></strong>: a labeled set of 400 environmental recordings (10 classes, 40 clips per class) (this is a subset of ESC-50 - created initialy as a proof-of-concept/standardized selection of easy recordings),</li><li><strong>ESC-US</strong>: an unlabeled dataset of 250 000 environmental recordings (5-second-long clips), suitable for unsupervised pre-training.</li></ul><p>The ESC-US dataset, although not hand-annotated, includes the labels (tags) submitted by the original uploading users, which could be potentially used for weakly-supervised learning (noisy and/or missing labels). The ESC-10 and ESC-50 datasets have been prearranged into 5 uniformly sized folds so that clips extracted from the same original source recording are always contained in a single fold.</p><p>The labeled datasets are also available as GitHub projects: <a href="https://github.com/karoldvl/ESC-50">ESC-50</a> | <a href="https://github.com/karoldvl/ESC-10">ESC-10</a>.</p><p>For a more thorough description and analysis, please see the <a href="http://karol.piczak.com/papers/Piczak2015-ESC-Dataset.pdf">original paper</a> and the <a href="https://github.com/karoldvl/paper-2015-esc">supplementary IPython notebook</a>.</p><hr><p>The goal of this project is to facilitate open research initiatives in the field of environmental sound classification as publicly available datasets in this domain are still quite scarce.</p><p><strong>Acknowledgments</strong><br/>I would like to thank <a href="http://www.dtic.upf.edu/~ffont/">Frederic Font Corbera</a> for his help in using the Freesound API.</p>
Cited on 01 January 2026
Weight: 1.00
Cited on 19 July 2025
Weight: 1.79
Cited on 19 July 2025
Weight: 1.79
Cited on 05 July 2025
Weight: 1.79
Cited on 05 June 2025
Weight: 1.79
Cited on 20 May 2025
Weight: 1.79
Cited on 03 April 2025
Weight: 1.79
Cited on 28 March 2025
Weight: 1.79
Cited on 01 January 2025
Weight: 1.79
Cited on 01 January 2025
Weight: 1.79
Mentioned on 17 September 2025
Weight: 1.79
Mentioned on 15 September 2025
Weight: 1.79
Mentioned on 15 February 2025
Weight: 1.79
Mentioned on 16 January 2025
Weight: 1.79
Mentioned on 16 November 2024
Weight: 1.76
Mentioned on 16 November 2024
Weight: 1.76
Mentioned on 17 October 2024
Weight: 1.76
Mentioned on 11 October 2024
Weight: 1.76
Mentioned on 30 August 2024
Weight: 1.76
Mentioned on 30 July 2024
Weight: 1.76
Dataset Index
FAIR Score
Citations
Mentions
Publisher
Harvard Dataverse
Topic Name
Music and Audio Processing
Subfield
Signal Processing
Field
Computer Science
Domain
Physical Sciences
FT
CTw
MTw