Tasks Compare Workflows AI News Tools

Submit ToolSubmit

find AI list

Search by task, compare top tools, and use proven workflows to choose the right AI tool faster.

Platform

Tasks
Tools
Compare
Alternatives
Workflows
Reports
Personas
Roles
Stacks
Models
Agents
AI News

Company

About
Blog
FAQ
Contact
Editorial Policy
Privacy
Terms

Contribute

Submit Tool
Manage Tool
Request Tool

Stay Updated

Get new tools, workflows, and AI updates in your inbox.

© 2026 findAIList. All rights reserved.

Privacy Policy Terms of Service Editorial Policy Refund Policy

AudioSet | findAIList | Find AI List

Home/Tasks/Science & Healthcare/More & General/Sound Classification/AudioSet

AudioSet

4.6

Free

Quick Tool Decision

Should you use AudioSet?

A large-scale dataset of manually annotated audio events.

Category

Audio Analysis

Setup effort

medium

Pricing

Free

Data confidence: release and verification fields are source-audited when available; other summary fields are community-aggregated.

Visit Tool Website Open Detailed Profile

Overview FAQ Pricing

Overview

AudioSet is a large-scale dataset of manually annotated audio events, designed to provide a common evaluation task for audio event detection and a starting point for a comprehensive vocabulary of sound events. It consists of an expanding ontology of 632 audio event classes and a collection of over 2 million human-labeled 10-second sound clips drawn from YouTube videos. The ontology is structured as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments, and common environmental sounds. The data collection process involves human annotators verifying the presence of sounds within YouTube segments nominated based on metadata and content-based search. Machine-extracted features are available for download alongside the dataset, facilitating machine learning model training and evaluation.

Common tasks

Audio Event Detection Sound Classification Acoustic Scene Understanding

FAQ

What is AudioSet?

AudioSet is a large-scale dataset of manually annotated audio events, designed to provide a common evaluation task for audio event detection.

How many audio event classes are in the AudioSet ontology?

The ontology consists of 632 audio event classes organized as a hierarchical graph.

Where does the audio data come from?

The audio clips are drawn from YouTube videos.

Are there pre-computed audio features available?

Yes, machine-extracted features are available for download.

FAQ+-

What is AudioSet?

AudioSet is a large-scale dataset of manually annotated audio events, designed to provide a common evaluation task for audio event detection.

How many audio event classes are in the AudioSet ontology?

The ontology consists of 632 audio event classes organized as a hierarchical graph.

Where does the audio data come from?

The audio clips are drawn from YouTube videos.

Are there pre-computed audio features available?

Yes, machine-extracted features are available for download.

Pricing

Free

Free

$0

Pros & Cons

Pros

- Large-scale dataset
- Comprehensive ontology
- Human-verified labels

Cons

- YouTube sourced, potential bias
- Requires significant computational resources
- Labeling can still have errors

More tools from Research

Company grouping is inferred from website domain and may improve as structured company data is enriched.

Gemini for Google Workspace (formerly Duet AI)

Transform raw data into structured insights using generative AI and natural language processing within Google Sheets.

Pricing: Paid

Google Drive

The intelligent cloud storage backbone for seamless collaboration and AI-driven file management.

Pricing: Freemium

Gemini for Google Workspace

Integrated generative AI across Docs, Gmail, Sheets, and Slides for enterprise productivity.

Pricing: Paid

Google Meet

Secure, high-performance video communication integrated with Gemini AI for the modern enterprise.

Pricing: Freemium

Google Virtual Try-On (VTO)

Photorealistic generative AI that drapes garments on diverse human models in real-time.

Pricing: Freemium

Google Docs

The AI-augmented collaborative document workspace for real-time synthesis and knowledge orchestration.

Pricing: Freemium

Reviews & Ratings

Share your experience, and users can reply directly under each review.

Reviews load as you scroll.

Need advanced specs, integrations, implementation notes, and deeper comparisons? Open the Detailed Profile.

Free

Model not listed