theverge.com
AI Music Datasets: The Hidden Truth Revealed
Discover the shocking reality behind AI music training datasets. Millions of tracks are available, but what does this mean for artists and creators?

Unveiling the AI Music Database
The Atlantic has launched a searchable database that reveals the music used to train AI models. It covers four major datasets: two contain 12 million and 9 million tracks respectively, and the other two hold more than 100,000 songs each.
These datasets have been downloaded extensively, and major players such as Google and Stability have acknowledged using them in research. Using the datasets is not straightforward, however: developers often rely on tools that bypass platform restrictions, which raises ethical concerns about copyright and fair use.
Key points about the datasets:
- The two largest datasets contain 12 million and 9 million tracks.
- Sources include popular platforms like YouTube and Spotify.
- Ethical implications for artists and creators.