We have released the Sound Stock Music Genres Taxonomy publicly on Hugging Face:
https://huggingface.co/datasets/SoundStock/soundstock.com-music-genres-taxonomy
This is a large, structured, and extensible genre taxonomy designed for music tagging, classification, search, recommendation systems, and modern audio / music ML workflows.
The dataset provides a hierarchical view of music genres — including root genres, subgenres, and expanded variants (style, era, region, and fusion) — with stable, deterministic IDs suitable for long-term production use.
Dataset highlights:
• 1,600+ genres
• Root → subgenre → expanded variant hierarchy
• Stable genre_id values
• CSV and JSONL formats for analytics and ML pipelines
This taxonomy is already used internally across Sound Stock for search, tagging, recommendation, and AI-driven music generation — and we’re excited to make it available to the broader music-tech and ML community.
If you’re building MIR systems, DAW tools, music libraries, or audio ML pipelines, this dataset is designed to be a practical, production-ready foundation.
More open datasets and infrastructure components will follow.