# ๐ŸŽต SongEval: A Benchmark Dataset for Song Aesthetics Evaluation [![Hugging Face Dataset](https://img.shields.io/badge/HuggingFace-Dataset-blue)](https://huggingface.co/datasets/ASLP-lab/SongEval) [![Arxiv Paper](https://img.shields.io/badge/arXiv-Paper-.svg)](https://arxiv.org/pdf/2505.10793) [![License: CC BY-NC-SA 4.0](https://img.shields.io/badge/License-CC%20BY--NC--SA%204.0-lightgrey.svg)](https://creativecommons.org/licenses/by-nc-sa/4.0/) This repository provides a **trained aesthetic evaluation toolkit** based on [SongEval](https://huggingface.co/datasets/ASLP-lab/SongEval), the first large-scale, open-source dataset for human-perceived song aesthetics. The toolkit enables **automatic scoring of generated song** across five perceptual aesthetic dimensions aligned with professional musician judgments. --- ## ๐ŸŒŸ Key Features - ๐Ÿง  **Pretrained neural models** for perceptual aesthetic evaluation - ๐ŸŽผ Predicts **five aesthetic dimensions**: - Overall Coherence - Memorability - Naturalness of Vocal Breathing and Phrasing - Clarity of Song Structure - Overall Musicality - ๐ŸŽง Accepts **full-length songs** (vocals + accompaniment) as input - โš™๏ธ Simple inference interface --- ## ๐Ÿ“ฆ Installation Clone the repository and install dependencies: ```bash git clone https://github.com/ASLP-lab/SongEval.git cd SongEval pip install -r requirements.txt ``` ## ๐Ÿš€ Quick Start - Evaluate a single audio file: ```bash python eval.py -i /path/to/audio.mp3 -o /path/to/output ``` - Evaluate a list of audio files: ```bash python eval.py -i /path/to/audio_list.txt -o /path/to/output ``` - Evaluate all audio files in a directory: ```bash python eval.py -i /path/to/audio_directory -o /path/to/output ``` - Force evaluation on CPU (โš ๏ธ CPU evaluation may be significantly slower) : ```bash python eval.py -i /path/to/audio.wav -o /path/to/output --use_cpu True ``` ## ๐Ÿ™ Acknowledgement This project is mainly organized by the audio, speech and language processing lab [(ASLP@NPU)](http://www.npu-aslp.org/). We sincerely thank the **Shanghai Conservatory of Music** for their expert guidance on music theory, aesthetics, and annotation design. Meanwhile, we thank AISHELL to help with the orgnization of the song annotations.

Shanghai Conservatory of Music Logo

## ๐Ÿ“‘ License This project is released under the CC BY-NC-SA 4.0 license. You are free to use, modify, and build upon it for non-commercial purposes, with attribution. ## ๐Ÿ“š Citation If you use this toolkit or the SongEval dataset, please cite the following: ``` @article{yao2025songeval, title = {SongEval: A Benchmark Dataset for Song Aesthetics Evaluation}, author = {Yao, Jixun and Ma, Guobin and Xue, Huixin and Chen, Huakang and Hao, Chunbo and Jiang, Yuepeng and Liu, Haohe and Yuan, Ruibin and Xu, Jin and Xue, Wei and others}, journal = {arXiv preprint arXiv:2505.10793}, year={2025} } ```