index.xlsx: it contains a list describing the baisc information of each index folder/file (name, number of beats per measure, number of quavers per measure, and modify times)
index folder: it contains several files for a data in the POP909 dataset:
index.mid: the music midi file of the arrangement song (MELODY track for the main melody, BRIDGE track for the sub-melody, and PIANO track for the accompaniment)
beat_audio/beat_midi.txt: the extracted beat information from the raw audio/midi, the first column is the time (in sec), and the seconcd column is the beat order
chord_audio/beat_audio.txt: the extracted chord information from the raw audio/midi, the first/second column is the start/end time (in sec), and the third column is the chord name
key_audio.txt: the extracted key change information from the raw audio, the first/second column is the start/end time (in sec), and the third column is the key change.
versions folder: it contains different versions of the same arrangement song.

Data Processing Script

We also provide scripts for the data processing. It will allow you to quickly process the POP909 Files (Midi) into the Google Magenta's music representation as like Music Transformer and Performance RNN.

data_process.ipynb: follow this jupyter notebook, you will get the data input tokens that are able to be fed into the pytorch/tensorflow dataset/dataloader.(Notice that the representation of encoding the midi sequence are various {e.g., monophonic note tokens, magenta's event tokens, pianoroll, etcs}. We highly recommend users to create their own data processing files to encode the data in their wanted format)
pop-pickle.zip: it contains the pickle file, already in magenta's event tokens representation

Credit

Please cite this work if you want to use this dataset

@inproceedings{pop909-ismir2020,
    author = {Ziyu Wang* and Ke Chen* and Junyan Jiang and Yiyi Zhang and Maoran Xu and Shuqi Dai and Guxian Bin and Gus Xia},
    title = {POP909: A Pop-song Dataset for Music Arrangement Generation},
    booktitle = {Proceedings of 21st International Conference on Music Information Retrieval, {ISMIR}},
    year = {2020}
}

使用案例

mmdet3d纯视觉baseline之数据准备：处理waymo dataset v1.3.1

在waymo上测纯视觉baseline（多相机模式），分很多步：处理数据集为kitti格式修改dataloader代码修改模型config 修改模型target和loss 修改eval pipeline的代码 mmdet3d官网的waymo dataset教程过于简略，处理的结果只能给pointpillar用，而且是旧版的数据集。对初学者的我非常不友好。下面基于mmdet的教程（以下简称教程
FDDB人脸数据集dataset的dataset数据集的制作

FDDB为图片多人脸目标检测数据集，本文根据vocdataset 进行改编，将FDDB数据集进行分割，并进行图像预处理，翻转，随机裁剪等数据集增强相关的预处理。如有转载请附本文链接：https://blog.csdn.net/canmang1/article/details/108487673 # 每个标注的椭圆形人脸由六个元素组成。 # （ra, rb, Θ, cx, cy, s） # r
Java Dataset.withColumn方法代码示例

import org.apache.spark.sql.Dataset; //导入方法依赖的package包/类 private void start() { Dataset householdDf = getHouseholdDataframe(); Dataset populationDf = getPopulationDataframe(); Dataset indexDf = joinHo
tensorflow dataset.shuffle dataset.batch dataset.repeat 理解注意点

batch很好理解，就是batch size。注意在一个epoch中最后一个batch大小可能小于等于batch size dataset.repeat就是俗称epoch，但在tf中与dataset.shuffle的使用顺序可能会导致个epoch的混合 dataset.shuffle就是说维持一个buffer size 大小的 shuffle buffer，图中所需的每个样本从shuffle
pytorch学习之：常用的数据集处理方法（Dataset）和数据采样策略（Sampler）

数据集处理方法小批量数据 & 为数据添加随机噪声使用小部分的数据：在做实验的时候，有时候我们想用一小部分数据来先跑通代码，然后再上大量的数据为 Dataset 中的图片数据添加高斯噪声 """ @file: codes.py @Time : 2023/1/12 @Author : Peinuan qin """ import numpy as np import torch
yolov7的dataset代码详解

yolov7的数据增强中采用了很多yolov5没有用过的增强，比如mosaic、mosaic9、mixup、copy_paste、paste_in等，这些数据增强很占显存，训练的时候可以把一些数据增强关了，我把mosaic9注释了，以下是yolov7的dataset代码详解。 # Dataset utils and dataloaders import glob import logging i

POP909-Dataset

POP909 Dataset for Music Arrangement Generation

Dataset Zip File Structure

Data Processing Script

Credit

同类工具

相关阅读

相关文章

相关问答

相关文档