omnizart vocal¶

Lists the detailed available options of each sub-commands.

transcribe¶

Transcribe a single audio and output as a MIDI file.

This will output a MIDI file with the same name as the given audio, except the extension will be replaced with ‘.mid’.

omnizart vocal transcribe [OPTIONS] INPUT_AUDIO

Options

-m, --model-path <model_path>¶: Path to the pre-trained model or the supported transcription mode.

-o, --output <output>¶

Path to output the prediction file (could be MIDI, CSV, …, etc.)

Arguments

Extract the feature of the whole dataset for training.

omnizart vocal generate-feature [OPTIONS]

Options

-d, --dataset-path <dataset_path>¶: Required Path to the downloaded dataset

-o, --output-path <output_path>¶: Path for saving the extracted feature. Default to the folder under the dataset.

-n, --num-threads <num_threads>¶

Number of threads used for parallel feature extraction.

Train a new model or continue to train on a pre-trained model

omnizart vocal train-model [OPTIONS]

Options

-d, --feature-path <feature_path>¶: Required Path to the folder of extracted feature

-i, --input-model <input_model>¶: If given, the training will continue to fine-tune the pre-trained model.

--early-stop <early_stop>¶: Stop the training if validation accuracy does not improve over the given number of epochs.