MuseNet is a deep neural network developed by OpenAI. It can compose four-minute pieces of music with ten different instruments and can blend a wide range of musical styles, from country to Mozart to the Beatles. It uses the same general-purpose unsupervised technology as GPT-2: a large-scale transformer model trained to predict the next token in a sequence, whether that sequence is audio or text. The model is trained on data from MIDI files and, given a prompt to start from, can generate samples in a chosen style. To give the model more context, MuseNet uses several kinds of embeddings, including positional embeddings, a time embedding, and structural embeddings.
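
To make the role of the embeddings more concrete, the following is a minimal sketch in Python of how several embeddings might be combined into one input vector per token. It is not MuseNet's actual code. The table sizes, the vector width `DIM`, the names `make_table` and `embed`, and the choice to sum the embeddings element-wise (a common convention in transformer models) are all assumptions made for illustration, and random values stand in for learned parameters.

```python
import random

random.seed(0)
DIM = 8  # hypothetical embedding width, chosen only for illustration


def make_table(rows, dim=DIM):
    """Random lookup table standing in for a learned embedding matrix."""
    return [[random.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(rows)]


# Hypothetical table sizes; the real model's sizes are not given in the text.
token_table = make_table(16)      # one row per note/event token
position_table = make_table(32)   # one row per index in the sequence
time_table = make_table(32)       # one row per elapsed time step
structure_table = make_table(4)   # one row per coarse section of the piece


def embed(tokens, time_steps, sections):
    """Return one input vector per token: the element-wise sum of its
    token, positional, time, and structural embeddings (assumed scheme)."""
    vectors = []
    for pos, (tok, t, sec) in enumerate(zip(tokens, time_steps, sections)):
        rows = (token_table[tok], position_table[pos],
                time_table[t], structure_table[sec])
        vectors.append([sum(vals) for vals in zip(*rows)])
    return vectors


tokens = [3, 7, 9, 5]        # made-up token ids
time_steps = [0, 0, 0, 1]    # the first three tokens share a time step
sections = [0, 0, 0, 0]      # all tokens fall in the same section
inputs = embed(tokens, time_steps, sections)
print(len(inputs), len(inputs[0]))
```

In this sketch, tokens that share a time step receive the same time embedding even though their positional embeddings differ, which is one way extra context about timing could reach the model alongside the token itself.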

