AI

Stability AI Launches New Sound Generation AI Tool

AI startup company Stability AI launched a new sound generator called Stable Audio Open, which uses royalty-free sounds for training.

Stability AI, the startup responsible for the AI-powered art generator Stable Diffusion, has released an open AI model for generating sounds and music. The model was purportedly trained exclusively on royalty-free recordings.

The generative model, Stable Audio Open, generates a recording up to 47 seconds long by interpreting a text description (e.g., “Rock beat played in a treated studio, session drumming on an acoustic kit”). The model was trained using approximately 486,000 samples from the Free Music Archive and FreeSound, two free music libraries.

According to Stability AI, the model can generate drum beats, instrument riffs, ambient noises, and “production elements” for videos, films, and TV programs. Additionally, it can be employed to “edit” existing songs or adopt the style of one song (e.g., smooth jazz) for another.

In a post on its corporate blog, Stability AI stated that users can fine-tune the model on their custom audio data, a significant benefit of this open-source release. “For instance, a drummer could generate new beats by fine-tuning samples of their drum recordings.”

Nevertheless, Stable Audio Open has its limitations. It must be capable of composing complete compositions, melodies, or vocals, at least not high quality. Stability AI asserts that it is not optimized for this purpose and recommends that users seeking those capabilities opt for the company’s premium Stable Audio service.

Additionally, its terms of service prohibit commercial use of Stable Audio Open. Additionally, it is less effective when described in languages other than English or across musical genres and cultures, as it is subject to biases. Stability AI accuses the training data.

In a model description, Stability AI notes that the data source potentially could be more diverse and that all cultures are not equitably represented in the dataset. “The biases present in the training data will be reflected in the samples generated by the model.”

Stability AI, which has long struggled to revitalize its faltering business, recently faced controversy when its VP of generative audio, Ed Newton-Rex, resigned because he disagreed with the company’s stance that training generative AI models on copyrighted works constitutes “fair use.” Stable Audio Open seems to attempt to rewrite that narrative while simultaneously not so subtly advertising Stability AI’s paid products.

Edwin Aboyi

Edwin Aboyi is a product designer, writer, and illustrator with a degree in Biological Sciences from the University of Abuja. Passionate about merging technology with creativity, Edwin contributes to Protechbro.com by offering fresh perspectives on AI, Web3, and blockchain

Share
Published by
Edwin Aboyi

Recent Posts

Ethereum DApp Volume Surges by 83%

Although Ethereum network volumes have increased, a single decentralized application constituted 59.5% of the network's…

3 hours ago

Ethereum Foundation Email Hack

A hacker infiltrated the Ethereum Foundation's email system and sent fraudulent emails to 35,794 recipients…

3 hours ago

HashKey Launches 10M HSK Token Airdrop on Telegram

HashKey, a prominent blockchain team, plans to distribute 10 million HSK tokens through the DejenDog…

4 hours ago

Ripple Release Money from Escrow Worth 1 Billion XRP

Fintech Company for blockchain and crypto, Ripple, has recently released 1 Billion XRP from Escrow,…

5 hours ago

OpenLedger Gets Funding Worth $8M for AI Data Infrastructure

Blockchain AI Data Infrastructure company OpenLedger has received $8 Million in funding to invest in…

6 hours ago

Top 5 Token Generation Events to Drop in 2024

This is a list of the top 5 token launches to get excited about, and…

6 hours ago