🎵 Audio & Music AI Tools

Generate and edit audio, music, and speech

12 tools found

All 💬 Chat & Assistant 🎨 Image Generation 🎬 Video Generation 🎵 Audio & Music 💻 Code & Development ⚡ Productivity ✍️ Writing & Content 🖌️ Design & Creative 📈 Marketing & SEO 📊 Data & Analytics 📚 Education 🔧 Other

Suno

audio

Featured

AI music generation platform that creates complete original songs with lyrics, vocals, and instrumentation from simple text prompts spanning virtually any genre or style. Its ability to produce radio-ready tracks in seconds has democratized music creation for hobbyists, content creators, and musicians looking to rapidly iterate on musical ideas.

musicsong-generationlyrics

ElevenLabs

audio

FeaturedPromo

Industry-leading AI voice synthesis platform offering ultra-realistic text-to-speech, voice cloning, and multilingual narration across more than 30 languages. Its lifelike voices and precise emotional control have made it the standard for audiobook production, content localization, and accessibility applications worldwide.

voicettscloning

Udio

audio

New

AI music generation platform created by former Google DeepMind researchers, capable of producing high-quality, full-length songs with coherent structure across an impressively diverse range of genres. Its audio quality and musical coherence set a new bar for AI-generated music, making it a strong alternative to Suno for serious music creators.

musicgenerationgenres

Mubert

audio

AI-powered music platform specializing in generating royalty-free background music for content creators, streamers, and video producers. It offers both AI generation and curated human-made tracks with flexible licensing, ensuring creators can find the perfect soundtrack without worrying about copyright claims on their content.

royalty-freebackgroundstreaming

OpenAI Whisper

audio

Free

Open-source automatic speech recognition system from OpenAI that delivers high-accuracy transcription across dozens of languages, including support for translation to English. Its local-first architecture means it can run entirely offline on consumer hardware, making it the go-to choice for privacy-sensitive transcription applications and developers building speech-enabled products.

speech-to-texttranscriptionopen-source

Notta

audio

AI meeting assistant that records, transcribes, and summarizes conversations in real time, with integrations for popular video conferencing platforms like Zoom, Teams, and Google Meet. It helps teams capture meeting outcomes, action items, and decisions without manual note-taking, boosting productivity for distributed and hybrid work environments.

transcriptionmeetingssummarization

Stable Audio

audio

Stability AI's audio generation tool for creating professional-quality sound effects, music loops, ambient textures, and instrument stems from text prompts. It is designed for content creators, game developers, and video producers who need high-quality, customizable audio assets without licensing fees or expensive studio recording sessions.

sound-effectsloopsstability-ai

AIVA

audio

AI music composer specializing in classical, orchestral, and cinematic score composition with full MIDI and sheet music notation export for use in professional production workflows. It is unique among AI music tools in its focus on traditional composition formats, making it a valuable tool for film composers, game music producers, and arrangers who need AI-assisted composition that integrates with their existing digital audio workstation setup.

classicalorchestralcomposition

Moises.ai

audio

AI-powered audio platform that offers precise stem separation (splitting songs into vocals, drums, bass, and other instruments), practice tools like metronome and vocal removal, and automatic key and tempo detection. It is essential for musicians learning songs by ear, DJs preparing remixes, and producers looking to isolate tracks for sampling and mashups.

stem-separationpracticeremixing

Google Lyria 3 Pro

audio

New

Google DeepMind's flagship AI music model capable of generating complete songs up to 3 minutes with explicit control over structure, arrangement, and instrumentation, plus innovative image-to-music capabilities. It includes SynthID watermarking for responsible AI use, making it suitable for both creative professionals exploring AI-assisted composition and platforms needing safe, attributable AI-generated music.

music-generationstructural-controlgoogle-deepmind

Soundraw

audio

AI music generator designed for content creators who need customizable royalty-free background tracks with granular controls over mood, tempo, energy level, and instrumentation. Its licensing model gives creators peace of mind for monetized content, and its detailed controls allow fine-tuning tracks to match the exact pacing and emotion of videos and streams.

royalty-freebackground-musiccustomizable

Resemble AI

audio

New

Advanced voice AI platform offering voice cloning, text-to-speech, speech-to-speech conversion, and emotional expression control across 120+ languages, with the open-source Chatterbox model for zero-shot voice cloning. Its combination of commercial-grade tools and open-source offerings makes it a versatile choice for developers, content creators, and enterprises building voice-enabled products.

voice-cloningttsopen-source