Tag: audio

Filtered selection of tools tagged audio.

Adobe Enhance Speech

Adobe Enhance Speech is an AI tool for automatically improving spoken audio. It reduces common recording problems such as room echo, background noise, muffled voice quality, and uneven vocal presence, with the goal of turning simple recordings into clearer, podcast-like speech tracks. It is especially useful when audio is recorded outside a studio, using a laptop, headset, phone, or USB microphone in changing conditions.

Audio Freemium

OBS Studio

OBS Studio is a video and production tool for open-source streaming and screen recording for live productions, tutorials, and events.

Video Free

Soundtrap

Soundtrap is a audio and music tool for browser-based music production and audio collaboration for songs, podcasts, and education.

Audio Subscription

Acapela Group

Acapela Group is a leading provider of Text-to-Speech (TTS) solutions that offers natural and expressive voices for a variety of applications. The technology enables the conversion of written text into high-quality, understandable speech recordings that are used in various industries such as education, telecommunications, accessibility, and entertainment. Acapela Group places a strong emphasis on individual adaptations and multilingual options to meet the needs of different users.

AI Plan-based

Acast

Acast is an innovative platform that specializes in hosting, monetizing, and analyzing podcasts. By utilizing modern technologies, including AI-powered tools, Acast enables podcasters to efficiently manage and make their content accessible to a wide audience. The platform supports both beginners and experienced podcasters and offers a range of features around audio content.

AI Plan-based

Adobe Premiere Pro

Professional video editor for editing, color, audio, captions, and post-production workflows.

Audio & Video Subscription

Alitu

Alitu is a KI-powered tool designed specifically for podcasters to simplify the recording and editing process. It automates many technical steps that are typically time-consuming, allowing users without extensive audio expertise to create professional podcasts. Alitu is particularly helpful for cleaning up, cutting, and adding music or effects to audio files without requiring complex software.

AI Subscription

Amazon Transcribe

Amazon Transcribe is Amazon Web Services' automatic speech recognition service for turning audio and video into text. It is used for meeting notes, media transcripts, contact-center analysis, subtitles, research interviews and internal documentation. The service is especially relevant for teams that already store files in AWS or want transcription to become part of a larger processing pipeline rather than a standalone manual task.

AI Usage-based

Auphonic

Auphonic is a AI-powered tool for automated audio production and optimization. It helps users to quickly improve, transcribe, and prepare audio and video files for various platforms. Auphonic is particularly suitable for podcasters, journalists, content creators, and anyone who values high-quality sound without spending a lot of time on manual editing.

AI Plan-based

Cleanvoice AI

Cleanvoice AI is an intelligent audio tool designed to automate and simplify post-production of audio recordings. It uses artificial intelligence to automatically detect and remove unwanted elements such as filler words, background noise, and other imperfections in audio recordings. This helps to create professional-sounding audio files more quickly and efficiently without the need for extensive manual editing.

Audio Plan-based

Descript Overdub

Descript voice workflow for voice cloning, speech repair, and text-based audio editing.

Audio & Video Subscription

Filmora

Filmora is a video and production tool for accessible video editing for creators, tutorials, social clips, and simple productions.

Audio & Video Plan-based

Hindenburg Journalist

Specialized audio editing software for journalists, podcasters, and radio professionals, with an emphasis on ease of use, automation, and a streamlined production workflow.

AI Plan-based

IBM Watson Text to Speech

A cloud-based text-to-speech service that turns written text into natural-sounding speech, supports multiple languages and voices, and helps teams build accessible, interactive applications.

Productivity Plan-based

LANDR

LANDR is a audio and music tool for mastering, music distribution, and audio workflows for independent musicians and creators.

AI One-time purchase

Soundraw

Soundraw is an AI music composition tool for creating and adapting tracks quickly for videos, podcasts, and other creative projects.

Audio Plan-based

Wispr Flow

Wispr Flow is an AI dictation tool for fast voice-first writing in apps, documents, chats, and workflows.

Audio Freemium

Ableton Live

Ableton Live is a digital audio workstation for people who do not only record music linearly, but work with loops, clips, MIDI ideas, sound design, and stage setups. It is especially strong when a sketch needs to become a playable arrangement quickly.

Audio Plan-based

Adobe Podcast

Adobe Podcast is an innovative platform designed specifically for podcasters and audio producers to simplify the recording, editing, and transcription of audio content. Featuring integrated AI-powered functions, Adobe Podcast helps create and publish professional podcasts more efficiently. Its freemium model allows users to test basic features for free and access advanced functions if needed.

Audio Freemium

Amazon Polly

Amazon Polly is a cloud-based service from Amazon Web Services (AWS) that converts text into naturally sounding speech. With advanced artificial intelligence, Polly produces realistic speech outputs from text, which can be used in various applications such as customer service, e-learning, audiobooks, or automation solutions. The API allows for easy integration into different systems and supports many languages and voices.

AI Usage-based

Anchor

A podcast hosting and distribution tool for creators who want to record, publish, and track episodes with minimal technical overhead.

AI Freemium

AssemblyAI

AssemblyAI is a powerful platform for automatic speech recognition (ASR) and speech processing, primarily developed for developers and enterprises. It offers advanced AI-based transcription services that quickly convert audio and video files into text. The API of AssemblyAI enables easy integration into various applications to efficiently analyze and process speech data.

Audio

AudioMaster

AudioMaster is a versatile audio software tool specifically designed for mastering and editing audio files. With a user-friendly interface and mobile use options, the tool is aimed at musicians, producers, and audio enthusiasts who want to improve their sound quality quickly and effectively. Whether on the go or in the studio, AudioMaster offers a wide range of functions that make professional results possible even without in-depth technical knowledge.

Audio Plan-based

Audiotool

Audiotool is a browser-based music production platform that allows users to create, edit, and publish electronic music directly in the web. Without software installation, Audiotool offers a comprehensive collection of virtual instruments, effects, and mixer tools that are both appealing to beginners and experienced producers. The platform supports collaborative work and direct exchange of projects in the community.

Audio Freemium

Audo

Audo is an audio tool for voice enhancement, noise reduction, and clearer recordings in content workflows.

AI Freemium

Bitwig Studio

Bitwig Studio is a modern digital audio workstation (DAW) that is known for its flexibility and extensive creative possibilities. Developed for musicians, producers, and sound designers, Bitwig Studio offers a modular environment for music production that provides numerous tools for both beginners and professionals. With an intuitive user interface and innovative features, Bitwig Studio supports the implementation of ideas in all music styles.

Audio Plan-based

Boomy

Boomy is a audio and music tool for AI music generation for quick song sketches, background music, and creative audio experiments.

AI Freemium

Buzzsprout

Buzzsprout is a user-friendly podcast hosting platform that allows users to easily publish, manage, and distribute their podcasts. With a clear interface and automated tools, Buzzsprout helps podcasters get their content online and available on various platforms. The platform is suitable for both beginners and experienced podcasters who prioritize ease of use and reliable hosting.

AI Freemium

Deepgram

Deepgram is a cloud-based platform for automatic speech recognition and transcription. With the latest algorithms, Deepgram enables the conversion of audio and video content into searchable text - precise, fast, and scalable. The solution is primarily aimed at developers and enterprises who want to integrate speech recognition into their applications, and offers flexible APIs and SDKs.

AI Plan-based

Descript Studio Sound

Descript Studio Sound is an AI speech enhancement feature inside the Descript production workflow. It is designed to make voices sound clearer, closer, and more professional by reducing noise, room echo, muffled microphone quality, and uneven levels. Its practical value is that everyday recordings can become usable much faster, without rebuilding every track through a manual chain of audio plugins.

Audio Plan-based

Ecrett Music

Ecrett Music generates licensable background music for videos, games, presentations, and content projects.

Audio Plan-based

ElevenLabs

ElevenLabs is a cutting-edge AI-based audio platform specializing in the creation and editing of speech content. With modern text-to-speech technologies, ElevenLabs enables natural and expressive speech synthesis that can be used in various applications. The platform offers both a free entry-level version and paid plans with enhanced features.

Audio Freemium

FabFilter Pro-L 2

A professional limiter for mastering and final loudness control, with transparent signal processing, detailed metering, and flexible limiting modes for music production and audio post-production.

Audio One-time purchase

FL Studio

FL Studio is a audio and music tool for DAW for beatmaking, electronic music, recording, and full music production.

Audio Plan-based

Fliki

Fliki is an innovative AI tool designed specifically for creating videos and podcasts from text content. With the help of artificial intelligence, Fliki transforms text into engaging audiovisual media suitable for marketing, education, or social media. The platform offers an intuitive user interface and a wide range of customization options to quickly and efficiently produce content.

AI Freemium

Google Cloud Text-to-Speech

Google Cloud Text-to-Speech is a powerful AI-based service that converts written text into naturally sounding speech. It uses advanced Deep Learning models to provide a wide range of voices and languages suitable for applications in audiobooks, speech assistants, learning programs, and more. With flexible customization options and a user-friendly API, this service is ideal for developers and businesses looking to create high-quality audio content automatically.

AI Freemium

HeyGen

HeyGen is a practical tool for creating AI avatar videos, localizing video content, and producing synthetic presentations for marketing, training, support, and internal communication.

AI Freemium

IBM Watson Speech to Text

A cloud-based speech recognition service that converts audio into text with support for real-time and batch transcription, multiple languages, speaker identification, and API integration.

Productivity Usage-based

iSpeech

iSpeech is an AI-powered speech processing platform for text-to-speech and speech-to-text workflows, with APIs for integrating voice features into websites, apps, and business systems.

AI Plan-based

iZotope Ozone

iZotope Ozone is professional audio mastering software that uses AI-powered technologies to simplify and optimize the mastering process. With a broad set of tools and intelligent algorithms, it helps music producers, sound engineers, and creators take their sound to a new level, whether in the studio or on the go.

AI Subscription

Krisp

AI-powered audio software that removes background noise in real time for calls, video meetings, and recordings, with support for major communication tools and local processing for privacy.

Audio Freemium

Libsyn

Libsyn is an established podcast hosting platform focused on easy distribution and monetization of audio content, with tools for managing, publishing, and analyzing podcasts.

AI Subscription

Loudly

Loudly is a audio and music tool for AI music, soundtracks, and licensable audio variants for content production.

AI Plan-based

LoudMax

LoudMax is a free audio limiter designed specifically for mastering and adjusting the loudness of music and audio content. The plugin allows you to significantly boost the volume of an audio signal without audible distortion or quality loss. With its simple interface and efficient processing, LoudMax is a popular choice for musicians, producers, and audio engineers seeking a fast and reliable solution for volume optimization.

Audio Free

MeldaProduction MLimiter

A powerful, versatile limiter plugin for audio mastering, designed to maximize loudness while preserving clarity and control. It offers a user-friendly interface, detailed dynamics control, and a free version that makes it accessible for both beginners and experienced producers.

Audio Free

Microsoft Azure Cognitive Services - Text to Speech

Microsoft Azure Cognitive Services - Text to Speech is a powerful cloud-based service that converts written text into natural-sounding speech. With a wide range of voices, languages, and customization options, this service is suitable for applications in areas such as accessibility, customer service, e-learning, and more. Integration is handled through an API, offering flexible deployment options across a variety of software solutions.

Audio Usage-based

Microsoft Azure Speech Service

Microsoft Azure Speech Service is a cloud-based speech processing platform for transcription, text-to-speech, translation, and speech understanding. It supports a wide range of use cases for customer service, media, education, and workflow automation.

AI Usage-based

Microsoft Azure Speech to Text

Microsoft Azure Speech to Text is a cloud-based service that converts spoken language into text. It is suitable for meeting transcription, app integration, accessibility, and productivity workflows, with support for real-time and batch transcription, speaker identification, and customizable speech models.

Productivity Plan-based

Mimic

Mimic is an AI-based speech synthesis tool for generating natural, realistic voices for audiobooks, virtual assistants, audio content, and other applications. It offers flexible voice generation with multiple languages, API integration, and plan-dependent offline use.

AI Plan-based

Murf

Murf is a audio and music tool for AI voices, voiceovers, and speech production for videos, courses, and marketing material.

AI Freemium

NightCafe Studio

NightCafe Studio is an AI-powered audio creation platform for generating soundscapes, music, and sound effects with adjustable parameters, cloud-based access, export options, and community features.

Audio Freemium

Noise Blocker

Noise Blocker is an AI-powered noise suppression tool for calls, meetings, recordings, and streaming, designed to reduce background noise and improve clarity.

AI Plan-based

Ocenaudio

Ocenaudio is a free audio editor for quick cuts, recording checks, and simple editing without a complex studio environment.

Audio Free

Otter.ai

Otter.ai is an AI-powered transcription and note-taking tool for meetings, interviews, lectures, and other spoken content.

Audio Freemium

Play.ht

Play.ht is a text-to-speech platform for turning written content into natural-sounding audio for podcasts, audiobooks, e-learning, and other use cases.

Audio Plan-based

Podbean

Podbean is a comprehensive podcast platform that offers both hosting and monetization options. With a user-friendly interface and versatile features, Podbean helps podcasters create, publish, and make their content accessible to a broad audience. The platform is especially well suited for beginners and experienced podcasters who value ease of use and professional tools.

AI Subscription

Podcastle

Podcastle is an AI-powered platform for creating, recording, and editing audio and video content, with tools for transcription, audio enhancement, collaboration, and publishing workflows.

AI Plan-based

ReadSpeaker

Natural-sounding text-to-speech software for websites, apps, and digital learning content, with multilingual voices, accessibility features, and API or widget integration.

AI Freemium

Resemble AI

Resemble AI is a voice synthesis and cloning tool for teams that need fast, flexible audio production with clear rules around consent, labeling, security, and editorial review.

Audio Plan-based

Respeecher

Respeecher is a cloud-based voice cloning and synthetic speech tool for media teams that need repeatable workflows, clear consent handling, and reliable quality review for film, games, and localization.

AI Freemium

ResponsiveVoice

ResponsiveVoice is an AI-powered text-to-speech solution that makes it easy to add voice output to websites and applications. It supports many languages and voices, with straightforward integration for accessibility, interactivity, and automated audio workflows.

AI Plan-based

RX Elements by iZotope

RX Elements by iZotope is specialized audio editing software that focuses primarily on repairing and enhancing audio recordings. With a range of intelligent tools, it enables users to effectively remove unwanted noise such as hiss, clicks, or hum and improve the sound quality of speech and music recordings. The software is suitable for both beginners and advanced users who are looking for a cost-effective solution for audio restoration.

Audio One-time purchase

Slate Digital FG-X

Slate Digital FG-X is a professional mastering tool for maximizing loudness while preserving transparency, dynamics, and mix clarity.

Audio Subscription

Sonix

Sonix is an AI transcription and captioning tool for audio and video files. It helps turn interviews, meetings, podcasts, videos, and research recordings into searchable text faster.

AI Freemium

Speech-to-Text

AI-powered speech-to-text tools that automatically convert spoken language into written text for transcription, productivity, accessibility, and content workflows.

AI Freemium

Speechify

Speechify is an AI-powered text-to-speech tool that turns written content into natural-sounding audio. It helps users consume text more efficiently for study, work, or leisure, with a user-friendly interface and a range of features. A free version is available, along with paid plans that add more advanced capabilities.

AI Freemium

Speechly

Speechly is an AI-powered speech processing solution for adding real-time voice commands, speech recognition, and natural language understanding to web and mobile applications.

AI Freemium

Speechmatics

Speechmatics provides automatic speech recognition and transcription for audio, video, meetings, and multilingual workflows.

AI Freemium

Splice

Splice is a versatile platform focused on helping creatives produce audio and video content. With a combination of AI-powered tools and an extensive library of sounds, samples, and templates, Splice enables users to make their projects more efficient and more creative. The platform is aimed primarily at musicians, video producers, and content creators who want to boost their productivity.

AI Plan-based

Spreaker

Spreaker is a versatile platform for podcast creation and publishing, with tools for recording, editing, distribution, live streaming, analytics, monetization, and team collaboration.

AI Plan-based

StudioBinder

A production management platform for film and video teams with planning, collaboration, task tracking, and media organization features that can also support audio-related workflows.

Audio Plan-based

Suno AI

An AI-powered audio tool for creating, editing, and managing audio projects with intuitive workflows and flexible features for beginners and professionals alike.

Audio Freemium

T-RackS by IK Multimedia

T-RackS by IK Multimedia is a mixing and mastering suite for shaping finished audio with EQ, compression, limiting, saturation, metering, and analog-style color. It is aimed at musicians, producers, engineers, and podcasters who want more control over loudness, balance, and overall polish, while still relying on careful listening and reference-based decisions.

Audio One-time purchase

TDR Limiter 6 GE

TDR Limiter 6 GE is a professional audio plugin designed specifically for mastering and volume control. It offers precise and flexible dynamic processing with multiple limiter types and extensive customizable settings. Renowned for its high sound quality and user-friendly interface, it is a popular choice among sound engineers and music producers.

Audio One-time purchase

Temi

Temi is an automatic transcription service that enables quick and accurate conversion of audio and video files into text. Utilizing modern speech recognition technology, Temi is especially helpful for individuals who regularly need to transcribe audio content, such as journalists, students, and content creators. The service offers ease of use and delivers results rapidly, significantly boosting productivity.

Audio Usage-based

TurboScribe

TurboScribe is a modern transcription tool powered by artificial intelligence, designed specifically for fast and accurate conversion of audio files into text. It is ideal for users who want to transcribe audio content automatically, whether for interviews, meetings, podcasts, or other voice recordings. With an intuitive user interface and flexible pricing, TurboScribe offers both beginners and professional users an effective solution for audio transcription.

Audio Freemium

WavePad

WavePad is a versatile audio editing tool for everything from simple trimming to more complex production work. It offers an intuitive interface, broad format support, and practical features for recording, editing, adding effects, batch processing, and exporting audio across different platforms.

AI

Waves Abbey Road TG Mastering Chain

A mastering plugin that recreates the legendary Abbey Road console sound, with EQ, compression, limiting, and saturation in a flexible workflow for mixing and mastering.

Audio Subscription

Waves L1 Ultramaximizer

The Waves L1 Ultramaximizer is a professional audio plugin designed specifically for mastering and optimizing the loudness of music and audio productions. Utilizing precise limiting technology, it achieves maximum volume without distortion, preserving the sound quality of your tracks. As one of the most renowned tools in the audio industry, the L1 Ultramaximizer is an essential tool for producers, sound engineers, and musicians aiming to take their productions to the next level.

Audio One-time purchase

Waves L2 Ultramaximizer

A professional mastering limiter for controlling loudness with transparent clipping protection, dithering, and a simple interface.

Audio One-time purchase

WellSaid Labs

WellSaid Labs is a cloud-based AI text-to-speech platform for turning written content into natural-sounding voice recordings. It offers realistic voices, customization controls, API access, team collaboration, and export options for use in voice-overs, audiobooks, learning content, and podcasts.

Audio Plan-based

Zamzar AI

A practical file conversion tool for quickly preparing documents, images, audio, and video for further workflows, with clear limits around sensitive data, quality, and governance.

AI Plan-based

Zencastr

Zencastr is a audio and music tool for remote podcast recording, audio/video capture, and production workflow for conversations.

AI Subscription