Projects tagged with "Audio"
60 projects found
Adtwin AI
AI audio ads made easy for marketers, brands, and agencies. Create fast, collaborate across teams, target any customers, distribute everywhere, and track with pixel analytics. Free to create, Pay when you publish.
AI Jingle Maker
Generate an AI voiceover (or record your voice / upload your own MP3 file), then pick an intro, a background, an outro, and let the AI Jingle Maker automagically generate your radio jingle, podcast intro or audio ad in the blink of an eye.
Aloude
Aloude lets you record quick audio takes or interviews, then instantly turns them into branded videos ready to share on X, LinkedIn, or embed on your site. Perfect for experts, creators, and businesses to grow authority through voice in just 2 clicks.
A powerful desktop application for real-time Arabic audio transcription and translation that works completely offline.
Audino AI
Simplify video content creation with AI-powered audio generation. Our platform analyzes your videos to create perfectly synced sound effects and dynamic background music that adapts to every scene. Create content with ai audio that elevates your storytelling.
Audixa AI
Audixa AI is a high-fidelity text-to-speech API for developers & creators who want ElevenLabs-level quality without the cost. Great voices are expensive, affordable ones sound robotic-so we built Audixa. Fast, realistic voices. Reliable API. Simple docs. Up to 10x cheaper, with a real free tier. Perfect for AI apps, agents, voice tools & content automation.
BeeBot for AirPods
It’s like having that friend who knows everything that’s happening, except it whispers directly into your ears as you walk around. BeeBot gives you a few short updates a day about people, places, and things nearby, emceed by your host, DJ BeeBot. BeeBot turns on automatically when you put your headphones in and stays quiet when you take them off. You can check in with DJ BeeBot to share what you’re doing or what’s happening nearby — BeeBot shares your updates with friends and other users nearby.
Butter Reader
Add AI-powered audio players to any website in minutes. Your readers can listen to your content while multitasking, making your blog more accessible and engaging than ever before.
CastBandit
CastBandit turns your podcast into an AI chatbot trained on your episodes. Listeners can ask questions, get episode recommendations, and explore your full back catalog. Embed it on your site or share via a public link.
Chatquick AI
Create audiobooks, podcasts, and interactive audio from your own content all in one platform where you can speak, join the conversation, and add your voice in real time.
CreateWise AI
AI-powered Podcast Editor, make your podcast production 10x faster! Automatically removes filler words and silences, generates show notes and highlights, and creates social-ready clips — all in one click. From one audio to 100+ content assets. Try for Free!
Professional voice typing and dictation tool supporting 99+ languages. Free speech to text conversion with AI-powered voice recognition. Start dictating now!
earwink™
This will never take off! All my life, I've tried to create and launch ideas that are different, edgy, creative, innovative or fresh in some way. But I've noticed that often, these ideas don't get picked up by a mainstream audience. Will this boring one stick?
A major evolution of our platform, empowering the creation of the most advanced and trustworthy voice agents. Just five months since our last release, this update brings powerful improvements and full enterprise readiness—ushering in a new era.
file renamer ai
Rename videos, photos, audio, pdfs with AI. Smart, automatic organization. Batch rename thousands of files in under 2-3 minutes for extremely cheap. (1000 renames for $10)
Fish Audio S1
Fish Audio S1 is the most expressive and emotionally rich TTS model—creating lifelike voices that capture emotion, rhythm, and nuance. Clone any voice in 10 seconds, preserving accent, tone, and speaking habits with unmatched realism.
Simulate by Future AGI
The 1st AI-powered testing infra for voice AI: evaluate across thousands of real-world scenarios in minutes using simulated agents that stress-test edge cases, detect multilingual issues, and uncover failures missed by humans. Ship reliable voice AI at scale!
Google AI Studio
Google AI Studio now lets you vibe code with your voice. Hit the mic, describe what you need, and watch it build. The AI strips out filler words and false starts, turning your natural speech into clean prompts. No typing, no friction—just talk and build.
Handy
Handy is a cross platform, open-source, speech-to-text application for your computer.
Hathora
Build voice agents on open source or closed models with zero DevOps. Start instantly on shared endpoints and upgrade to dedicated infrastructure for privacy, compliance, or VPC requirements. Models run in 14 regions for ultra low latency. Bring your own models or custom containers as you scale.
Octave 2 by Hume AI
Introducing Octave 2. What’s new: - Fluent in 11+ languages - 40% faster (<200ms latency) & 50% cheaper than Octave 1 - Multi-speaker conversation - More reliable pronunciation - New voice conversion & phoneme editing capabilities
Informed
Tired of generic news readers? We are too. Get your daily news on topics you love, read by a voice you choose. Clone your own voice, a friend's, or any other. Your news, your topics, your voice.
Katalog
Save articles for later and listen to them with high-quality AI narration. Use your voice while listening to ask questions, take notes, or highlight important points.
Layercode CLI
Build a voice AI agent in minutes, straight from your terminal. One command scaffolds your project with built-in tunneling, sample backends, and global edge deployment. Connect via webhook, use existing agent logic, pay only for speech (silence is free).
Learn By Podcas
Learn By Podcas turns everyday moments into opportunities to learn, discover, and share through audio. Turn commutes, workouts, or walks into hands-free learning with quick, engaging sessions. Explore diverse perspectives on news, get the essence of long reads in short audio bites, and share insights with friends. Browse our growing library of topics and join our early access community — free to use and evolving fast!
LyRuno
LyRuno Free & Powerful AI Audio Separator. Extract vocals, instruments, and accompaniment from songs, or isolate dialogue, music, and SFX from videos—powered by world-leading AI for studio-quality audio separation.
Melodic Mind
Melodic Mind is an all-in-one music superapp built to help you create, learn, and grow as a musician — no matter your level. It has 20+ different apps that solve every need you have and help you on your musical journey, divided across 4 categories - Studio, Instruments, Music Theory & Toolkit.
Microsoft AI (MAI) Voice-1
MAI-Voice-1 is a lightning-fast speech generation model, with an ability to generate a full minute of audio in under a second on a single GPU, making it one of the most efficient speech systems available today.
Voxtral
Voxtral by Mistral AI is a new family of open-source speech understanding models. Available in 24B and 3B sizes, it goes beyond transcription to offer Q&A, summarization, and function calling directly from voice with SOTA performance.
Monologue
Voice dictation that speaks your language. Stay in flow. Speak naturally. Monologue understands your context, learns your vocabulary, and formats automatically—so you can write what you meant to say.
MyClone
An AI platform built for knowledge professionals to help scale their services. Check out the live demo in https://www.myclone.is/ The clone acts as an extension of you, continuously learning from your YouTube, podcasts, documents, videos, and audio. It speaks in your voice, language, and gets integrated into your current workflow (website, Slack, etc.) Oh, you can complete white label it. Think of MyClone as "Shopify for knowledge professionals".
Parrot TTS
Parrot TTS transforms any web text into remarkably human-like speech. Our advanced AI technology delivers natural-sounding voices that make listening a pleasure, eliminating the robotic experience of traditional text-to-speech solutions.
Pavis
Real-time analysis of your calls. Detect manipulation, fact-check, and come up with unique questions on the spot. Pavis transcribes conversations and instantly detects manipulation tactics, fact-checks claims, and suggests critical questions you'd miss in the moment. Stop walking into bad deals—whether it's investor pitches, sales negotiations, or contractor quotes. See pressure tactics as they happen. Verify statistics before you respond. Ask the questions that change outcomes.
Podmod AI
Podmod AI listens as you record, surfacing images, articles, videos, and AI answers directly on screen. Never miss a moment and never pause to Google again. Now you can be an expert on every topic instantly. Keep the flow, boost accuracy, and engage listeners.
Creating content is fun. But editing? Let’s be honest, not always as fun. With chat-based editing, you have a personal AI agent that helps you edit your videos just by chatting, so you can focus on the fun part.
Roark
Build voice agents you can trust. Roark tracks call metrics, runs evaluations, and stress-tests your agent with simulated callers across accents, languages, and speaking styles. Failed calls become tests - giving you visibility and continuous improvement.
Roark
Build voice agents you can trust. Roark tracks call metrics, runs evaluations, and stress-tests your agent with simulated callers across accents, languages, and speaking styles. Failed calls become tests - giving you visibility and continuous improvement.
Shushu AI
Shushu AI is AI-powered platform that automatically removes background noise and filler words from your audio and video. It also creates short videos by replacing parts of your content with b-roll clips that match the context, no editing skills needed.
SigmaMind AI
SigmaMind AI (YC-backed) is a conversational AI platform to build voice and chat AI agents. Build with our no-code agent builder or plug in APIs. Prebuilt integrations + support for custom tools = fast, flexible deployment across industries.
Singify AI Vocal Remover
Singify AI Vocal Remover uses advanced 10-stem separation to isolate vocals, drums, bass, piano, guitar, and more. Fast, free, and easy to use, it delivers high-quality results with minimal artifacts—perfect for creators, remixers, and music lovers.
Slowed Enchanced
Adjust video or audio playback speed, volume, and reverb with pitch preservation without paying a dime. ( Version 1.0 )
SnapLinear
Web app that automatically extracts actions items from meeting recordings/transcripts and converts them into tasks in Linear using AI.
Snipn
Snipn brings you daily AI-generated 2-minute audio news capsules on WhatsApp — in your language and region. Clear, verified, and human-like updates that make mornings smarter, faster, and scroll-free.
Spit Notes
For songwriters who juggle voice memos and notes. Spit Notes is the iOS app that finally connects audio to your lyrics. Capture inspiration instantly & never lose a song idea again.
Stable Audio 2.5
Stable Audio 2.5 is a new audio model from Stability AI built for enterprise sound production. It delivers fast, high-quality, structured tracks in seconds, with advanced control features like audio inpainting for professional workflows.
Stream Ring by Sandbar
Stream is a conversational self extension. It's designed for talking through ideas and capturing notes, with an Inner Voice personalized to you. Stream Ring is a new device for fast, private voice interactions. Hold to speak, whisper in a crowd, and control music effortlessly. No interruptions, pulling out your phone, or talking loudly in public. Now available for preorder, in limited supply
SubtitlesFast
Generate Captivating Subtitles in Minutes SubtitlesFast automatically adds accurate, readable subtitles to your videos - no software, no editing. Just upload your clip and get polished results in minutes.
SuperU AI
SuperU AI is a Voice AI platform built for businesses to scale. With 100+ languages and ready templates, it runs inbound, outbound, and website calls with ease. Already trusted for 1M+ calls, built to handle 100M+ every month.
Talking Translator
Supports 74 languages with real-time bilingual conversation translation. Put on your earbuds: hear Spanish in the left ear, English in the right. You can ask LLM-powered questions about the conversation, text + photo translation and even offline mode.
Copilot Audio Expressions
Copilot Audio Expressions is a free tool that turns text into expressive audio. Use Emotive Mode to direct your own scripts with custom tone and pace, or Story Mode to have Copilot create a full story with narration. All audio is downloadable as MP3.
TranscriptorPro
Transform your audio and video into searchable text with TranscriptorPro an AI-powered transcription tool that’s fast, accurate, and affordable. Go beyond transcription with built-in chat, summary, and translation features, and export your results in multiple formats effortlessly.
Typeless
Speak naturally, and Typeless will turn your words into polished messages, emails, and documents that read like you carefully typed them. Our AI understands context, fixes grammar, and adapts to your style - so you can focus on what you want to say, not how to say it.
Videoform AI
Turn Q&A into interactive video calls! Create forms with video-based questions, record audio answers, and get instant transcripts via ElevenLabs. Smooth UI, beautiful animations, and seamless Supabase integration for creators & teams.
VNYL
Modern podcast hosting with truly unlimited storage and downloads at a flat rate, no caps, no overage fees, no surprises. While competitors charge per download tier (forcing you to delete episodes or upgrade), we leverage modern cloud infrastructure to make unlimited genuinely affordable. Built-in team collaboration, IAB-compliant analytics, publish scheduling, and dedicated podcast website, all included.
Vogent Voicelab
Vogent Voicelab is a platform for optimized inference of top open-source voice models, like Sesame's CSM-1B, Dia, Chatterbox, and more. Voicelab optimizes and post-trains these models to generate consistently high-quality speech ultra-fast.
VoiceDesi
Generate custom, unique voices from text prompts. Perfect for content creators, marketers, and businesses seeking lifelike, personalized voices in seconds.
vol
vol has been rebuilt. you can now: - tell vol what you want - let it search and select the best podcast episodes for you - listen in your own podcast player
Whispering
Whispering is an open-source, local-first transcription app. Use local and cloud models, chain custom transforms, and most importantly, keep your audio local on-device. Fast, ergonomic, and MIT-licensed. Let’s make closed-source apps obsolete. 🚀
CRAISEE
Generate images, videos, text, and audio with 5,000+ AI models. No more switching tools. No more multiple subscriptions. Everything in one beautiful interface with auto-pick for you task.
Ztalk.ai
Break language barriers in video calls with AI-powered real-time voice translation. Ztalk is a desktop app that works with all video conferencing tools including Gmeet, Zoom and Teams.