OpenUpvote

Projects tagged with "Audio"

60 projects found

Adtwin AI icon

Adtwin AI

120d ago

AI audio ads made easy for marketers, brands, and agencies. Create fast, collaborate across teams, target any customers, distribute everywhere, and track with pixel analytics. Free to create, Pay when you publish.

Generate an AI voiceover (or record your voice / upload your own MP3 file), then pick an intro, a background, an outro, and let the AI Jingle Maker automagically generate your radio jingle, podcast intro or audio ad in the blink of an eye.

Aloude icon

Aloude

77d ago

Aloude lets you record quick audio takes or interviews, then instantly turns them into branded videos ready to share on X, LinkedIn, or embed on your site. Perfect for experts, creators, and businesses to grow authority through voice in just 2 clicks.

A powerful desktop application for real-time Arabic audio transcription and translation that works completely offline.

Audino AI icon

Audino AI

179d ago

Simplify video content creation with AI-powered audio generation. Our platform analyzes your videos to create perfectly synced sound effects and dynamic background music that adapts to every scene. Create content with ai audio that elevates your storytelling.

Audixa AI icon

Audixa AI

19d ago

Audixa AI is a high-fidelity text-to-speech API for developers & creators who want ElevenLabs-level quality without the cost. Great voices are expensive, affordable ones sound robotic-so we built Audixa. Fast, realistic voices. Reliable API. Simple docs. Up to 10x cheaper, with a real free tier. Perfect for AI apps, agents, voice tools & content automation.

It’s like having that friend who knows everything that’s happening, except it whispers directly into your ears as you walk around. BeeBot gives you a few short updates a day about people, places, and things nearby, emceed by your host, DJ BeeBot. BeeBot turns on automatically when you put your headphones in and stays quiet when you take them off. You can check in with DJ BeeBot to share what you’re doing or what’s happening nearby — BeeBot shares your updates with friends and other users nearby.

Add AI-powered audio players to any website in minutes. Your readers can listen to your content while multitasking, making your blog more accessible and engaging than ever before.

CastBandit icon

CastBandit

73d ago

CastBandit turns your podcast into an AI chatbot trained on your episodes. Listeners can ask questions, get episode recommendations, and explore your full back catalog. Embed it on your site or share via a public link.

Create audiobooks, podcasts, and interactive audio from your own content all in one platform where you can speak, join the conversation, and add your voice in real time.

AI-powered Podcast Editor, make your podcast production 10x faster! Automatically removes filler words and silences, generates show notes and highlights, and creates social-ready clips — all in one click. From one audio to 100+ content assets. Try for Free!

Professional voice typing and dictation tool supporting 99+ languages. Free speech to text conversion with AI-powered voice recognition. Start dictating now!

earwink™ icon

earwink™

129d ago

This will never take off! All my life, I've tried to create and launch ideas that are different, edgy, creative, innovative or fresh in some way. But I've noticed that often, these ideas don't get picked up by a mainstream audience. Will this boring one stick?

A major evolution of our platform, empowering the creation of the most advanced and trustworthy voice agents. Just five months since our last release, this update brings powerful improvements and full enterprise readiness—ushering in a new era.

Rename videos, photos, audio, pdfs with AI. Smart, automatic organization. Batch rename thousands of files in under 2-3 minutes for extremely cheap. (1000 renames for $10)

Fish Audio S1 is the most expressive and emotionally rich TTS model—creating lifelike voices that capture emotion, rhythm, and nuance. Clone any voice in 10 seconds, preserving accent, tone, and speaking habits with unmatched realism.

The 1st AI-powered testing infra for voice AI: evaluate across thousands of real-world scenarios in minutes using simulated agents that stress-test edge cases, detect multilingual issues, and uncover failures missed by humans. Ship reliable voice AI at scale!

Google AI Studio now lets you vibe code with your voice. Hit the mic, describe what you need, and watch it build. The AI strips out filler words and false starts, turning your natural speech into clean prompts. No typing, no friction—just talk and build.

Handy icon

Handy

113d ago

Handy is a cross platform, open-source, speech-to-text application for your computer.

Hathora icon

Hathora

15d ago

Build voice agents on open source or closed models with zero DevOps. Start instantly on shared endpoints and upgrade to dedicated infrastructure for privacy, compliance, or VPC requirements. Models run in 14 regions for ultra low latency. Bring your own models or custom containers as you scale.

Introducing Octave 2. What’s new: - Fluent in 11+ languages - 40% faster (<200ms latency⁠⁠) & 50% cheaper than Octave 1 - Multi-speaker conversation - More reliable pronunciation - New voice conversion & phoneme editing capabilities

Informed icon

Informed

95d ago

Tired of generic news readers? We are too. Get your daily news on topics you love, read by a voice you choose. Clone your own voice, a friend's, or any other. Your news, your topics, your voice.

Katalog icon

Katalog

83d ago

Save articles for later and listen to them with high-quality AI narration. Use your voice while listening to ask questions, take notes, or highlight important points.

Build a voice AI agent in minutes, straight from your terminal. One command scaffolds your project with built-in tunneling, sample backends, and global edge deployment. Connect via webhook, use existing agent logic, pay only for speech (silence is free).

Learn By Podcas turns everyday moments into opportunities to learn, discover, and share through audio. Turn commutes, workouts, or walks into hands-free learning with quick, engaging sessions. Explore diverse perspectives on news, get the essence of long reads in short audio bites, and share insights with friends. Browse our growing library of topics and join our early access community — free to use and evolving fast!

LyRuno icon

LyRuno

64d ago

LyRuno Free & Powerful AI Audio Separator. Extract vocals, instruments, and accompaniment from songs, or isolate dialogue, music, and SFX from videos—powered by world-leading AI for studio-quality audio separation.

Melodic Mind is an all-in-one music superapp built to help you create, learn, and grow as a musician — no matter your level. It has 20+ different apps that solve every need you have and help you on your musical journey, divided across 4 categories - Studio, Instruments, Music Theory & Toolkit.

MAI-Voice-1 is a lightning-fast speech generation model, with an ability to generate a full minute of audio in under a second on a single GPU, making it one of the most efficient speech systems available today.

Voxtral icon

Voxtral

134d ago

Voxtral by Mistral AI is a new family of open-source speech understanding models. Available in 24B and 3B sizes, it goes beyond transcription to offer Q&A, summarization, and function calling directly from voice with SOTA performance.

Monologue icon

Monologue

65d ago

Voice dictation that speaks your language. Stay in flow. Speak naturally. Monologue understands your context, learns your vocabulary, and formats automatically—so you can write what you meant to say.

MyClone icon

MyClone

20d ago

An AI platform built for knowledge professionals to help scale their services. Check out the live demo in https://www.myclone.is/ The clone acts as an extension of you, continuously learning from your YouTube, podcasts, documents, videos, and audio. It speaks in your voice, language, and gets integrated into your current workflow (website, Slack, etc.) Oh, you can complete white label it. Think of MyClone as "Shopify for knowledge professionals".

Parrot TTS icon

Parrot TTS

128d ago

Parrot TTS transforms any web text into remarkably human-like speech. Our advanced AI technology delivers natural-sounding voices that make listening a pleasure, eliminating the robotic experience of traditional text-to-speech solutions.

Pavis icon

Pavis

7d ago

Real-time analysis of your calls. Detect manipulation, fact-check, and come up with unique questions on the spot. Pavis transcribes conversations and instantly detects manipulation tactics, fact-checks claims, and suggests critical questions you'd miss in the moment. Stop walking into bad deals—whether it's investor pitches, sales negotiations, or contractor quotes. See pressure tactics as they happen. Verify statistics before you respond. Ask the questions that change outcomes.

Podmod AI icon

Podmod AI

32d ago

Podmod AI listens as you record, surfacing images, articles, videos, and AI answers directly on screen. Never miss a moment and never pause to Google again. Now you can be an expert on every topic instantly. Keep the flow, boost accuracy, and engage listeners.

Creating content is fun. But editing? Let’s be honest, not always as fun. With chat-based editing, you have a personal AI agent that helps you edit your videos just by chatting, so you can focus on the fun part.

Roark icon

Roark

92d ago

Build voice agents you can trust. Roark tracks call metrics, runs evaluations, and stress-tests your agent with simulated callers across accents, languages, and speaking styles. Failed calls become tests - giving you visibility and continuous improvement.

Roark icon

Roark

92d ago

Build voice agents you can trust. Roark tracks call metrics, runs evaluations, and stress-tests your agent with simulated callers across accents, languages, and speaking styles. Failed calls become tests - giving you visibility and continuous improvement.

Shushu AI icon

Shushu AI

131d ago

Shushu AI is AI-powered platform that automatically removes background noise and filler words from your audio and video. It also creates short videos by replacing parts of your content with b-roll clips that match the context, no editing skills needed.

SigmaMind AI (YC-backed) is a conversational AI platform to build voice and chat AI agents. Build with our no-code agent builder or plug in APIs. Prebuilt integrations + support for custom tools = fast, flexible deployment across industries.

Singify AI Vocal Remover uses advanced 10-stem separation to isolate vocals, drums, bass, piano, guitar, and more. Fast, free, and easy to use, it delivers high-quality results with minimal artifacts—perfect for creators, remixers, and music lovers.

Adjust video or audio playback speed, volume, and reverb with pitch preservation without paying a dime. ( Version 1.0 )

SnapLinear icon

SnapLinear

126d ago

Web app that automatically extracts actions items from meeting recordings/transcripts and converts them into tasks in Linear using AI.

Snipn icon

Snipn

40d ago

Snipn brings you daily AI-generated 2-minute audio news capsules on WhatsApp — in your language and region. Clear, verified, and human-like updates that make mornings smarter, faster, and scroll-free.

Spit Notes icon

Spit Notes

54d ago

For songwriters who juggle voice memos and notes. Spit Notes is the iOS app that finally connects audio to your lyrics. Capture inspiration instantly & never lose a song idea again.

Stable Audio 2.5 is a new audio model from Stability AI built for enterprise sound production. It delivers fast, high-quality, structured tracks in seconds, with advanced control features like audio inpainting for professional workflows.

Stream is a conversational self extension. It's designed for talking through ideas and capturing notes, with an Inner Voice personalized to you. Stream Ring is a new device for fast, private voice interactions. Hold to speak, whisper in a crowd, and control music effortlessly. No interruptions, pulling out your phone, or talking loudly in public. Now available for preorder, in limited supply

Generate Captivating Subtitles in Minutes SubtitlesFast automatically adds accurate, readable subtitles to your videos - no software, no editing. Just upload your clip and get polished results in minutes.

SuperU AI icon

SuperU AI

69d ago

SuperU AI is a Voice AI platform built for businesses to scale. With 100+ languages and ready templates, it runs inbound, outbound, and website calls with ease. Already trusted for 1M+ calls, built to handle 100M+ every month.

Supports 74 languages with real-time bilingual conversation translation. Put on your earbuds: hear Spanish in the left ear, English in the right. You can ask LLM-powered questions about the conversation, text + photo translation and even offline mode.

Copilot Audio Expressions is a free tool that turns text into expressive audio. Use Emotive Mode to direct your own scripts with custom tone and pace, or Story Mode to have Copilot create a full story with narration. All audio is downloadable as MP3.

Transform your audio and video into searchable text with TranscriptorPro an AI-powered transcription tool that’s fast, accurate, and affordable. Go beyond transcription with built-in chat, summary, and translation features, and export your results in multiple formats effortlessly.

Typeless icon

Typeless

9d ago

Speak naturally, and Typeless will turn your words into polished messages, emails, and documents that read like you carefully typed them. Our AI understands context, fixes grammar, and adapts to your style - so you can focus on what you want to say, not how to say it.

Videoform AI icon

Videoform AI

109d ago

Turn Q&A into interactive video calls! Create forms with video-based questions, record audio answers, and get instant transcripts via ElevenLabs. Smooth UI, beautiful animations, and seamless Supabase integration for creators & teams.

VNYL icon

VNYL

10d ago

Modern podcast hosting with truly unlimited storage and downloads at a flat rate, no caps, no overage fees, no surprises. While competitors charge per download tier (forcing you to delete episodes or upgrade), we leverage modern cloud infrastructure to make unlimited genuinely affordable. Built-in team collaboration, IAB-compliant analytics, publish scheduling, and dedicated podcast website, all included.

Vogent Voicelab is a platform for optimized inference of top open-source voice models, like Sesame's CSM-1B, Dia, Chatterbox, and more. Voicelab optimizes and post-trains these models to generate consistently high-quality speech ultra-fast.

VoiceDesi icon

VoiceDesi

95d ago

Generate custom, unique voices from text prompts. Perfect for content creators, marketers, and businesses seeking lifelike, personalized voices in seconds.

vol icon

vol

66d ago

vol has been rebuilt. you can now: - tell vol what you want - let it search and select the best podcast episodes for you - listen in your own podcast player

Whispering icon

Whispering

106d ago

Whispering is an open-source, local-first transcription app. Use local and cloud models, chain custom transforms, and most importantly, keep your audio local on-device. Fast, ergonomic, and MIT-licensed. Let’s make closed-source apps obsolete. 🚀

CRAISEE icon

CRAISEE

44d ago

Generate images, videos, text, and audio with 5,000+ AI models. No more switching tools. No more multiple subscriptions. Everything in one beautiful interface with auto-pick for you task.

Ztalk.ai icon

Ztalk.ai

213d ago

Break language barriers in video calls with AI-powered real-time voice translation. Ztalk is a desktop app that works with all video conferencing tools including Gmeet, Zoom and Teams.