OpenUpvote

Projects tagged with "Audio"

41 projects found

Adtwin AI icon

Adtwin AI

73d ago

AI audio ads made easy for marketers, brands, and agencies. Create fast, collaborate across teams, target any customers, distribute everywhere, and track with pixel analytics. Free to create, Pay when you publish.

Generate an AI voiceover (or record your voice / upload your own MP3 file), then pick an intro, a background, an outro, and let the AI Jingle Maker automagically generate your radio jingle, podcast intro or audio ad in the blink of an eye.

Aloude icon

Aloude

30d ago

Aloude lets you record quick audio takes or interviews, then instantly turns them into branded videos ready to share on X, LinkedIn, or embed on your site. Perfect for experts, creators, and businesses to grow authority through voice in just 2 clicks.

A powerful desktop application for real-time Arabic audio transcription and translation that works completely offline.

Audino AI icon

Audino AI

132d ago

Simplify video content creation with AI-powered audio generation. Our platform analyzes your videos to create perfectly synced sound effects and dynamic background music that adapts to every scene. Create content with ai audio that elevates your storytelling.

Add AI-powered audio players to any website in minutes. Your readers can listen to your content while multitasking, making your blog more accessible and engaging than ever before.

CastBandit icon

CastBandit

26d ago

CastBandit turns your podcast into an AI chatbot trained on your episodes. Listeners can ask questions, get episode recommendations, and explore your full back catalog. Embed it on your site or share via a public link.

Create audiobooks, podcasts, and interactive audio from your own content all in one platform where you can speak, join the conversation, and add your voice in real time.

AI-powered Podcast Editor, make your podcast production 10x faster! Automatically removes filler words and silences, generates show notes and highlights, and creates social-ready clips — all in one click. From one audio to 100+ content assets. Try for Free!

Professional voice typing and dictation tool supporting 99+ languages. Free speech to text conversion with AI-powered voice recognition. Start dictating now!

earwink™ icon

earwink™

82d ago

This will never take off! All my life, I've tried to create and launch ideas that are different, edgy, creative, innovative or fresh in some way. But I've noticed that often, these ideas don't get picked up by a mainstream audience. Will this boring one stick?

A major evolution of our platform, empowering the creation of the most advanced and trustworthy voice agents. Just five months since our last release, this update brings powerful improvements and full enterprise readiness—ushering in a new era.

The 1st AI-powered testing infra for voice AI: evaluate across thousands of real-world scenarios in minutes using simulated agents that stress-test edge cases, detect multilingual issues, and uncover failures missed by humans. Ship reliable voice AI at scale!

Google AI Studio now lets you vibe code with your voice. Hit the mic, describe what you need, and watch it build. The AI strips out filler words and false starts, turning your natural speech into clean prompts. No typing, no friction—just talk and build.

Handy icon

Handy

66d ago

Handy is a cross platform, open-source, speech-to-text application for your computer.

Introducing Octave 2. What’s new: - Fluent in 11+ languages - 40% faster (<200ms latency⁠⁠) & 50% cheaper than Octave 1 - Multi-speaker conversation - More reliable pronunciation - New voice conversion & phoneme editing capabilities

Informed icon

Informed

48d ago

Tired of generic news readers? We are too. Get your daily news on topics you love, read by a voice you choose. Clone your own voice, a friend's, or any other. Your news, your topics, your voice.

Katalog icon

Katalog

36d ago

Save articles for later and listen to them with high-quality AI narration. Use your voice while listening to ask questions, take notes, or highlight important points.

Build a voice AI agent in minutes, straight from your terminal. One command scaffolds your project with built-in tunneling, sample backends, and global edge deployment. Connect via webhook, use existing agent logic, pay only for speech (silence is free).

LyRuno icon

LyRuno

17d ago

LyRuno Free & Powerful AI Audio Separator. Extract vocals, instruments, and accompaniment from songs, or isolate dialogue, music, and SFX from videos—powered by world-leading AI for studio-quality audio separation.

MAI-Voice-1 is a lightning-fast speech generation model, with an ability to generate a full minute of audio in under a second on a single GPU, making it one of the most efficient speech systems available today.

Voxtral icon

Voxtral

87d ago

Voxtral by Mistral AI is a new family of open-source speech understanding models. Available in 24B and 3B sizes, it goes beyond transcription to offer Q&A, summarization, and function calling directly from voice with SOTA performance.

Monologue icon

Monologue

18d ago

Voice dictation that speaks your language. Stay in flow. Speak naturally. Monologue understands your context, learns your vocabulary, and formats automatically—so you can write what you meant to say.

Parrot TTS icon

Parrot TTS

81d ago

Parrot TTS transforms any web text into remarkably human-like speech. Our advanced AI technology delivers natural-sounding voices that make listening a pleasure, eliminating the robotic experience of traditional text-to-speech solutions.

Roark icon

Roark

45d ago

Build voice agents you can trust. Roark tracks call metrics, runs evaluations, and stress-tests your agent with simulated callers across accents, languages, and speaking styles. Failed calls become tests - giving you visibility and continuous improvement.

Roark icon

Roark

45d ago

Build voice agents you can trust. Roark tracks call metrics, runs evaluations, and stress-tests your agent with simulated callers across accents, languages, and speaking styles. Failed calls become tests - giving you visibility and continuous improvement.

Shushu AI icon

Shushu AI

84d ago

Shushu AI is AI-powered platform that automatically removes background noise and filler words from your audio and video. It also creates short videos by replacing parts of your content with b-roll clips that match the context, no editing skills needed.

Singify AI Vocal Remover uses advanced 10-stem separation to isolate vocals, drums, bass, piano, guitar, and more. Fast, free, and easy to use, it delivers high-quality results with minimal artifacts—perfect for creators, remixers, and music lovers.

SnapLinear icon

SnapLinear

79d ago

Web app that automatically extracts actions items from meeting recordings/transcripts and converts them into tasks in Linear using AI.

For songwriters who juggle voice memos and notes. Spit Notes is the iOS app that finally connects audio to your lyrics. Capture inspiration instantly & never lose a song idea again.

Stable Audio 2.5 is a new audio model from Stability AI built for enterprise sound production. It delivers fast, high-quality, structured tracks in seconds, with advanced control features like audio inpainting for professional workflows.

Generate Captivating Subtitles in Minutes SubtitlesFast automatically adds accurate, readable subtitles to your videos - no software, no editing. Just upload your clip and get polished results in minutes.

SuperU AI icon

SuperU AI

22d ago

SuperU AI is a Voice AI platform built for businesses to scale. With 100+ languages and ready templates, it runs inbound, outbound, and website calls with ease. Already trusted for 1M+ calls, built to handle 100M+ every month.

Supports 74 languages with real-time bilingual conversation translation. Put on your earbuds: hear Spanish in the left ear, English in the right. You can ask LLM-powered questions about the conversation, text + photo translation and even offline mode.

Copilot Audio Expressions is a free tool that turns text into expressive audio. Use Emotive Mode to direct your own scripts with custom tone and pace, or Story Mode to have Copilot create a full story with narration. All audio is downloadable as MP3.

Turn Q&A into interactive video calls! Create forms with video-based questions, record audio answers, and get instant transcripts via ElevenLabs. Smooth UI, beautiful animations, and seamless Supabase integration for creators & teams.

Vogent Voicelab is a platform for optimized inference of top open-source voice models, like Sesame's CSM-1B, Dia, Chatterbox, and more. Voicelab optimizes and post-trains these models to generate consistently high-quality speech ultra-fast.

VoiceDesi icon

VoiceDesi

48d ago

Generate custom, unique voices from text prompts. Perfect for content creators, marketers, and businesses seeking lifelike, personalized voices in seconds.

vol icon

vol

19d ago

vol has been rebuilt. you can now: - tell vol what you want - let it search and select the best podcast episodes for you - listen in your own podcast player

Whispering icon

Whispering

59d ago

Whispering is an open-source, local-first transcription app. Use local and cloud models, chain custom transforms, and most importantly, keep your audio local on-device. Fast, ergonomic, and MIT-licensed. Let’s make closed-source apps obsolete. 🚀

Ztalk.ai icon

Ztalk.ai

166d ago

Break language barriers in video calls with AI-powered real-time voice translation. Ztalk is a desktop app that works with all video conferencing tools including Gmeet, Zoom and Teams.