r/speechtech is a subreddit with 5k members. The most common kinds of discussions are solution requests and self-promotion, and the community frequently discusses speech recognition, asr, tts, speech, and voice.
Community about the news of speech technology - new software, algorithms, papers and datasets. Speech, recognition, speech synthesis, text-to-speech voice biometrics, speaker identification and audio analysis.
Popular Themes in r/speechtech
#1
Solution Requests
: "PortaSpeech: Portable and High-Quality Generative Text-to-Speech"
12 posts
#2
Self-Promotion
: "Deepgram's Nova: Next-Gen Speech-to-Text & Whisper API with built-in diarization and word-level timestamps"
6 posts
#3
Advice Requests
: "Introducing Whisper"
4 posts
#4
News
: "New AI model outperforms OpenAI, Deepgram, and ElevenLabs on Japanese ASR benchmarks"
3 posts
#5
Money Talk
: "Beta testers needed: Salad Transcription API (from $0.10/hour)"
2 posts
#6
Pain & Anger
: "Does anyone else find lhotse a pain to use"
1 post
#7
Opportunities
: "I built a job aggregator monitoring Speech AI companies"
1 post
Popular Topics in r/speechtech
#1
Speech Recognition
: "MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition"
61 posts
#2
Asr
: "New AI model outperforms OpenAI, Deepgram, and ElevenLabs on Japanese Asr benchmarks"
32 posts
#3
Tts
: "When do you think Tts costs will become reasonably priced?"
28 posts
#4
Speech
: "Introducing Ursa from Speechmatics | Claimed to be 25% more accurate than Whisper"
24 posts
#5
Voice
: "I’m an ex-Googler — we built an AI Voice agent that answers calls, books leads, and fixes a huge gap in service businesses"
22 posts
#6
Text To Speech
: "Vakyansh TTS (Text To Speech) for Indic Languages"
19 posts
#7
Speech To Text
: "I benchmarked 12+ speech-to-text APIs under various real-world conditions"
17 posts
#8
Ai
: "I’m an ex-Googler — we built an Ai voice agent that answers calls, books leads, and fixes a huge gap in service businesses"
16 posts
#9
Language
: "Practicing a new Language without feeling awkward? This helped me big time"
16 posts
#10
Model
: "New AI Model outperforms OpenAI, Deepgram, and ElevenLabs on Japanese ASR benchmarks"
14 posts
Flair Used in r/speechtech
#1
Technology
: "[Open Source] omnivoice-triton: ~3.4x Inference Speedup for OmniVoice (NAR TTS) via Triton Kernel Fusion & CUDA Graphs"
9 posts
#2
Promotion
: "Standard Speech-to-Text vs. Real-Time "Speech Understanding" (Emotion, Intent, Entities, Voice Bio-metrics)"
2 posts
Member Growth in r/speechtech
Yearly
+3k members(146.6%)
Similar Subreddits to r/speechtech
r/AssistiveTechnology
6k members
73.5% / yr
r/ChatGPT
11.5M members
8.8% / yr
r/LanguageTechnology
64k members
14.3% / yr
r/machinelearningnews
140k members
42.1% / yr
About
GummySearch helps people research Reddit communities by organizing activity, growth, themes, and post-level signals into one place.
This page gives a focused view of r/speechtech, including current member size, discussion patterns, product reviews, and related communities to explore.
This data is synced periodically so insights stay current and useful for ongoing research.
Last updated: June 5, 2026