This is a subreddit preview page. If you have a GummySearch account, please add this Subreddit to your audience to view the full analysis features there.
r/speechtech is a subreddit with 2k members. Its distinguishing qualities are that the community is medium in size.
Community about the news of speech technology - new software, algorithms, papers and datasets. Speech, recognition, speech synthesis, text-to-speech voice biometrics, speaker identification and audio analysis.
Popular Themes in r/speechtech
#1
Solution Requests
: "PortaSpeech: Portable and High-Quality Generative Text-to-Speech"
9 posts
#2
Advice Requests
: "Nice course on speech recognition/synthesis"
7 posts
#3
Self-Promotion
: "Deepgram's Nova: Next-Gen Speech-to-Text & Whisper API with built-in diarization and word-level timestamps"
6 posts
#4
News
: "New AI model outperforms OpenAI, Deepgram, and ElevenLabs on Japanese ASR benchmarks"
3 posts
#5
Pain & Anger
: "Does anyone else find lhotse a pain to use"
2 posts
#6
Money Talk
: "Beta testers needed: Salad Transcription API (from $0.10/hour)"
2 posts
#7
Opportunities
: "I built a job aggregator monitoring Speech AI companies"
1 post
Popular Topics in r/speechtech
#1
Speech Recognition
: "Nice course on Speech Recognition/synthesis"
39 posts
#2
Asr
: "New AI model outperforms OpenAI, Deepgram, and ElevenLabs on Japanese Asr benchmarks"
19 posts
#3
Speech
: "Introducing Ursa from Speechmatics | Claimed to be 25% more accurate than Whisper"
19 posts
#4
Tts
: "[2409.10058] StyleTts-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion"
18 posts
#5
Text To Speech
: "Show HN: Neural Text To Speech with dozens of celebrity voices"
15 posts
#6
Voice
: "FlowTSE -- a new method for extracting a target speaker’s Voice from noisy, multi-speaker recordings"
13 posts
#7
Model
: "New AI Model outperforms OpenAI, Deepgram, and ElevenLabs on Japanese ASR benchmarks"
13 posts
#8
Speech To Text
: "I benchmarked 12+ speech-to-text APIs under various real-world conditions"
12 posts
#9
Models
: "WavLM, UniSpeech-SAT and UniSpeech Transformer Models from Microsoft"
10 posts
#10
Dataset
: "A Large, modern and evolving Dataset for automatic speech recognition (10k hours)"
9 posts
Member Growth in r/speechtech
Yearly
+719 members(53.0%)
Similar Subreddits to r/speechtech

r/artificial
1.1M members
32.4% / yr

r/ArtificialInteligence
1.5M members
169.8% / yr

r/Bard
82k members
110.5% / yr

r/ChatGPT
10.5M members
83.8% / yr

r/ChatGPTPro
423k members
77.7% / yr

r/ElevenLabs
14k members
100.9% / yr
r/LanguageTechnology
56k members
16.7% / yr

r/LocalLLaMA
479k members
181.8% / yr

r/ollama
70k members
1082.6% / yr
r/TextToSpeech
2k members
192.9% / yr