This is a subreddit preview page. If you have a GummySearch account, please add this Subreddit to your audience to view the full analysis features there.
/r/speechtech/

r/speechtech

2k members
r/speechtech is a subreddit with 2k members. Its distinguishing qualities are that the community is medium in size.
Community about the news of speech technology - new software, algorithms, papers and datasets. Speech, recognition, speech synthesis, text-to-speech voice biometrics, speaker identification and audio analysis.

Popular Themes in r/speechtech

#1
Solution Requests
: "PortaSpeech: Portable and High-Quality Generative Text-to-Speech"
9 posts
#2
Advice Requests
: "Nice course on speech recognition/synthesis"
7 posts
#3
Self-Promotion
: "Deepgram's Nova: Next-Gen Speech-to-Text & Whisper API with built-in diarization and word-level timestamps"
6 posts
#4
News
: "New AI model outperforms OpenAI, Deepgram, and ElevenLabs on Japanese ASR benchmarks"
3 posts
#5
Pain & Anger
: "Does anyone else find lhotse a pain to use"
2 posts
#6
Money Talk
: "Beta testers needed: Salad Transcription API (from $0.10/hour)"
2 posts
#7
Opportunities
: "I built a job aggregator monitoring Speech AI companies"
1 post

Popular Topics in r/speechtech

#1

Speech Recognition

: "Nice course on Speech Recognition/synthesis"
39 posts
#2

Asr

: "New AI model outperforms OpenAI, Deepgram, and ElevenLabs on Japanese Asr benchmarks"
19 posts
#3

Speech

: "Introducing Ursa from Speechmatics | Claimed to be 25% more accurate than Whisper"
19 posts
#4

Tts

: "[2409.10058] StyleTts-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion"
18 posts
#5

Text To Speech

: "Show HN: Neural Text To Speech with dozens of celebrity voices"
15 posts
#6

Voice

: "FlowTSE -- a new method for extracting a target speaker’s Voice from noisy, multi-speaker recordings"
13 posts
#7

Model

: "New AI Model outperforms OpenAI, Deepgram, and ElevenLabs on Japanese ASR benchmarks"
13 posts
#8

Speech To Text

: "I benchmarked 12+ speech-to-text APIs under various real-world conditions"
12 posts
#9

Models

: "WavLM, UniSpeech-SAT and UniSpeech Transformer Models from Microsoft"
10 posts
#10

Dataset

: "A Large, modern and evolving Dataset for automatic speech recognition (10k hours)"
9 posts

Member Growth in r/speechtech

Yearly
+719 members(53.0%)

Similar Subreddits to r/speechtech

/r/artificial

r/artificial

1.1M members
32.4% / yr
/r/ArtificialInteligence

r/ArtificialInteligence

1.5M members
169.8% / yr
/r/Bard

r/Bard

82k members
110.5% / yr
/r/ChatGPT

r/ChatGPT

10.5M members
83.8% / yr
/r/ChatGPTPro

r/ChatGPTPro

423k members
77.7% / yr
/r/ElevenLabs

r/ElevenLabs

14k members
100.9% / yr

r/LanguageTechnology

56k members
16.7% / yr
/r/LocalLLaMA

r/LocalLLaMA

479k members
181.8% / yr
/r/ollama

r/ollama

70k members
1082.6% / yr

r/TextToSpeech

2k members
192.9% / yr