Tomasz Tunguz Blog · 2023-09-08
· 1000d
Hybrid Cloud Architecture as the Future of AI-Powered Dictation Software
Tomasz Tunguz explores OpenAI's Whisper dictation model and its advantages over traditional speech recognition systems through contextual understanding of language. He analyzes the hardware constraints of running large language models locally and proposes that a hybrid architecture—processing some audio locally and some in the cloud—will emerge as the optimal deployment strategy for dictation software.
Metrics in this report
Dictation Speed Advantage
3multiplier
minimum
Speaking versus typing speed comparison
Hardware Performance Gap
3multiplier
consensus
Nvidia versus Apple Mac GPU for ML model inference
MacBook Pro RAM Configuration
64GB
specific
Author's test machine with M1 Max chip