Join Wispr Flow as an ML Engineer to build the next-generation voice interface. You will prototype features, optimize LLM inference for sub-500ms latency, and scale personalization of speech models and LLMs using fine-tuning and RL. Work with a team of AI researchers and engineers in a fast-growing startup.
Responsibilities
Prototype and design new features of the voice interface
Build infrastructure for <500ms LLM inference at scale
Scale personalization of speech models and LLMs with fine-tuning and RL
Requirements
Previous founding or startup experience
Experience optimizing ML inference or engineering systems for research teams
Fluency in Python and LLM development
Attention to detail
Aptitude and clarity of thought
Creativity, excellence in engineering, and code velocity