View All Jobs 112179

Backend Engineer - Inference Services - Remote Eligible

Design and implement scalable backend inference services for Deepgram’s engine team
Remote
Junior
$150,000 – 220,000 USD / year
22 hours agoBe an early applicant
Deepgram

Deepgram

Provides AI-powered speech recognition and audio intelligence APIs for real-time transcription, understanding, and analysis at scale.

Deepgram Backend Software Engineer

Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text (STT), text-to-speech (TTS), and building production-grade voice agents at scale. More than 200,000 developers and 1,300+ organizations build voice offerings that are 'Powered by Deepgram', including Twilio, Cloudflare, Sierra, Decagon, Vapi, Daily, Cresta, Granola, and Jack in the Box. Deepgram's voice-native foundation models are accessed through cloud APIs or as self-hosted and on-premises software, with unmatched accuracy, low latency, and cost efficiency. Backed by a recent Series C led by leading global investors and strategic partners, Deepgram has processed over 50,000 years of audio and transcribed more than 1 trillion words. There is no organization in the world that understands voice better than Deepgram.

Deepgram is looking for a Backend Software Engineer to join the Engine team to lead the design and implementation of Deepgram's products. You will design and implement secure, robust, and scalable services for speech processing; efficient, distributed compute orchestration; optimized scheduling, and more. Your skill at building highly reusable code that overcomes technical challenges is paired with an intuition for delightful user experiences. You will be a critical voice in Deepgram's Product and Engineering teams, driving high impact products from start to finish.

What You'll Do

  • Improve Deepgram's core inference services including areas in networking, speech processing, audio transcoding, and latency and memory optimization
  • Develop processes for measuring, building, and optimizing services to maximize system performance
  • Debug complex system issues that include networking, scheduling, and high performance computing interactions
  • Rapidly customize backend services to support our customer needs
  • Partner with Product to design and implement new services, features, and/or products end to end

You'll Love This Role If You

  • Thrive in a fast-paced, impact-driven environment where learning new skills on-the-fly is not only encouraged but a regular necessity
  • Enjoy balancing decisions about product and feature maturity to decide when to make minimally invasive changes versus when to incorporate detailed design work

It's Important To Us That You Have

  • 3+ years of experience in an industry role
  • Programming experience in Rust (or C, C++), with competence in Python
  • Excellent communication and organizational skills, both written and verbal.
  • A high level of experience and understanding of version control; preferably git.
  • Comprehensive experience with UNIX-style systems.

It Would Be Great if You Had

  • Experience with modern machine learning, such as experience with a framework like Torch or implementation knowledge of architectures like CNNs, RNNS, and transformers
  • Experience with audio processing

Benefits & Perks*

Holistic Health

  • Medical, dental, vision benefits
  • Annual wellness stipend
  • Mental health support
  • Life, STD, LTD Income Insurance Plans

Work/Life Blend

  • Unlimited PTO
  • Generous paid parental leave
  • Flexible schedule
  • 12 Paid US company holidays
  • Quarterly personal productivity stipend
  • One-time stipend for home office upgrades
  • 401(k) plan with company match
  • Tax Savings Programs

Continuous Learning

  • Learning / Education stipend
  • Participation in talks and conferences
  • Employee Resource Groups
  • AI enablement workshops / sessions

* For candidates outside of the US, we use an Employer of Record model in many countries, which means benefits are administered locally and governed by country-specific regulations. Because of this, benefits will differ by region — in some cases international employees receive benefits US employees do not, and vice versa. As we scale, we will continue to evaluate where we can create more alignment, but a 1:1 global benefits structure is not always legally or operationally possible.

Backed by prominent investors including Y Combinator, Madrona, Tiger Global, Wing VC and NVIDIA, Deepgram has raised over $215M in total funding. If you're looking to work on cutting-edge technology and make a significant impact in the AI industry, we'd love to hear from you!

Deepgram is an equal opportunity employer. We want all voices and perspectives represented in our workforce. We are a curious bunch focused on collaboration and doing the right thing. We put our customers first, grow together and move quickly. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate.

We are happy to provide accommodations for applicants who need them.

+ Show Original Job Post
























Backend Engineer - Inference Services - Remote Eligible
Remote
$150,000 – 220,000 USD / year
Engineering
About Deepgram
Provides AI-powered speech recognition and audio intelligence APIs for real-time transcription, understanding, and analysis at scale.