Senior Software Engineer - Computer Vision/Machine Learning for Human Understanding
We're starting to see the incredible potential of multimodal foundation and large language models, and many applications in the computer vision and machine learning domain that previously appeared infeasible are now within reach. We are looking for a highly motivated and skilled Senior Software Engineer to join our team in the Video Computer Vision group and help us enable that potential for realtime human understanding on Apple devices. The Video Computer Vision org has pioneered human-centric real-time features such as FaceID, FaceKit, and Gaze and Hand gesture control which have changed the way millions of users interact with their devices. We balance research and product requirements to deliver Apple quality, pioneering experiences, innovating through the full stack, and partnering with HW, SW and AI teams to shape Apple's products and bring our vision to life.
Description
You'll work on ground breaking projects to advance our AI and computer vision capabilities for human understanding. You have a strong background in implementing and optimizing CV/ML algorithms and efficiently running foundation and language models on device. You'll have the opportunity to collaborate with multi-functional teams, including researchers, data scientists, software engineers, human interface designers and application domain experts. You'll stay up-to-date on the latest advancements in AI, machine learning, and computer vision and apply this knowledge to drive innovation within the company.
Responsibilities
- Designing robust, extensible architectures and APIs for engineering partners.
- Developing applications and tools for algorithm evaluation, prototyping, and technology demonstrations.
- Supporting teams across Apple by integrating ML and CV components into production systems.
- Advocating for engineering excellence, code quality, thorough test suites, and long-term maintainability.
Minimum Qualifications
- Experience with Robust API Design: Proven ability to design developer-facing APIs with a clear understanding of architectural tradeoffs, design patterns, and anti-patterns. Strong intuition for maintainability and extensibility. Exceptional debugging and performance optimization skills.
- Track record of multi-functional collaboration and product delivery: Demonstrated success delivering high-performance, production-quality code in collaborative, multi-disciplinary environments.
- Experience with iOS/macOS development: Familiarity with Swift, SwiftUI, modern concurrency (e.g., structured concurrency with async/await), and Apple system frameworks such as Cocoa/Cocoa Touch, Core ML, Metal, and Accelerate.
- Foundational understanding of machine learning: Solid grasp of ML algorithms and development pipelines, with the ability to work effectively with ML practitioners and integrate ML components into production systems.
Preferred Qualifications
- Experience building internal developer tools: Hands-on experience developing tools such as test data visualization systems, debugging enhancements, and robust unit/integration testing frameworks to support engineering workflows.
- Experience with live camera streaming applications: Understanding of real-time video pipelines, image transformations, and rendering loops.
- Experience integrating on-device CV/ML algorithms: Familiarity with common computer vision techniques (e.g., object detection, segmentation, tracking, pose estimation), sequence models for real-time inference and FMs/LLMs optimized for on-device performance.
- Experience with scene understanding and AR frameworks: Exposure to ARKit, RealityKit, or other 3D perception pipelines (e.g., LiDAR-based depth fusion, spatial tracking).
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.