Software Engineer, Infrastructure

Singapore
The MRS ML Infra team will focus on ML infra performance and efficiency for large-scale AI training and inference workflows in the recommendation domain. In this role, the engineer optimizes the end-to-end (e2e) stack for training and inference of large-scale recommendation models. Opportunities range from distributed systems to model/system co-design to GPU system optimizations. We are looking for someone with prior experience in high-performance infrastructure and performance optimization. The candidate must not only identify and lead execution on short- and mid-term perf/efficiency opportunities, but also drive long-term strategy in areas such as model/system co-design and performance automation.
Software Engineer, Infrastructure Responsibilities
  • Drive performance and efficiency optimizations hands-on by identifying and delivering large optimizations across MRS models and systems.
  • Drive cross-functional (XFN) collaboration and alignment with multiple partner or product ML teams.
  • Lead the technical direction and roadmap for the SGP perf and efficiency team.
  • Provide mentorship and guidance to grow junior engineers on the team.
Minimum Qualifications
  • BS/MS in Electrical Engineering, Computer Science or a related field or equivalent experience.
  • 7+ years of experience in AI infra or system performance.
  • Hands-on experience with deep system performance optimization, for example, distributed systems, high-performance GPU systems, or memory/cache optimizations.
  • Strong written and verbal communication skills to align cross-functional partners and drive team execution.
  • Prior experience mentoring and growing junior engineers as either a tech lead or a manager.
  • Strong debugging skills in complex systems that span multiple components or sub-systems.
Preferred Qualifications
  • Hands-on experience with large-scale AI infra systems (for example, GPU training systems).
  • Experience with training and inference for large models such as LLMs or recommendation models.
  • Experience in high-performance computing, including communication optimization, CUDA kernel optimization, and distributed training and inference.
About Meta
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.
Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com.
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. We may use your information to maintain the safety and security of Meta, its employees, and others as required or permitted by law. You may view Meta Pay Transparency Policy, Equal Employment Opportunity is the Law notice, and Notice to Applicants for Employment and Employees by clicking on their corresponding links. Additionally, Meta participates in the E-Verify program in certain locations, as required by law.

Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, you may contact us at accommodations-ext@fb.com.