Clera - Your AI talent agent
LoginStart
Start
Meta logo
Meta

Research Scientist Intern, Audio, Machine Learning and Computer Vision (PhD)

internship•Redmond•$7k - $12k

Summary

Location

Redmond

Salary

$7k - $12k

Type

internship

Experience

2-5 years

Company links

WebsiteLinkedInLinkedIn

About this role

The Meta Reality Labs Research Team brings together a world-class team of researchers, developers, and engineers to create the future of virtual and augmented reality, which together will become as universal and essential as smartphones and personal computers are today. And just as personal computers have done over the past 45 years, AR, VR and MR will ultimately change everything about how we work, play, and connect. We are developing all the technologies needed to enable breakthrough AR glasses and VR headsets, including optics and displays, computer vision, audio, graphics, brain-computer interfaces, haptic interaction, eye/hand/face/body tracking, perception science, and true telepresence. Some of those will advance much faster than others, but they all need to happen to enable AR, VR and MR that are so compelling that they become an integral part of our lives. In particular, the Meta Reality Labs Research audio team is focused on two goals; creating virtual sounds that are perceptually indistinguishable from reality, and redefining human hearing. See more about our work here: https://tech.fb.com/inside-facebook-reality-labs-research-the-future-of-audio/. These two initiatives will allow us to connect people by allowing them to feel together despite being physically apart, and allow them to converse in even the most difficult listening environments. Meta Reality Labs Research is looking for experienced interns who are passionate about ground breaking research in audio signal processing, machine learning and audio visual learning to solve important audio-driven problems for AR/VR applications. We currently have open positions for a range of projects in multimodal representation learning, audio visual scene analysis, egocentric audio visual learning, multi-sensory speech enhancement and acoustic activity localization. Our internships are twelve (12) to twenty four (24) weeks long and we have various start dates throughout the year.

Responsibilities

  • Research, model, design, develop and test novel audio and speech processing algorithms using machine learning, signal processing, and computer vision
  • Collaborate with researchers and engineers across diverse disciplines
  • Design and implementation of novel algorithms to solve audio research problems
  • Experimental design, implementation, and execution to evaluate new audio technologies
  • Collaboration with other researchers across audio and acoustic engineering disciplines
  • Communication of research agenda, progress, and results


Minimum Qualifications

  • Currently has, or is in the process of obtaining, a PhD degree in the field of Computer Science, Artificial Intelligence, Signal Processing, Machine learning, Computer vision, Electrical Engineering, Applied Math, Acoustics Engineering or a related STEM field
  • 3+ years experience with Python, Matlab, or similar
  • 3+ years experience with machine learning software platforms such as PyTorch, TensorFlow, etc
  • 2+ years experience building novel computational models in audio or audio-visual or speech application domains using machine learning or signal processing
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment


Preferred Qualifications

  • Demonstrated software engineer experience via an internship, work experience, coding competitions, or widely used contributions in open source repositories (e.g. Github)
  • Strong background in statistical modeling techniques and / or signal processing
  • Proven track record of achieving results as demonstrated in accepted papers at top computer vision and machine learning related conferences such as CVPR, ECCV, NIPS, ICASSP, InterSpeech etc
  • Experience working and communicating cross functionally in a team environment
  • Intent to return to a degree-program after the completion of the internship/co-op


$7,650/month to $12,134/month + benefits

What you'll do

  • The intern will research, model, design, develop, and test novel audio and speech processing algorithms. They will collaborate with researchers and engineers across diverse disciplines to solve audio research problems.

About Meta

Meta's mission is to build the future of human connection and the technology that makes it possible. Our technologies help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. To help create a safe and respectful online space, we encourage constructive conversations on this page. Please note the following: • Start with an open mind. Whether you agree or disagree, engage with empathy. • Comments violating our Community Standards will be removed or hidden. Please treat everybody with respect. • Keep it constructive. Use your interactions here to learn about and grow your understanding of others. • Our moderators are here to uphold these guidelines for the benefit of everyone, every day. • If you are seeking support for issues related to your Facebook account, please reference our Help Center (https://www.facebook.com/help) or Help Community (https://www.facebook.com/help/community). For a full listing of our jobs, visit https://www.metacareers.com

Ready to join Meta?

Take the next step in your career journey

Frequently Asked Questions

What does Meta pay for a Research Scientist Intern, Audio, Machine Learning and Computer Vision (PhD)?

Toggle
Meta offers a competitive compensation package for the Research Scientist Intern, Audio, Machine Learning and Computer Vision (PhD) role. The salary range is USD 8k - 12k per year. Apply through Clera to learn more about the full compensation details.

What does a Research Scientist Intern, Audio, Machine Learning and Computer Vision (PhD) do at Meta?

Toggle
As a Research Scientist Intern, Audio, Machine Learning and Computer Vision (PhD) at Meta, you will: the intern will research, model, design, develop, and test novel audio and speech processing algorithms. They will collaborate with researchers and engineers across diverse disciplines to solve audio research problems..

Is the Research Scientist Intern, Audio, Machine Learning and Computer Vision (PhD) position at Meta remote?

Toggle
The Research Scientist Intern, Audio, Machine Learning and Computer Vision (PhD) position at Meta is based in Redmond, Washington, United States. Contact the company through Clera for specific work arrangement details.

How do I apply for the Research Scientist Intern, Audio, Machine Learning and Computer Vision (PhD) position at Meta?

Toggle
You can apply for the Research Scientist Intern, Audio, Machine Learning and Computer Vision (PhD) position at Meta directly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process.
Clera - Your AI talent agent
© 2026 Clera Labs, Inc.TermsPrivacyHelp

Join Clera's Talent Pool

Get matched with similar opportunities at top startups

This role is hosted on Meta's careers site.
Join our talent pool first to get notified about similar roles that match your profile.