Meta is seeking AI research engineers to help us build the data foundation for Meta's most advanced Large Language Models. We're looking for engineers with LLM expertise to join us on working with data at scale and to push beyond the data ceiling.
Our team contributes to data curation across all stages of LLM development (pre-training, mid-training, post-training) and all domains/modalities (e.g., web, code, agent, multilingual). We tackle the hardest challenges at trillion-scale, including organic data curation, synthetic data generation, agent and interaction data, and frontier paradigms that redefine what's possible.
Based in Meta Superintelligence Labs (MSL) within the Fundamental AI Research Organization (FAIR), you'll directly contribute to Meta’s frontier models like Llama, while having the chance to collaborate with researchers and engineers across MSL.
Responsibilities
Collaborate with cross-functional teams to develop Meta’s next foundational models
Architect efficient and scalable data curation systems and pipelines
Fundamentally improve our data velocity across workflows and projects by contributing to the advancement of data tooling
Execute on high priority projects in pre-training, mid-training, or post-training data curation
Apply specialized expertise in agentic data, synthetic data, reasoning data, web parser, coding data, data scaling laws, or datamix optimization
Lead complex technical projects end-to-end
Minimum Qualifications
Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
2+ years of industry research experience in LLM/NLP or related AI/ML models
Experience as a formal technical lead, leading major technical initiatives with cross-functional impact, and/or influencing strategy across multiple teams
Practical experience with pre-training or mid-training data curation for large foundational models and experience working with organic, synthetic, agentic, or reasoning data for LLMs
Demonstrated data infrastructure and software background, and experience building data tooling and services
Published research in leading peer-reviewed conferences (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP) and/or demonstrated significant industry influence in the field of AI
Preferred Qualifications
Experience working on frontier-quality/state-of-the-art Large Language Models
Masters degree or PhD in Computer Science or a related technical field
Hands-on experience with modeling frameworks like PyTorch
Hands-on experience on SQL and large-scale data handling, with familiarity of frameworks like Spark and Hive
$88.46/hour to $257,000/year + bonus + equity + benefits
What you'll do
Collaborate with cross-functional teams to develop foundational models and architect scalable data curation systems. Lead complex technical projects and improve data velocity across workflows.
About Meta
Meta's mission is to build the future of human connection and the technology that makes it possible.
Our technologies help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology.
To help create a safe and respectful online space, we encourage constructive conversations on this page. Please note the following:
• Start with an open mind. Whether you agree or disagree, engage with empathy.
• Comments violating our Community Standards will be removed or hidden. Please treat everybody with respect.
• Keep it constructive. Use your interactions here to learn about and grow your understanding of others.
• Our moderators are here to uphold these guidelines for the benefit of everyone, every day.
• If you are seeking support for issues related to your Facebook account, please reference our Help Center (https://www.facebook.com/help) or Help Community (https://www.facebook.com/help/community).
For a full listing of our jobs, visit https://www.metacareers.com
Ready to join Meta?
Take the next step in your career journey
Frequently Asked Questions
What does Meta pay for a Research Engineer, Text Data Research - MSL FAIR?
Meta offers a competitive compensation package for the Research Engineer, Text Data Research - MSL FAIR role. The salary range is USD 9k - 257k per year. Apply through Clera to learn more about the full compensation details.
What does a Research Engineer, Text Data Research - MSL FAIR do at Meta?
As a Research Engineer, Text Data Research - MSL FAIR at Meta, you will: collaborate with cross-functional teams to develop foundational models and architect scalable data curation systems. Lead complex technical projects and improve data velocity across workflows..
Is the Research Engineer, Text Data Research - MSL FAIR position at Meta remote?
The Research Engineer, Text Data Research - MSL FAIR position at Meta is based in Menlo Park, California, United States. Contact the company through Clera for specific work arrangement details.
How do I apply for the Research Engineer, Text Data Research - MSL FAIR position at Meta?
You can apply for the Research Engineer, Text Data Research - MSL FAIR position at Meta directly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process.
Join Clera's Talent Pool
Get matched with similar opportunities at top startups
This role is hosted on Meta's careers site. Join our talent pool first to get notified about similar roles that match your profile.