AI Research Scientist, Human-AI Interaction

full-time•San Francisco

Summary

Location

San Francisco

Type

full-time

Experience

5-10 years

Company links

Website LinkedIn

About this role

About Handshake

Handshake is the career network for the AI economy. 20 million knowledge workers, 1,600 educational institutions, 1 million employers (including 100% of the Fortune 50), and every foundational AI lab trust Handshake to power career discovery, hiring, and upskilling, from freelance AI training gigs to first internships to full-time careers and beyond. This unique value is leading to unparalleled growth; in 2025, we tripled our ARR at scale.

Why join Handshake now:

Shape how every career evolves in the AI economy, at global scale, with impact your friends, family and peers can see and feel
Work hand-in-hand with world-class AI labs, Fortune 500 partners and the world’s top educational institutions
Join a team with leadership from Scale AI, Meta, xAI, Notion, Coinbase, and Palantir, among others
Build a massive, fast-growing business with billions in revenue

About the Role

As a Research Scientist, Human–AI Interaction, you will play a pivotal role in defining how AI systems support real human work by leading research at the intersection of Human–Computer Interaction (HCI), Large Language Models (LLMs), and task-level benchmarking.

You will operate at the frontier of human-centered AI evaluation, with a focus on understanding what people actually do to accomplish meaningful work—and how AI systems change, accelerate, or reshape that activity. Your research will define jobs-to-be-done benchmarks, comparative evaluation frameworks, and empirical methods for measuring human effort, time, quality, and outcomes when working with AI copilots. Additionally, the Handshake AI platform is an interface used by thousands of the top subject matter experts in the world to evaluate AI systems, and offers numerous interesting HCI / HITL-AI research questions that will drive large business impact.

You’ll set research direction, establish standards for measuring human activity in AI-mediated workflows, publish papers and open-source code, and lead the development of rigorous, scalable benchmarks that connect human work, AI assistance, and real economic value.

You will:

Lead high-impact research on jobs-to-be-done benchmarks for AI systems, including:
- Defining task taxonomies grounded in real professional and economic activities
- Identifying what constitutes meaningful task completion, quality, and success
- Translating qualitative work understanding into measurable, repeatable benchmarks
Develop methods to measure human activity in AI-mediated workflows
Design benchmarks to assess AI-as-a-collaborator/copilot, rather than autonomous agents / basic Q&A
Design and run empirical studies of how people use AI to solve tasks, including:
- Controlled experiments and field studies measuring task performance
- Instrumentation for capturing fine-grained interaction traces and outcomes
Drive strategy for professional-domain AI benchmarks, focusing on:
- Understanding domain-specific workflows (e.g., analysis, writing, planning, coordination)
- Grounding benchmark design in how work is actually performed, not idealized tasks
Build and prototype AI systems and evaluation infrastructure to support research and Data production, including:
- LLM-powered copilots and experimental tools used for task-level measurement
- Benchmark harnesses that evaluate both model behavior and human outcomes
- Data pipelines for analyzing human–AI interaction at scale
- The human-in-the-loop experience for Handshake fellows to produce effective evaluations and training data for frontier models, through structured UI/UX interactions with these models.
Collaborate closely with User Experience Research (UXR) to:
- Leverage deep qualitative insights into real user behavior and workflows
- Translate ethnographic and observational findings into formal research constructs

(This role is distinct from UXR and focuses on measurement, modeling, and evaluation.)

Publish and present research that advances the field of human-centered AI benchmarking, with an expectation of regular contributions to top-tier venues such as CHI (Conference on Human Factors in Computing Systems), and related HCI and AI conferences.

Desired Capabilities

PhD or equivalent experience in Human–Computer Interaction, Computer Science, Cognitive Science, or a related field, with a strong emphasis on empirical evaluation of interactive AI/LLM systems.
3+ years of academic or industry research experience post-PhD, including leadership on complex research initiatives and analyzing data from a real AI product.
Strong publication record, with demonstrated impact in top-tier HCI venues — CHI experience required.
Deep expertise in experimental design and measurement, particularly for:
- Task performance and human activity
- Comparative evaluation frameworks
- Mixed-methods research grounded in real-world behavior
Strong technical and coding skills, including:
- Python and data analysis / ML tooling
- Experience building experimental systems and benchmark infrastructure
- Familiarity working with LLM APIs, agent frameworks, or AI-assisted tooling
Proven ability to define and lead research agendas that connect human work, AI capability, and business or economic impact.
Strong collaboration skills, especially working across research, engineering, product, and UXR teams.

Extra Credit

Experience developing benchmarks or evaluation frameworks for human–AI systems or AI-assisted productivity tools.

Prior work on copilot-style systems, agentic workflows, or automation of professional tasks.
Familiarity with workplace studies, CSCW, or socio-technical systems research.
Contributions to open-source tools, datasets, or benchmarks related to task-level evaluation.
Interest in how AI reshapes labor, productivity, and the future of work.

Perks

Handshake delivers benefits that help you feel supported—and thrive at work and in life.

The below benefits are for full-time US employees.

🎯 Ownership: Equity in a fast-growing company

💰 Financial Wellness: 401(k) match, competitive compensation, financial coaching

🍼 Family Support: Paid parental leave, fertility benefits, parental coaching

💝 Wellbeing: Medical, dental, and vision, mental health support, $500 wellness stipend

📚 Growth: $2,000 learning stipend, ongoing development

💻 Remote & Office: Internet, commuting, and free lunch/gym in our SF office

🏝 Time Off: Flexible PTO, 15 holidays + 2 flex days, winter #ShakeBreak where our whole office closes for a week!

🤝 Connection: Team outings & referral bonuses

Explore our mission, values, and comprehensive US benefits at joinhandshake.com/careers .

What you'll do

Lead high-impact research on jobs-to-be-done benchmarks for AI systems and develop methods to measure human activity in AI-mediated workflows. Collaborate closely with User Experience Research to leverage insights into user behavior and translate findings into formal research constructs.

About Handshake

Ready to join Handshake?

Take the next step in your career journey

Frequently Asked Questions

What does a AI Research Scientist, Human-AI Interaction do at Handshake?

As a AI Research Scientist, Human-AI Interaction at Handshake, you will: lead high-impact research on jobs-to-be-done benchmarks for AI systems and develop methods to measure human activity in AI-mediated workflows. Collaborate closely with User Experience Research to leverage insights into user behavior and translate findings into formal research constructs..

Is the AI Research Scientist, Human-AI Interaction position at Handshake remote?

The AI Research Scientist, Human-AI Interaction position at Handshake is based in San Francisco, United States. Contact the company through Clera for specific work arrangement details.

How do I apply for the AI Research Scientist, Human-AI Interaction position at Handshake?

You can apply for the AI Research Scientist, Human-AI Interaction position at Handshakedirectly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process.

Handshake

AI Research Scientist, Human-AI Interaction

full-time•San Francisco

Summary

Location

San Francisco

Type

full-time

Experience

5-10 years

Company links

Website LinkedIn

About this role

About Handshake

Why join Handshake now:

Shape how every career evolves in the AI economy, at global scale, with impact your friends, family and peers can see and feel
Work hand-in-hand with world-class AI labs, Fortune 500 partners and the world’s top educational institutions
Join a team with leadership from Scale AI, Meta, xAI, Notion, Coinbase, and Palantir, among others
Build a massive, fast-growing business with billions in revenue

About the Role

You will:

Lead high-impact research on jobs-to-be-done benchmarks for AI systems, including:
- Defining task taxonomies grounded in real professional and economic activities
- Identifying what constitutes meaningful task completion, quality, and success
- Translating qualitative work understanding into measurable, repeatable benchmarks
Develop methods to measure human activity in AI-mediated workflows
Design benchmarks to assess AI-as-a-collaborator/copilot, rather than autonomous agents / basic Q&A
Design and run empirical studies of how people use AI to solve tasks, including:
- Controlled experiments and field studies measuring task performance
- Instrumentation for capturing fine-grained interaction traces and outcomes
Drive strategy for professional-domain AI benchmarks, focusing on:
- Understanding domain-specific workflows (e.g., analysis, writing, planning, coordination)
- Grounding benchmark design in how work is actually performed, not idealized tasks
Build and prototype AI systems and evaluation infrastructure to support research and Data production, including:
- LLM-powered copilots and experimental tools used for task-level measurement
- Benchmark harnesses that evaluate both model behavior and human outcomes
- Data pipelines for analyzing human–AI interaction at scale
- The human-in-the-loop experience for Handshake fellows to produce effective evaluations and training data for frontier models, through structured UI/UX interactions with these models.
Collaborate closely with User Experience Research (UXR) to:
- Leverage deep qualitative insights into real user behavior and workflows
- Translate ethnographic and observational findings into formal research constructs

(This role is distinct from UXR and focuses on measurement, modeling, and evaluation.)

Publish and present research that advances the field of human-centered AI benchmarking, with an expectation of regular contributions to top-tier venues such as CHI (Conference on Human Factors in Computing Systems), and related HCI and AI conferences.

Desired Capabilities

PhD or equivalent experience in Human–Computer Interaction, Computer Science, Cognitive Science, or a related field, with a strong emphasis on empirical evaluation of interactive AI/LLM systems.
3+ years of academic or industry research experience post-PhD, including leadership on complex research initiatives and analyzing data from a real AI product.
Strong publication record, with demonstrated impact in top-tier HCI venues — CHI experience required.
Deep expertise in experimental design and measurement, particularly for:
- Task performance and human activity
- Comparative evaluation frameworks
- Mixed-methods research grounded in real-world behavior
Strong technical and coding skills, including:
- Python and data analysis / ML tooling
- Experience building experimental systems and benchmark infrastructure
- Familiarity working with LLM APIs, agent frameworks, or AI-assisted tooling
Proven ability to define and lead research agendas that connect human work, AI capability, and business or economic impact.
Strong collaboration skills, especially working across research, engineering, product, and UXR teams.

Extra Credit

Experience developing benchmarks or evaluation frameworks for human–AI systems or AI-assisted productivity tools.

Prior work on copilot-style systems, agentic workflows, or automation of professional tasks.
Familiarity with workplace studies, CSCW, or socio-technical systems research.
Contributions to open-source tools, datasets, or benchmarks related to task-level evaluation.
Interest in how AI reshapes labor, productivity, and the future of work.

Perks

Handshake delivers benefits that help you feel supported—and thrive at work and in life.

The below benefits are for full-time US employees.

🎯 Ownership: Equity in a fast-growing company

💰 Financial Wellness: 401(k) match, competitive compensation, financial coaching

🍼 Family Support: Paid parental leave, fertility benefits, parental coaching

💝 Wellbeing: Medical, dental, and vision, mental health support, $500 wellness stipend

📚 Growth: $2,000 learning stipend, ongoing development

💻 Remote & Office: Internet, commuting, and free lunch/gym in our SF office

🏝 Time Off: Flexible PTO, 15 holidays + 2 flex days, winter #ShakeBreak where our whole office closes for a week!

🤝 Connection: Team outings & referral bonuses

Explore our mission, values, and comprehensive US benefits at joinhandshake.com/careers .

What you'll do

Lead high-impact research on jobs-to-be-done benchmarks for AI systems and develop methods to measure human activity in AI-mediated workflows. Collaborate closely with User Experience Research to leverage insights into user behavior and translate findings into formal research constructs.

About Handshake

Ready to join Handshake?

Take the next step in your career journey

Frequently Asked Questions

What does a AI Research Scientist, Human-AI Interaction do at Handshake?

Is the AI Research Scientist, Human-AI Interaction position at Handshake remote?

The AI Research Scientist, Human-AI Interaction position at Handshake is based in San Francisco, United States. Contact the company through Clera for specific work arrangement details.

AI Research Scientist, Human-AI Interaction

Summary

Location

Type

Experience

Company links

About this role

About Handshake

About the Role

You will:

Desired Capabilities

Extra Credit

Experience developing benchmarks or evaluation frameworks for human–AI systems or AI-assisted productivity tools.

Perks

What you'll do

About Handshake

Ready to join Handshake?

Frequently Asked Questions

What does a AI Research Scientist, Human-AI Interaction do at Handshake?

Is the AI Research Scientist, Human-AI Interaction position at Handshake remote?

How do I apply for the AI Research Scientist, Human-AI Interaction position at Handshake?

AI Research Scientist, Human-AI Interaction

Summary

Location

Type

Experience

Company links

About this role

About Handshake

About the Role

You will:

Desired Capabilities

Extra Credit

Experience developing benchmarks or evaluation frameworks for human–AI systems or AI-assisted productivity tools.

Perks

What you'll do

About Handshake

Ready to join Handshake?

Frequently Asked Questions

What does a AI Research Scientist, Human-AI Interaction do at Handshake?

Is the AI Research Scientist, Human-AI Interaction position at Handshake remote?

How do I apply for the AI Research Scientist, Human-AI Interaction position at Handshake?

Join Clera's Talent Pool

Join Clera's Talent Pool