Solar Turbines logo
Lead Data Scientist
full-timeSan Diego$128k - $192k

Summary

Location

San Diego

Salary

$128k - $192k

Type

full-time

Explore Jobs

About this role

Career Area:

Technology, Digital and Data

Job Description:

Your Work Shapes the World at Caterpillar Inc.

When you join Caterpillar, you're joining a global team who cares not just about the work we do – but also about each other.  We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here – we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.

Lead Data Scientist – Power the future with AI and Digital Twins

At Solar Turbines, we’re driving smarter decisions through data science and physics—and we’re looking for a Lead Data Scientist to help us push boundaries. In this role, you’ll interact with massive datasets, apply advanced machine learning and physics methods in Python and Matlab, and work on a team to develop core AI products that help our customers achieve industry leading reliability, performance and growth.

What You’ll Do

  • Lead data mining, modeling, and analysis of time series data, event data and meta data to build anomaly detection models to be used within InSight Platform.
  • Apply machine learning and physics principles to create, explore and validate hybrid digital twin models using lumped parameter ODE models, PINNs and NeuralODEs to approximate PDE model outputs.
  • Design and optimize algorithms to improve accuracy, numerical performance, and scalability.
  • Build compelling visualizations and present findings to stakeholders using tools like Quicksight or Plotly.
  • Collaborate across teams and business segments to define analytical requirements and deliver actionable solutions related to InSight Platform’s digital twin strategy.
  • Work with stakeholders, SMEs and customers to develop project plans and execution schedules.

Technical Expertise

  • Machine Learning:
    Experience applying ML algorithms and techniques to solve both physics and/or business problems. Proficient in Python, with knowledge of libraries like pandas, NumPy, SciPy, Scikit-learn, PyTorch, and SpaCy.
  • Statistical Analysis & Modeling:
    Solid understanding of hypothesis testing, and predictive modeling. Able to validate models using standard statistical measures and interpret results.
  • Physics & Engineering:
    Solid understanding of the physics of 1D heat transfer and fluid mechanics with ability to solve these types of problems using python or Matlab libraries.
  • Data Handling:
    Ability to write SQL statements for database access using SQL Alchemy and CX Oracle. Able to write and optimize queries across multiple tables and schemas.
  • Programming & Automation:
    Comfortable developing scripts and tools to automate digital twin workflows. Ability to use LLMs for code creation but familiar with structured programming practices and debugging techniques to ensure output is correct. 
  • Visualization & Communication:
    Excellent communication and visualization skills.  Need to feel comfortable creating presentations that make technical concepts easy for non-technical audience members to understand.

Analytical & Business Acumen

  • Analytical Thinking
    Above average problem-solving and critical thinking. Ability to take loosely defined requirements or statements of value and turn them into a structured problem that can be solved mathematically. Capable of comparing alternative solutions and making optimal decisions.

Educational Background:

  • Master of Science (MS) in Mechanical Engineering, Chemical Engineering, Applied Mathematics, or Physics is required, PhD preferred.

 Top candidates will also have:

  • At least 2 years of experience post education in mechanical engineering roles.
  • Many years of experience in engineering analysis, including structural, thermal, fluid, and/or performance calculations for the purposes modeling and simulation.
  • Many years of experience modeling physical systems starting with first principles equations and implementing ODEs or PDEs in Matlab, Python or similar programming languages.
  • Some experience with control theory and applications to dynamic systems and industrial machines.
  • Published technical papers at conferences or journals in the engineering and/or physics fields

Summary Pay Range:

$128,470.00 - $192,710.00

Compensation and benefits offered may vary depending on multiple individualized factors, job level, market location, job-related knowledge, skills, individual performance and experience. Please note that salary is only one component of total compensation at Caterpillar. 

Benefits:

Subject to plan eligibility, terms, and guidelines. This is a summary list of benefits.

  • Medical, dental, and vision benefits

  • Paid time off plan (Vacation, Holidays, Volunteer, etc.)

  • 401(k) savings plans

  • Health Savings Account (HSA)

  • Flexible Spending Accounts (FSAs)

  • Health Lifestyle Programs

  • Employee Assistance Program

  • Voluntary Benefits and Employee Discounts

  • Career Development

  • Incentive bonus

  • Disability benefits

  • Life Insurance

  • Parental leave

  • Adoption benefits

  • Tuition Reimbursement

       

* These benefits also apply to part-time employees

This position requires working onsite five days a week.

Visa Sponsorship is not available for this position. This employer is not currently hiring foreign national applicants that require or will require sponsorship tied to a specific employer, such as, H, L, TN, F, J, E, O. As a global company, Caterpillar offers many job opportunities outside of the U.S which can be found through our employment website at www.caterpillar.com/careers.

Posting Dates:

Any offer of employment is conditioned upon the successful completion of a drug screen.     

Caterpillar is an Equal Opportunity Employer, Including Veterans and Individuals with Disabilities.  Qualified applicants of any age are encouraged to apply.

Not ready to apply? Join our Talent Community.

Other facts

Tech stack
Machine Learning,Python,Statistical Analysis,Modeling,Physics,Engineering,Data Handling,Programming,Automation,Visualization,Communication,Analytical Thinking,Problem Solving,Critical Thinking,Predictive Modeling,SQL

About Solar Turbines

Headquartered in San Diego, California, USA, Solar Turbines Incorporated, a subsidiary of Caterpillar Inc., is one of the world’s leading manufacturers of industrial gas turbines, with more than 16,000 units and over 3 billion operating hours in over 100 countries. Products from Solar Turbines play an important role in the development of oil, natural gas and power generation projects around the world. Solar Turbines’ products include gas turbine engines (rated from 1,590 to 52,500 horsepower), gas compressors, and gas turbine-powered compressor sets, mechanical-drive packages and generator sets (ranging from 1 to 39 megawatts). Solar’s customers put the company’s products to work in many areas including production, processing and pipeline transmission of natural gas and crude oil and generation of electricity and thermal energy for processing applications, such as manufacturing chemicals, pharmaceuticals, and food products.

Solar’s foundation is people and Solar’s culture is one where individual contributions are valued, diversity in the workplace is encouraged, and safety is emphasized in all aspects of the business. Solar Turbines, founded in 1927, is comprised of a dedicated and multi-talented workforce of more than 8,000 employees with decades of experience working as a global team.

Team size: 5,001-10,000 employees
LinkedIn: Visit
Industry: Oil and Gas
Founding Year: 1927

What you'll do

  • Lead data mining, modeling, and analysis of various data types to build anomaly detection models. Collaborate across teams to define analytical requirements and deliver actionable solutions related to the digital twin strategy.

Ready to join Solar Turbines?

Take the next step in your career journey

Frequently Asked Questions

What does Solar Turbines pay for a Lead Data Scientist?

Solar Turbines offers a competitive compensation package for the Lead Data Scientist role. The salary range is USD 128k - 193k per year. Apply through Clera to learn more about the full compensation details.

What does a Lead Data Scientist do at Solar Turbines?

As a Lead Data Scientist at Solar Turbines, you will: lead data mining, modeling, and analysis of various data types to build anomaly detection models. Collaborate across teams to define analytical requirements and deliver actionable solutions related to the digital twin strategy..

Why join Solar Turbines as a Lead Data Scientist?

Solar Turbines is a leading Oil and Gas company. The Lead Data Scientist role offers competitive compensation.

Is the Lead Data Scientist position at Solar Turbines remote?

The Lead Data Scientist position at Solar Turbines is based in San Diego, California, United States. Contact the company through Clera for specific work arrangement details.

How do I apply for the Lead Data Scientist position at Solar Turbines?

You can apply for the Lead Data Scientist position at Solar Turbines directly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process. You can also learn more about Solar Turbines on their website.