Job Description:
Summary:
The New York Mets are seeking a Data Engineer on the Data Engineering Technology team. This role requires hands on experience in ingesting, processing, warehousing, and distributing both structured and unstructured sources of baseball data. You will join a global team (NY and Poland) of data engineering/support professionals to provide best in class end to end data and engineering support to Baseball Operations, including but not limited to, Analytics and Systems teams. Prior experience in or knowledge of baseball is a plus but is not required.
Essential Duties & Responsibilities:
Ensure that all production data sources are ingested, processed, and distributed to applications and users at set frequencies including real time in game data
Onboard new datasets and technologies for trials
Troubleshoot and communicate any data ingestion and/or quality issues to all stakeholders
Design, code, test and roll out of new datasets, as well as enhancements and bug fixes for existing datasets
Support quantitative analysts in Baseball Analytics with productionization and maintenance of predictive models
Build and manage Data Model and Data Domains to keep data clean, accurate and well organized
Deploy and manage data quality solutions (e.g., scalable check framework, monitoring dashboard) to ensure accuracy, integrity, and proactive monitoring of information
Help maintain a data catalog for efficient data discovery
Collaborate with global team of offshore Data Engineering/Support professionals on data engineering and support requests/issues
Able to work flexible schedule during baseball season to ensure in game support for users
Coordinate with IT and Infrastructure team to ensure a robust cloud development and production environment
Qualifications:
BS degree in Computer Science or a related field
2+ years’ experience in data engineering and data operations/support roles
Comfort level with both on-prem and cloud (GCP experience required) environments
Technical skills (Python, SQL, Linux) in working with ingesting, processing, and distributing large scale (both structured and unstructured) data sets
Experience building data structures and data pipelines in the cloud, preferably GCP & on prem
Experience in using modern Software Development Life Cycle (SDLC) and DevOps tools from development to production e.g., Terraform, Cloud Build, BitBucket
Ability to provide prompt support and resolution of data issues
Experience in implementing scalable data quality solutions
Knowledge of big data frameworks (e.g., Spark/Databricks), Airflow, Dataflow, Pub/Sub, etc.
Ability to deliver superior customer experience through continuous process improvement
Strong analytical skills and ability to work well in a collaborative and fast paced environment
The above information is intended to describe the general nature, type, and level of work to be performed. The information is not intended to be an exhaustive or complete list of all responsibilities, duties, and skills required for this position. Nothing in this job description restricts management’s right to assign or reassign duties and responsibilities to this job at any time. The individual selected may perform other related duties as assigned or requested.
The New York Mets recognize the importance of a diverse workforce and value the unique qualities individuals of various backgrounds and experiences can offer to the Organization. Our continued success depends heavily on the quality of our workforce. The Organization is committed to providing employees with the opportunity to develop to their fullest potential.
Salary Range: $75,000 - $90,000
For technical reasons, we strongly advise to not use an .edu email address when applying. Thank you very much.
Take the next step in your career journey
Get matched with similar opportunities at top startups
This role is hosted on New York Mets's careers site.
Join our talent pool first to get notified about similar roles that match your profile.