Job Description
Summary
We are seeking a passionate and experienced Data Engineer to play a pivotal role in revolutionizing how we process and use substantial datasets as the heart of Siri, Search and Machine Learning. You will be instrumental in data processing framework, powered by technologies such as Spark or Iceberg.
You will collaborate closely with teams with varied strengths (i.e. Data Scientists and Analysts, other Engineering teams, Machine Learning teams) to transform massive data into valuable, actionable datasets. You will also build data processing platform that fuel our innovative features and future machine learning area.
Description
As a Data Engineer, you will be at the forefront of designing and implementing a robust data processing framework to streamline machine learning data pipelines, applying your expertise in Spark and Python. In this collaborative role, you'll partner closely with the Machine Learning teams to design solutions that process data or apply machine learning models, which drive innovation.
Your work will focus on optimizing performance, ensuring data quality, and contributing to a long-term vision that extends to support innovative machine learning applications. We're looking for someone who thrives on tackling data challenges at scale and being adaptable.
Minimum Qualifications
- Demonstrated expertise in large-scale data processing, with a strong background of working with Spark and Python or PySpark.
- Understanding of distributed computing principles, data engineering and DevOps standard processes.
- Proven programming skills in Python and Java.
- A genuine passion for working with data and solving complex problems at scale, in cloud platforms (AWS, GCP).
- Experience with machine learning and data mining.
- B.S.degree in Computer Science.
Preferred Qualifications
- Ability to build strong relationships, influence decision-making, and drive projects in a fast-paced environment.
- Growth mindset and ability to learn new technologies
- PhD or MS in Computer Science.
- Exposure to Machine Learning