Lead Data Engineer with Scala

Tokyo

Company Social & Media:

Minor Hotels Europe and Americas

About the Company

Capgemini is a global business and technology transformation partner with over 340,000 professionals in more than 50 countries. The company helps organizations accelerate digital and sustainable transformation, delivering end-to-end services from strategy and design to engineering.

Capgemini combines expertise in AI, generative AI, cloud, and data with deep industry knowledge and a strong partner ecosystem to create tangible impact for clients and society. The organization fosters a collaborative, diverse, and inclusive environment that empowers employees to grow and innovate.

About the Role

The Lead Data Engineer collaborates with users and project teams to design and implement scalable, robust data solutions that meet functional and non-functional requirements. This role focuses on building advanced data pipelines, reusable frameworks, and automation to support enterprise-level data operations.

Key responsibilities include:

  • Collaborate with users and project teams to define requirements and expectations
  • Analyze business requirements and design data solutions that meet both functional and non-functional needs
  • Design and implement highly scalable data pipelines using Scala, Spark, or Java for processing very large datasets
  • Create and maintain reusable frameworks for data ingestion, validation, normalization, and transformation
  • Develop scripts for test automation, CI/CD, data migrations, and data validation
  • Provide technical support for incident recovery
  • Ensure adherence to coding standards, design principles, and best practices

Requirements

  • 3 to 6 years of experience as a developer or data engineer
  • Strong programming skills with experience in Scala, Java, or Python
  • Deep understanding of distributed systems such as Hadoop and Spark, including optimization
  • Experience with public cloud technologies, preferably Azure
  • Good knowledge of Oracle, MS-SQL Server, Linux, and networking
  • Proficient in SQL, shell scripting, Git, unit testing, and CI/CD tools
  • Strong learning ability and problem-solving skills
  • Excellent communication, presentation, and interpersonal skills

Preferred Qualifications

  • Understanding of REST API, NoSQL, microservices, and ETL processes
  • Hands-on experience with cloud services
  • Knowledge of streaming data processing and change data capture
  • Experience with BI tools
  • Business-level proficiency in Japanese (optional)
  • Business-level proficiency in English
  • Detail-oriented and precise, capable of independently designing and developing complex systems
  • Able to explain technical concepts clearly to non-technical stakeholders
  • Focused on delivery, fast learner, and adaptable across domains
  • Eager to learn new technologies and take on challenging work
  • Collaborative team player

Benefits

  • Opportunity to work on cutting-edge technologies in AI, cloud, and data
  • Exposure to large-scale, enterprise-level data projects
  • Global collaborative environment with diverse teams
  • Professional growth through challenging and varied assignments
  • Supportive culture fostering learning and innovation

Complete details about this role can be found on the official website below: