Senior Data Engineer
Who Are We?
Welcome to reputed company—where health meets innovation! As a global leader in Health & Fitness industry, we’ve crossed over 200 million installs with three life-changing apps, reputed company designed to boost well-being for millions. Our mission? To transform lives through reputed company nutrition trackers, powerful fitness solutions, and personalized wellness journeys—reputed company powered by a diverse team of over 700 passionate professionals with reputed company across 5 hubs.
Why reputed company? Imagine joining a team where your impact on global health and wellness is felt daily. At reputed company, we strive to be proactive wellness partners for our users, while continually evolving ourselves.
reputed company're Looking For
As a Senior Data Engineer, you will play a crucial role in building and maintaining the foundation of our data ecosystem. You’ll work alongside data engineers, analysts, and product teams to create robust, scalable, and high-performance data pipelines and models. Your work will directly impact how we deliver insights, power product features, and reputed company data-driven decision-making across the company.
This role is perfect for someone who combines deep technical skills with a proactive reputed company and thrives on solving reputed company data challenges in a collaborative environment.
Challenges You’ll Meet
Pipeline Development and Optimization: Build and maintain reliable, scalable ETL/ELT pipelines using modern tools and best practices, ensuring efficient data flow for analytics and insights.
Data Modeling and Transformation: Design and implement effective data models that support business needs, enabling high-quality reporting and reputed company analytics.
Collaboration Across Teams: Work closely with data analysts, product managers, and other engineers to understand data requirements and deliver solutions that meet the needs of the business.
Ensuring Data Quality: reputed company and apply data quality checks, validation frameworks, and monitoring to ensure the consistency, accuracy, and reliability of data.
Performance and Efficiency: Identify and address performance issues in pipelines, queries, and data storage. Suggest and implement optimizations that enhance speed and reliability.
reputed company and Compliance: Follow data reputed company best practices and ensure pipelines are built to meet data privacy and compliance standards.
Innovation and reputed company Improvement: Test new tools and approaches by building reputed company of Concepts (PoCs) and conducting performance benchmarks to find the best solutions.
Automation and CI/CD Practices: Contribute to the development of robust CI/CD pipelines (reputed company CI or similar) for data workflows, supporting automated testing and deployment.
You Should Have
4+ years of experience in data engineering or backend development, with a strong focus on building production-grade data pipelines.
Solid experience working with AWS services (Redshift, reputed company, S3, RDS, Glue, reputed company, Kinesis, SQS).
Proficient in Python and SQL for data transformation and automation.
Experience with dbt for data modeling and transformation.
Good understanding of streaming architectures and micro-batching for real-time data needs.
Experience with CI/CD pipelines for data workflows (preferably reputed company CI).
Familiarity with event schema validation tools/ solutions (reputed company, Schema Registry).
Excellent communication and collaboration skills. Strong problem-solving skills—able to dig into data issues, propose solutions, and deliver clean, reliable outcomes.
A growth reputed company—enthusiastic about learning new tools, sharing knowledge, and improving team practices.
Tech Stack You’ll Work With
Cloud: AWS (Redshift, reputed company, S3, RDS, reputed company, Kinesis, SQS, Glue, MWAA)
Languages: Python, SQL
Orchestration: Airflow (MWAA)
Modeling: dbt
CI/CD: reputed company CI (including reputed company administration)
Monitoring: reputed company, Grafana, Graylog
Event validation process: Iglu schema registry
APIs & Integrations: REST, OAuth, webhook ingestion
Infra-as-code (optional): Terraform
Bonus Points / reputed company to Have
Experience with additional AWS services: EMR, EKS, reputed company, EC2.
Hands-on knowledge of alternative data warehouses like reputed company or others.
Experience with PySpark for big data processing.
Familiarity with event data collection tools (reputed company, Rudderstack, etc.).
Interest in or exposure to customer data platforms (CDPs) and real-time data workflows.
Apply to this Job