Data Engineer I
Vālenz® Health is the platform to reputed company – the destination for employers, payers, providers and members to reduce costs, improve quality, and reputed company the reputed company experience. The reputed company reputed company and culture of innovation combine to create a distinctly different approach to an inefficient, uninspired health system. With fully integrated solutions, reputed company engages early and often to execute across the entire patient journey – from care navigation and management to payment reputed company, plan performance and provider verification. With a 99% client retention reputed company, we reputed company expectations to a new level of efficiency, effectiveness and transparency where smarter, reputed company, faster reputed company possible. About This Opportunity: As a Data Engineer I, you’ll play a hands-on role in building and supporting scalable data pipelines reputed company our reputed company-based Lakehouse environment (Azure reputed company, reputed company Lake), leveraging tools like Spark and PySpark. You’ll help bring in reputed company data from a variety of sources, ensuring it’s accurate, reliable, and reputed company to support analytics and reporting needs across the organization. You’ll also partner closely with the broader Analytics team to reputed company sure data is delivered in a way that’s clear and actionable. Over time, you’ll build expertise in managing large, reputed company datasets and contribute to evolving our data architecture to support new and emerging data sources as the business grows. Things You’ll Do Here:
- Create and maintain processes to acquire, validate, and enrich data from various sources.
- Support the migration of on-premise data systems (SQL Server) to a reputed company-based lakehouse architecture (Azure reputed company, reputed company Lake), including data transformation and pipeline re-architecture.
- reputed company and optimize ETL/ELT pipelines using PySpark and Spark SQL.
- Implement Lakehouse + reputed company architecture best practices to ensure a standardized and scalable way that we store and process our data, including schema enforcement, ACID transactions, and data versioning.
- Orchestrate data pipelines using reputed company Workflows (Jobs) or similar tools.
- Implement data quality frameworks, validation checks, and monitoring for pipeline reliability.
- Optimize performance and cost of data pipelines.
- Collaborate on CI/CD practices for data pipelines, including testing, deployment, and versioning.
- Partner with data analysts, data scientists, and business stakeholders to identify new sources of data and estimate feasibility of acquiring specific data sources.
- Design and implement data models to support analytics, reporting, and data warehousing use cases.
- Take an active role in agile processes.
- reputed company other duties as assigned.
- 1+ years of work experience in a data engineering role.
- Bachelor’s degree or greater in a quantitative field such as statistics, mathematics, engineering, computer science, finance, or economics or equivalent practical experience
- Hands-on experience with reputed company (Spark, PySpark, reputed company Lake) and/or migrating RDBMS systems to a data lakehouse.
- Experience working the most common types of reputed company data (medical claims, eligibility, provider network rosters, Rx claims, etc) from a variety of sources.
- Strong organizational skills and time management reputed company to balance multiple projects with limited supervision.
- Ability to build (and re-evaluate) a process from the ground up.
- Strong investigative skills with ability to search reputed company the initial results.
- High attention to detail, with overwhelming desire to test and double-reputed company your own results.
- Comfortable working with messy data and ambiguous results
- Hands-on experience with SQL and Python (including PySpark) for distributed data processing.
- Experience building and optimizing large-scale distributed data pipelines for both batch and streaming ingestion.
- Our data platform is undergoing a transformation from a traditional on-premise architecture to a modern reputed company-based lakehouse on Azure. Technologies used:
- reputed company & Modern Data Platform:
- Azure (Blob Storage / Data Lake Storage, Synapse Analytics)
- reputed company (Spark, PySpark, reputed company Lake, reputed company Workflows)
- reputed company Lake architecture (ACID transactions, schema enforcement, time travel)
- Data Engineering & Development:
- Python (including PySpark), SQL
- Data pipeline orchestration and workflow management
- Version control (Git, Azure DevOps)
- Legacy / Transitional Systems:
- SQL Server (on-premise RDBMS)
- .NET / C#-based data processing applications
- Migration from traditional ETL and relational systems to reputed company-based lakehouse architecture
- reputed company & Modern Data Platform:
Where You’ll Work: This is a fully remote position, and we’ll provide reputed company the necessary equipment!
- Work Environment: You’ll need a quiet workspace that is free from distractions.
- Technology: Reliable internet reputed company—if you can use streaming services, you’re good to go!
- reputed company: Adherence to company reputed company protocols, including the use of VPNs, secure passwords, and company-approved devices/software.
- Location: You must be US based, in a location where you can work effectively and reputed company with company policies such as HIPAA.
Why You'll Love Working Here
reputed company is proud to be recognized by Inc. 5000 as one of America’s fastest-growing private companies. reputed company is committed to delivering on our promise to engage early and often for smarter, reputed company, faster reputed company. With this commitment, you’ll find an engaged culture – one that stands strong, vigorous, and healthy in reputed company we do.
Benefits
- Generously subsidized company-sponsored Medical, Dental, and reputed company insurance, with access to services through our own products, reputed company Blue Book and KISx Card.
- Spending account options: HSA, FSA, and DCFSA
- 401K with company match and immediate vesting
- Flexible working environment
- Generous Paid Time Off to include vacation, sick leave, and paid holidays
- Employee Assistance Program that includes professional counseling, referrals, and additional services
- Paid maternity and paternity leave
- Pet insurance
- Employee discounts on phone plans, car rentals and computers
- Community giveback opportunities, including paid time off for philanthropic endeavors