Responsibilities
- Architect and manage scalable ETL and ELT workflows for large-scale data platforms
- Develop and sustain data systems using Snowflake, Python, and AWS technologies
- Write complex SQL queries with CTEs, window functions, and optimization strategies
- Utilize Snowflake features such as warehouses, tasks, streams, and data pipelines for efficient processing
- Implement and manage data transformation processes using dbt
- Build datasets for reporting and enable dashboard creation with Amazon QuickSight
- Conduct data validation and integrity checks to ensure accuracy and consistency
- Troubleshoot and resolve data quality issues through root cause analysis
- Integrate and orchestrate data across AWS services including S3, DynamoDB, Glue, Lambda, and Rockset
- Collaborate with analytics, product, and clinical teams to meet reporting needs
- Monitor data pipeline performance and resolve production-level data problems