About the Role
The ideal candidate will have strong experience in building and optimizing data infrastructure, working with cloud platforms, and enabling data-driven decision-making through robust pipeline architecture.
Responsibilities
- Design and implement scalable data pipelines for ingestion and transformation
- Collaborate with analysts and scientists to understand data requirements
- Ensure data accuracy, consistency, and accessibility across platforms
- Monitor and troubleshoot data workflows for performance issues
- Develop and maintain documentation for data models and processes
- Optimize data storage and query performance in cloud environments
- Support the integration of new data sources into existing systems
- Work with engineering teams to enforce data quality standards
- Participate in code reviews and architecture discussions
- Automate routine data operations and monitoring tasks
- Contribute to the evolution of the data platform roadmap
- Ensure compliance with data governance policies
- Assist in defining schema standards and naming conventions
- Evaluate and integrate new data tools and technologies
- Respond to data incidents and implement corrective actions
- Collaborate on data modeling for analytics and reporting
- Support migration efforts between data systems
- Maintain awareness of industry trends in data engineering
- Implement access controls and security best practices
- Work closely with product teams to enable data usage
Nice to Have
- Experience with Apache Airflow or similar orchestration tools
- Familiarity with dbt (data build tool)
- Knowledge of data observability platforms
- Experience with real-time data processing
- Background in financial or transactional data domains
- Exposure to machine learning pipelines
- Contributions to open-source data projects
- Certifications in cloud data services
Compensation
Competitive salary with benefits
Work Arrangement
Hybrid remote
Team
Small, agile data team focused on scalable solutions
Our Tech Stack
- We use Google Cloud Platform as our primary infrastructure
- Data pipelines are built with Apache Airflow and Cloud Composer
- Our warehouse runs on BigQuery with dbt for transformation
- We manage code via GitHub with full CI/CD integration
- Monitoring is handled through Datadog and Cloud Logging
Growth Opportunities
- Engineers are encouraged to lead initiatives and propose tooling changes
- Regular internal tech talks and learning groups
- Budget for conferences and training courses
- Mentorship programs for career development
- Opportunities to contribute to open-source projects
Available for qualified candidates
