About the Role

The ideal candidate has strong experience building and optimizing data infrastructure, working with cloud platforms, and enabling data-driven decision-making through robust pipeline architecture.

Responsibilities

Design and implement scalable data pipelines for ingestion and transformation
Collaborate with analysts and scientists to understand data requirements
Ensure data accuracy, consistency, and accessibility across platforms
Monitor and troubleshoot data workflows for performance issues
Develop and maintain documentation for data models and processes
Optimize data storage and query performance in cloud environments
Support the integration of new data sources into existing systems
Work with engineering teams to enforce data quality standards
Participate in code reviews and architecture discussions
Automate routine data operations and monitoring tasks
Contribute to the evolution of the data platform roadmap
Ensure compliance with data governance policies
Assist in defining schema standards and naming conventions
Evaluate and integrate new data tools and technologies
Respond to data incidents and implement corrective actions
Collaborate on data modeling for analytics and reporting
Support migration efforts between data systems
Maintain awareness of industry trends in data engineering
Implement access controls and security best practices
Work closely with product teams to enable data usage

Nice to Have

Experience with Apache Airflow or similar orchestration tools
Familiarity with dbt (data build tool)
Knowledge of data observability platforms
Experience with real-time data processing
Background in financial or transactional data domains
Exposure to machine learning pipelines
Contributions to open-source data projects
Certifications in cloud data services

Compensation

Competitive salary with benefits

Work Arrangement

Hybrid remote

Team

Small, agile data team focused on scalable solutions

Our Tech Stack

We use Google Cloud Platform as our primary infrastructure
Data pipelines are built with Apache Airflow, running on Cloud Composer
Our warehouse runs on BigQuery with dbt for transformation
We manage code via GitHub with full CI/CD integration
Monitoring is handled through Datadog and Cloud Logging

Growth Opportunities

Engineers are encouraged to lead initiatives and propose tooling changes
Regular internal tech talks and learning groups
Budget for conferences and training courses
Mentorship programs for career development
Opportunities to contribute to open-source projects

Available for qualified candidates

Thesis Inc. is hiring a Data Engineer (Mid level)

