About the Role
The role involves building and optimizing data infrastructure to enable advanced analytics and machine learning workflows, ensuring data is accessible, reliable, and efficiently processed.
Responsibilities
- Design and implement data pipelines for large-scale datasets
- Develop and maintain data models for analytical use cases
- Ensure data quality and integrity across systems
- Optimize data storage and retrieval performance
- Support data science and machine learning teams with structured data access
- Integrate data from diverse sources into unified systems
- Collaborate on schema design and database architecture
- Automate data ingestion and transformation workflows
- Monitor data pipeline health and resolve issues
- Apply best practices in data governance and metadata management
- Work with cloud-based data platforms and services
- Ensure scalability and reliability of data systems
- Document data architectures and technical designs
- Troubleshoot performance bottlenecks in data workflows
- Support compliance with data security standards
- Evaluate and integrate new data technologies
- Participate in agile development cycles
- Provide technical guidance on data modeling
- Assist in defining data standards and policies
- Collaborate with stakeholders to understand data needs
- Maintain version control for data pipeline code
- Implement monitoring and alerting for data jobs
- Contribute to data catalog development
- Ensure reproducibility of data processing steps
- Support deployment of machine learning models in production
Nice to Have
- Master’s degree in a technical field
- Experience in academic or research institutions
- Familiarity with machine learning model deployment
- Knowledge of data mesh or data fabric concepts
- Experience with streaming data platforms
- Contributions to open-source data projects
- Certifications in cloud data services
- Experience with data lineage tools
- Background in scientific computing
- Involvement in data engineering communities
Compensation
Salary commensurate with experience
Work Arrangement
Hybrid
Team
Collaborative environment with data scientists, researchers, and IT professionals
About the Unit
- This position is part of a research-focused unit that advances data-driven discovery and innovation across disciplines.
- The team works on interdisciplinary projects involving large, complex datasets and emerging analytical methods.
Application Instructions
- Applicants must submit a resume, cover letter, and contact information for three references.
- Review of candidates will begin immediately and continue until the position is filled.
Not specified