State College, PA, USA On-site

Penn State is hiring a Data Engineer – Data Architecture for Data Science & Machine Learning

About the Role

The role involves building and optimizing data infrastructure to enable advanced analytics and machine learning workflows, ensuring data is accessible, reliable, and efficiently processed.

Responsibilities

  • Design and implement data pipelines for large-scale datasets
  • Develop and maintain data models for analytical use cases
  • Ensure data quality and integrity across systems
  • Optimize data storage and retrieval performance
  • Support data science and machine learning teams with structured data access
  • Integrate data from diverse sources into unified systems
  • Collaborate on schema design and database architecture
  • Automate data ingestion and transformation workflows
  • Monitor data pipeline health and resolve issues
  • Apply best practices in data governance and metadata management
  • Work with cloud-based data platforms and services
  • Ensure scalability and reliability of data systems
  • Document data architectures and technical designs
  • Troubleshoot performance bottlenecks in data workflows
  • Support compliance with data security standards
  • Evaluate and integrate new data technologies
  • Participate in agile development cycles
  • Provide technical guidance on data modeling
  • Assist in defining data standards and policies
  • Collaborate with stakeholders to understand data needs
  • Maintain version control for data pipeline code
  • Implement monitoring and alerting for data jobs
  • Contribute to data catalog development
  • Ensure reproducibility of data processing steps
  • Support deployment of machine learning models in production

Nice to Have

  • Master’s degree in a technical field
  • Experience in academic or research institutions
  • Familiarity with machine learning model deployment
  • Knowledge of data mesh or data fabric concepts
  • Experience with streaming data platforms
  • Contributions to open-source data projects
  • Certifications in cloud data services
  • Experience with data lineage tools
  • Background in scientific computing
  • Involvement in data engineering communities

Compensation

Salary commensurate with experience

Work Arrangement

Hybrid

Team

Collaborative environment with data scientists, researchers, and IT professionals

About the Unit

  • This position is part of a research-focused unit that advances data-driven discovery and innovation across disciplines.
  • The team works on interdisciplinary projects involving large, complex datasets and emerging analytical methods.

Application Instructions

  • Applicants must submit a resume, cover letter, and contact information for three references.
  • Review of candidates will begin immediately and continue until the position is filled.

Not specified

Required Skills
PostgreSQLMongoDBNeo4jRedisCassandraAmazon DynamoDBData ModelingETLData Pipelines
About company
Penn State
A university with a teaching, research, and service mission.
All jobs at Penn State Visit website
Job Details
Category data
Posted 7 months ago