Harbor is hiring a Data Engineer to modernize secure data infrastructure for a critical defense health program. Your work will enable real-time insights that support care delivery, operational readiness, and mission decision-making. You will design and scale data pipelines to transform raw information into actionable intelligence, ensuring interoperability across large-scale, distributed environments.
What You'll Do
- Build and maintain end-to-end data pipelines (Ingest → Transform → Expose) using Apache Airflow, Apache Spark, dbt, and Iceberg.
- Ingest and normalize structured and unstructured data (HL7, FHIR, PDF, JSON) for analytics and AI/ML use cases.
- Map datasets to FHIR and OMOP standards to enable interoperability and decision support.
- Implement schema versioning and governance to ensure traceability and audit-ready lineage.
- Collaborate with DevSecOps and Data Science teams to deliver AI-ready datasets for predictive analytics and readiness forecasting.
- Optimize data performance across distributed environments while ensuring compliance with DoD Responsible AI and NIST AI Risk Management frameworks.
What We're Looking For
- 4+ years of experience in Data Engineering or Analytics Engineering.
- Proficient in SQL and Python, with experience building modular, version-controlled transformations.
- Hands-on with Apache Airflow, Apache Spark, dbt, and data lake frameworks (Iceberg, Trino, Athena).
- Strong understanding of ETL/ELT design, data modeling, and governance in distributed systems.
- Familiar with AWS (Glue, S3, Lambda, Athena, EMR) and cloud-native data architectures.
- Excellent collaborator with experience in cross-functional environments (DevSecOps, Data Science, Security).
- Active DoD Secret Clearance (or higher).
- U.S. Based.
Nice to Have
- Experience with FHIR, OMOP, or HL7 data standards.
- Background in DoD, VA, or other federal health IT programs.
- Knowledge of Responsible AI and NIST AI Risk Management Frameworks.
- Familiarity with complex operational data systems or readiness analytics.
Technical Stack
- Apache Airflow, Apache Spark, dbt, Iceberg, Trino, Athena
- SQL, Python
- AWS Glue, AWS S3, AWS Lambda, AWS Athena, AWS EMR
- HL7, FHIR, OMOP
Team & Environment
You will work in a mission-driven, supportive, and inclusive team culture, collaborating cross-functionally with DevSecOps and Data Science teams to solve high-impact problems.
Benefits & Compensation
- Weekly Pay
- Paid Certifications
- Full Benefits (Medical/Dental/Vision)
- 401(k) 100% Match (up to 6%)
- Training & Career Growth
- Generous PTO
- Life, short- and long-term disability insurance
- Home office equipment plan
Work Mode
This is a remote position. Candidates must be located in the U.S.
Harbor is an equal opportunity employer.
