Responsibilities
- Gather, preprocess, and consolidate individual patient data from electronic health records, laboratory systems, and third-party databases.
- Use tokenization methods to remove personally identifiable information while maintaining data usability for integration and analysis.
- Conduct or assist in expert-led de-identification to assess and reduce the risk of re-identification in shared datasets.
- Ensure adherence to HIPAA regulations and organizational privacy standards, especially in HISEC-compliant environments.
- Oversee secure data workflows and storage systems containing confidential patient data.
- Help create standardized procedures for securely aggregating and managing health data.
- Support the creation of de-identified datasets for use in research, analytics, or external collaborations.
- Track data accuracy, identify anomalies, and maintain thorough records of data sources and transformations.
- Assist with internal and external audits, security evaluations, and data governance initiatives.
- Build and deploy complete data aggregation pipelines aligned with business needs using Python and PySpark.
- Interpret analysis outcomes, formulate actionable insights, and present findings to stakeholders and project teams.
- Take ownership of data aggregation projects with limited oversight, including coordinating onshore or client communications, leading discussions, and preparing meeting agendas.
- Handle multiple concurrent assignments while consistently meeting project timelines.
Work Arrangement
Remote (Worldwide)
Other
Must be available to support US-based clients during American business hours, ensuring at least 3 to 4 hours of daily overlap.