Boston, Massachusetts, United States Hybrid

Harvard University is hiring a Senior Data Scientist

Harvard University is looking for a Senior Data Scientist to lead comprehensive applications and web development for complex projects within Harvard Business School (HBS). You will play a pivotal role in developing innovative generative AI products that support the HBS Foundry team and learners on our platform.

What You'll Do

  • Design and implement efficient data pipelines for data collection, cleaning, labeling, and preprocessing.
  • Ensure quality of datasets tailored for LLM training and build data science pipelines from feature generation to model evaluation.
  • Design, develop, deploy, and implement multi-agent AI tools that assess venture-related submissions, generate materials, and simulate entrepreneurial interactions.
  • Design generative AI models using techniques like natural language processing and machine learning.
  • Design an AI-powered system that collects key factors to determine a learner’s profile and provides customized learning pathways.
  • Integrate AI tools into the HBS Foundry platform in a seamless manner for user access.
  • Collaborate on technical vision to accelerate product impact through cutting-edge LLM innovations.
  • Build guardrails, compliance rules, and oversight workflows into the GenAI tools and Foundry platform.
  • Continually improve AI tools using feedback from users.
  • Monitor, debug, track, and resolve production issues.
  • Work with the Foundry Project Director to ensure projects proceed on time and on budget.
  • Collaborate with Product Managers to track performance KPIs and prioritize improvements.
  • Present prototypes and progress updates to stakeholders.
  • Stay up to date on the latest advances in AI/ML research and identify new techniques for the platform.
  • Collaborate with a team of engineers, designers, and subject matter experts.
  • Build trust and collaboration by being present on-site and engaging directly with colleagues.

What We're Looking For

  • Minimum of seven years’ post-secondary education or relevant work experience.
  • Advanced degree in computer science, data science, engineering, or a related field within AI/ML.

Nice to Have

  • 3+ years of experience building Generative AI models and tools.
  • Experience working in a similar role in a startup environment.
  • Expertise in NLP, deep learning, and other relevant techniques.
  • Strong programming skills in Python and frameworks like TensorFlow.
  • Experience building advanced workflows such as retrieval augmented generation, model chaining, dynamic prompting, PEFT/SFT, etc. using Langchain and similar tools.
  • Experience integrating AI into consumer-facing applications.
  • Experience establishing model guardrails and developing bias detection and mitigation techniques for AI applications using tools like NeMo.
  • Experience with various embedding models and setting up and tuning vector databases.
  • Experience working with a variety of relational SQL and NoSQL databases, big data tools (such as Hadoop, Spark, Kafka), and at least one cloud provider solution (AWS highly preferable).
  • Experience in using version control systems such as Git.
  • Experience with containerization and orchestration tools like Docker and Kubernetes.
  • Experience with API development and integration for AI services.
  • Understanding of and experience with software engineering best practices like version control, CI/CD, and containerization.
  • Proficiency with various prompting techniques, with a clear understanding of tradeoffs between prompting and finetuning.
  • Experience with finetuning embedding models and tuning vector databases.
  • Experience operationalizing end-to-end machine learning applications.
  • Excellent communication and collaboration skills.
  • Passion for using AI to solve real-world problems.

Technical Stack

  • Python, TensorFlow, Langchain, NeMo, Hadoop, Spark, Kafka, AWS, SQL, NoSQL, Git, Docker, Kubernetes

Team & Environment

You will be part of the Foundry team, collaborating with engineers, designers, and subject matter experts, and reporting to the Foundry Project Director.

Benefits & Compensation

  • Generous paid time off including parental leave.
  • Medical, dental, and vision health insurance coverage starting on day one.
  • Retirement plans with university contributions.
  • Wellbeing and mental health resources.

Work Mode

This is a hybrid position based in Boston, MA.

Harvard University is dedicated to creating a diverse and welcoming environment where everyone can thrive and fosters bold new ideas and collaborative learning networks.

Required Skills
PythonTensorFlowLangchainNeMoHadoopSparkKafkaAWSSQLNoSQLAIMLData Science PythonTensorFlowLangchainNeMoHadoopSparkKafkaAWSSQLNoSQLAIMLData Science
Got hired remotely?

Get paid like a professional

Remote clients expect company invoices, not personal PayPal requests. Glopay forms an EU partnership that makes you look legitimate while you stay independent.

Professional invoices with EU company details
Compliance handled automatically
Withdraw to any bank account
Income reports for easy tax filing
Create free account
Free signup • 5 min setup
About company
Harvard University
Harvard University is a world-renowned academic institution dedicated to advancing knowledge, research, and public service across multiple disciplines.
All jobs at Harvard University Visit website
Job Details
Department Information Technology
Category data
Posted 2 months ago