Remote (Global)

Clarivate is hiring a Senior Data Scientist (NLP)

About the Role

The role involves designing, developing, and deploying NLP models to extract meaningful patterns from unstructured text. The candidate will work closely with cross-functional teams to integrate AI-driven solutions into production systems.

Responsibilities

  • Design and implement NLP models for text classification, entity recognition, and semantic analysis
  • Collaborate with engineering teams to integrate models into scalable platforms
  • Evaluate model performance using statistical and qualitative methods
  • Stay current with advancements in NLP and machine learning research
  • Translate business requirements into technical solutions
  • Optimize models for speed and accuracy in production environments
  • Work with large, diverse datasets including structured and unstructured text
  • Develop pipelines for data preprocessing and feature engineering
  • Support deployment of models using containerized environments
  • Conduct error analysis to improve model robustness
  • Document methodologies and share findings with stakeholders
  • Mentor junior team members in NLP best practices
  • Participate in code reviews and technical design discussions
  • Ensure models comply with data privacy and ethical AI guidelines
  • Use Python and relevant NLP libraries for model development
  • Apply transformer-based architectures such as BERT and RoBERTa
  • Leverage cloud platforms for distributed computing tasks
  • Collaborate on product feature development based on NLP insights
  • Present technical results to non-technical audiences
  • Contribute to research initiatives and potential publications

Nice to Have

  • Experience with large language models and prompt engineering
  • Contributions to open-source NLP projects
  • Research publications in NLP or machine learning venues
  • Hands-on experience with distributed computing frameworks
  • Knowledge of semantic web technologies or knowledge graphs

Compensation

Competitive salary and benefits package

Work Arrangement

Hybrid work model with flexibility for remote work

Team

Part of a collaborative data science team focused on advanced analytics and machine learning

About the Team

The data science team operates at the intersection of research and product development, focusing on delivering intelligent systems that solve real-world problems through data-driven methods.

Technology Stack

The team uses Python, PyTorch, Hugging Face Transformers, Docker, Kubernetes, and cloud-based ML platforms to build and deploy NLP solutions.

Available for qualified candidates

Required Skills
Python, LangChain, AWS, Microsoft Azure, GCP, NLP, Machine Learning, Data Science
About the company
Clarivate
Clarivate provides innovative data and analytical solutions to the largest biopharmaceutical and medical technology companies in the world.
Job Details
Category: Data
Posted 10 months ago