As a Software Engineer, you will be responsible for building and maintaining robust, scalable systems across cloud and container environments. Your work will focus on designing CI/CD pipelines, managing containerized applications, and ensuring high reliability through observability and proactive monitoring.
Key Responsibilities
- Build and manage automated CI/CD workflows using GitHub Actions
- Apply Git best practices for branching, pull requests, and repository structure
- Develop backend services and internal tools using TypeScript and Python
- Package and optimize applications using Docker
- Deploy and manage workloads in Kubernetes, including scaling and troubleshooting
- Provision and maintain infrastructure in Microsoft Azure
- Implement logging, monitoring, and alerting strategies using Datadog and Splunk
- Configure and maintain Datadog agents and integrations
- Centralize logs in Splunk and optimize search and retention policies
- Create dashboards to track system health and performance
- Respond to incidents, perform root cause analysis, and drive improvements
- Enhance deployment reliability and streamline engineering processes
- Collaborate with both technical and non-technical partners to deliver stable systems
Required Qualifications
- At least 3 years of hands-on engineering experience
- Proven experience with GitHub Actions and CI/CD automation
- Strong command of Git workflows and repository management
- Development experience in TypeScript or Python
- Practical knowledge of Docker and container optimization
- Working expertise with Kubernetes, including deployments and services
- Familiarity with Microsoft Azure services and infrastructure
- Experience with Datadog and Splunk for observability and log management
- Understanding of monitoring, alerting, and dashboarding practices
- Proficiency with command-line tools and production debugging
- Ability to document and update system changes clearly
- Self-directed with strong problem-solving and ownership
- Clear communication skills for technical and non-technical audiences
- Experience with asynchronous collaboration via pull requests, issue tracking, and team messaging


