Responsibilities
- Working on Internet technologies to improve the performance, availability, and scalability of large distributed content delivery systems
- Engaging in collaborative efforts with cross-functional teams, including Product & engineering, to define and establish measurable SLIs and SLOs
- Providing technical expertise and feedback to ensure system designs and implementations meet reliability and performance requirements
- Monitoring platform availability and performance, debug issues by leveraging data analysis skills and implement corrective actions to avoid recurrence
- Developing and implementing automation solutions to improve operational efficiency and reduce toil.
- Participating in design reviews and providing technical guidance to ensure designs meet requirements for scalability, performance, and robustness
Requirements
- 5 years of relevant experience and a Bachelor's degree in Computer Science or its equivalent
- Familiarity with Internet protocols (DNS/HTTP/TLS/TCP etc.)
- Experience utilizing Oracle SQL for data integrity checks, root cause analysis of data anomalies, and the development of data reports
- Proficiency in Scripting languages (Python, bash, JavaScript etc)
- Experience with monitoring and alerting systems (e.g., Prometheus, Grafana, ADBMS, Datadog), including metric collection, alerting, dashboarding, and troubleshooting
- Fluency working in a UNIX/Linux computing environment
Additional Information
- Applications are being accepted on an ongoing basis until the job is filled.
- Akamai Technologies is an Affirmative Action, Equal Opportunity Employer that values the strength that diversity brings to the workplace.
- All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of gender, gender identity, sexual orientation, race/ethnicity, protected veteran status, disability, or other protected group status.
