Join a team responsible for managing Linux-based infrastructure across on-premises and cloud platforms. In this role, you'll monitor system health, respond to operational alerts, and resolve incidents affecting servers, services, and network connectivity. Your work ensures high availability, performance, and security for production and development environments.
Key Responsibilities
- Respond to system alerts and resolve incidents within defined timeframes
- Diagnose and fix issues related to operating systems, applications, middleware, and network services
- Investigate boot failures, performance bottlenecks, storage constraints, and service outages
- Analyze system and application logs to determine root causes of failures
- Support critical services including web servers, email relays, DNS, SSL/TLS, and scheduled tasks
- Manage user accounts, access controls, SSH configurations, and privilege escalation policies
- Perform OS updates, package management, kernel upgrades, and repository maintenance
- Administer systemd services and troubleshoot failed processes or daemons
- Handle storage tasks such as filesystem management, LVM configuration, and disk cleanup
- Write and maintain Bash scripts to automate routine system tasks
- Verify network connectivity, firewall rules, routing, and name resolution
- Apply security hardening practices and address vulnerabilities in Linux environments
- Review access logs, audit privileged usage, and maintain secure configurations
- Support deployment and onboarding of new systems and coordinate migration activities
- Conduct regular system checks, performance monitoring, and capacity planning
- Act as a technical resource for support teams and provide escalation assistance
- Participate in on-call rotations with after-hours and weekend coverage
- Document system configurations, procedures, and operational best practices
- Follow change and problem management workflows for internal and client systems
- Contribute to continuous improvement initiatives and client onboarding projects
Qualifications
Candidates should hold a degree in information technology or a related field, or have equivalent experience. Minimum of 5–7 years of hands-on Linux system administration in both physical and virtual environments is required, including cloud and on-premises infrastructures.
Proficiency with Ubuntu, RHEL, CentOS, Rocky Linux, and AlmaLinux is essential. Experience with systemd, logging systems, package managers, and system monitoring tools is expected. You must be comfortable using command-line utilities for process, memory, disk, and network analysis.
A solid understanding of TCP/IP, DNS, SMTP, HTTP/HTTPS, SSL/TLS, and firewall concepts is required. Prior support experience with Apache, Nginx, Postfix, reverse proxies, and certificate management is highly valued.
Strong analytical, communication, and time management skills are necessary. You should be self-driven, detail-oriented, and capable of working independently or within a collaborative team structure. Remote work capability with a stable internet connection is mandatory.
Preferred Experience
- Work within a Managed Service Provider (MSP) setting
- Experience in ticket-based operational support environments
- Familiarity with monitoring platforms such as LogicMonitor, Zabbix, or Nagios
- Scripting and automation experience using Bash for operational efficiency
Work Environment
This is a remote position requiring reliable internet access. The role includes rotational on-call duties with weekend and after-hours availability to support 24/7 operations. Work hours are flexible but must align with team coverage needs.
Our Values
We emphasize accountability, collaboration, meaningful contributions, mutual respect, and a positive experience in every aspect of our work. We support diversity and inclusion and provide reasonable accommodations for qualified individuals with disabilities.