Systems Administrator, Ecommerce Domain
2 days ago
We are seeking a talented and experienced Linux System Administrator to join our team. The ideal candidate will be responsible for managing our hosted Linux systems to ensure high availability, security, and optimal performance. The candidate will also communicate with stakeholders at all levels, and should have excellent customer service skills and be comfortable in a customer-facing role. The System Administrator plays a crucial role in provisioning and delivering services requested by Cloud customers and ensuring smooth Cloud Operations. This role involves close collaboration with internal, customer-facing, and operations teams to deploy, monitor, and troubleshoot applications and services hosted on our servers.
As a 24x7x365 organization, we require shift work, including night shifts, work on holidays, and on-call responsibilities.
Note: This role requires providing support during night shifts (10 pm IST to 7 am IST).
Responsibilities:
Install, configure, maintain, and upgrade Cloud Infrastructure environments.
Design, deploy, and manage cloud infrastructure, ensuring scalability, security, and reliability.
Manage and maintain Linux systems, including installation, configuration, and patching.
Implement and maintain security measures to protect servers and data, including firewall configurations, access controls, and encryption.
Monitor system performance and resource utilization, identifying and resolving issues to ensure uptime and responsiveness.
Collaborate with internal and customer-facing teams to deploy applications and troubleshoot issues in a timely manner.
Automate repetitive tasks and workflows using scripting languages such as Bash and Python, automation tools such as Ansible and Terraform, and other tooling.
Implement backup and disaster recovery solutions to ensure data integrity and business continuity.
Work closely with operations teams to ensure the smooth conduct of Cloud Operations and deliver high-quality services to customers.
Change Management:
Create RFC (maintenance window) documents for installation/upgrade activities in strict compliance with documented procedures, and work towards securing approval to execute the RFC tasks.
Execute approved RFCs on Cloud customer environments once the requisite certification level has been achieved.
Execute environment creation, environment cloning, and server migration/upgrade projects for Cloud customers including A & A+ customers.
Service Request, Incident Management & Problem Management:
Work on assigned incidents and resolve them within the defined Service Level Agreements (SLAs). The incidents assigned to this position are of a more complex and critical nature.
Act as the key technical resource for handling outage situations and creating RCAs.
Perform root cause analysis of recurring, high-complexity issues by working closely with the Cloud Problem Management team, employing the DMAIC approach to problem solving.
Continual Improvement:
Provide suggestions for improvement and draft work instructions for activities that do not have documented instructions.
Develop training/mentoring plans for team members.
Evaluate new products to ensure the team is ready for future RFCs.
Official account of Jobstore.