Senior Linux & AWS Systems Administrator
Full-time
Senior Executive
1 month ago
We are seeking a talented and experienced Senior Linux/AWS Systems Administrator to join our team. The ideal candidate will be responsible for managing our Linux and Windows-based systems hosted on AWS and VMware infrastructure, ensuring high availability, security, and optimal performance. The Systems Administrator plays a crucial role in provisioning and delivering services requested by Cloud customers and in ensuring smooth Cloud Operations. This role involves close collaboration with internal, customer-facing, and operations teams to deploy, monitor, and troubleshoot the applications and services hosted on our servers.
As a 24x7x365 organization, we require shift work, including night shifts and holidays, as well as on-call responsibilities.
Note: This role requires night-shift support (10 pm IST to 7 am IST).
Responsibilities:
Install, configure, maintain, and upgrade Cloud Infrastructure including Linux, Windows, and VMware environments.
Proven experience as a Linux system administrator, with strong knowledge of CentOS/Red Hat or Ubuntu.
Hands-on experience with AWS services such as EC2, S3, RDS, IAM, and VPC.
Proficiency in scripting languages such as Bash, PowerShell, Python, or Perl.
Design, deploy, and manage AWS cloud infrastructure, ensuring scalability, security, and reliability.
Strong troubleshooting and problem-solving skills, with the ability to analyze complex issues and implement effective solutions.
Excellent communication and collaboration skills, with the ability to work effectively in a team environment.
Manage and maintain Linux and Windows-based servers, including installation, configuration, and patching.
Implement and maintain security measures to protect servers and data, including firewall configurations, access controls, and encryption.
Monitor system performance and resource utilization, identifying and resolving issues to ensure uptime and responsiveness.
Collaborate with internal and customer-facing teams to deploy applications and troubleshoot issues in a timely manner.
Automate repetitive tasks and workflows using scripting languages such as Bash, PowerShell, Python, or Perl.
Implement backup and disaster recovery solutions to ensure data integrity and business continuity.
Work closely with operations teams to ensure the smooth conduct of Cloud Operations and deliver high-quality services to customers.
Change Management:
Create RFC (maintenance window) documents for installation/upgrade activities in strict compliance with documented procedures, and work toward securing approval to execute the RFC tasks.
Execute approved RFCs on Cloud customer environments if the requisite certification level has been achieved.
Execute environment creation, environment cloning, and server migration/upgrade projects for Cloud customers including A & A+ customers.
Service Request, Incident Management & Problem Management:
Work on assigned incidents and resolve them within the defined Service Level Agreements (SLAs). Incidents assigned to this position are of a more complex and critical nature.
Serve as a key technical resource for handling outages and creating root cause analyses (RCAs).
Perform root cause analysis of recurring, high-complexity issues in close collaboration with the Cloud Problem Management team, using the DMAIC approach to problem solving.
Continual Improvement:
Provide suggestions for improvement and draft work instructions for activities that do not have documented instructions.
Develop training/mentoring plans for team members.
Evaluate new products to ensure the team is ready for future RFCs.