Role: Azure Reliability Consultant
Job Location: Atlanta, GA (Onsite)
Duration: 12+ Months
Job Description : -
Exp:10+ Yrs.
- Operational Strategy Development: Design and implement strategies to optimize operational processes and improve system reliability and performance within the Azure cloud environment.
- Infrastructure Management: Oversee the management and provisioning of Azure cloud infrastructure using tools like Azure Resource Manager (ARM) templates, Terraform, or Ansible.
- Kubernetes Management: Deploy, manage, and optimize Kubernetes clusters within Azure Kubernetes Service (AKS) to ensure high availability and scalability.
- Monitoring and Incident Management: Implement monitoring solutions and establish incident management protocols to ensure high availability and reliability of Azure services and Kubernetes clusters.
- Performance Optimization: Analyze system performance and implement improvements to enhance scalability and efficiency in the Azure cloud and Kubernetes environments.
- Collaboration: Work closely with development, QA, and operations teams to ensure seamless integration and delivery of software within the Azure and Kubernetes environments.
- Security Best Practices: Implement security best practices in operational processes, Azure infrastructure management, and Kubernetes cluster configurations.
- Documentation: Create and maintain comprehensive documentation for operational processes, Azure infrastructure configurations, Kubernetes deployments, and incident management procedures.
- Training and Mentorship: Provide training and mentorship to team members on Azure operational practices, Kubernetes management, tools, and methodologies.