Codersbrain technology pvt ltd
Lead Platform Engineer - Azure Kubernetes Service
Job Location
Bangalore, India
Job Description
Lead Platform Engineer (Azure Kubernetes Service)
- We are seeking a seasoned Lead Platform Engineer to drive the maintenance and expansion of our Azure Kubernetes Service (AKS) environment.
- This critical role is central to our IT services and cloud platform strategy, overseeing the architecture, design, development, implementation, and on-call production support of our 18-component Kubernetes ecosystem and all associated CI/CD pipeline services.
- In this position, you will collaborate closely with digital product, software development, infrastructure, and operations teams to enhance the Developer Experience.
- You'll lead the use of CI/CD tools, including GitHub Actions and Flux CD, and use monitoring tools such as Grafana to ensure optimal performance of our applications and API services.
- As a thought leader in Kubernetes, you will play a key role in shaping and executing our Kubernetes platform strategy, managing technical debt efficiently, and ensuring a robust, scalable, and secure platform.
- Additionally, as a member of the Enterprise Architecture team, you will apply your deep expertise in Application and Cloud Platform Engineering to help drive Camping World's growth and long-term success.
What You'll Do:
- Architect, design, and implement Kubernetes clusters on Azure Kubernetes Service (AKS), ensuring high availability, scalability, and reliability.
- Develop, manage, and support Infrastructure as Code (IaC) components, using Terraform to deploy and maintain primary and supporting infrastructure.
- Design, implement, and maintain CI/CD pipelines for Kubernetes deployments using GitHub Actions and Flux CD.
- Collaborate with development teams by offering guidance throughout the development and deployment phases, reviewing and modifying code within GitHub repositories to ensure smooth integration and fully automated deployment processes.
- Provide on-call production support, troubleshooting and resolving complex issues related to AKS and container orchestration quickly and with minimal downtime.
- Optimize cluster performance, scalability, and security to meet evolving requirements and resolve technical challenges.
- Monitor and manage Kubernetes resources using observability tools (Grafana, SolarWinds, Dynatrace, Datadog, New Relic, etc.) to proactively identify and resolve issues.
- Troubleshoot and address malfunctioning or underperforming applications, ensuring root causes are identified and long-term solutions are implemented.
- Serve as a thought leader in Kubernetes, driving the platform strategy, advocating for best practices, and fostering continuous improvement and innovation.
What You'll Need to Have for the Role:
- 5 years of hands-on experience in designing, managing, and supporting complex, enterprise-grade Microsoft AKS environments.
- Extensive experience with Azure cloud services, including Azure SQL Database, Storage Accounts, and Azure Container Registry.
- Strong understanding of and hands-on experience with Terraform for automating infrastructure deployment and management.
- Deep knowledge of containerization technologies (Docker) and orchestration (Kubernetes), including Helm for managing Kubernetes applications.
- Proven experience in designing, implementing, and managing CI/CD pipelines using GitHub Actions and Flux CD.
- Proficiency in reading, understanding, and modifying code in GitHub, supporting development teams, and ensuring smooth integration with Kubernetes platforms.
- Expertise in security best practices within Kubernetes environments, ensuring secure and compliant deployments.
- Hands-on experience with monitoring and observability tools, including the Grafana stack (Grafana, Loki, Mimir, Tempo), for creating dashboards and alerts.
- Practical experience with Kuma/Kong Mesh service mesh technologies.
- Hands-on experience managing Kong API gateways.
- Exceptional problem-solving skills and strong communication abilities, with the capacity to lead troubleshooting sessions and guide cross-functional teams.
- Experience in platform architecture (IaaS, PaaS), site reliability engineering (SRE), quality assurance (QA), system design, integrations, and end-to-end implementation.
- Experience working with Enterprise Architecture (EA) teams, participating in EA processes, and engaging with Architecture Review Boards (ARB), Change Advisory Boards (CAB), and other governance bodies (GRC).
(ref:hirist.tech)
Location: Bangalore, IN
Posted Date: 1/15/2025
Contact Information
Contact: Human Resources, Codersbrain technology pvt ltd