American Expressprivate limited

American Express - Engineer - Site Reliability Engineering

Click Here to Apply

Job Location

India, India

Job Description

You Lead the Way. We've Got Your Back. With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, you'll learn and grow as we help you create a career journey that's unique and meaningful to you with benefits, programs, and flexibility that support you personally and professionally. At American Express, you'll be recognized for your contributions, leadership, and impactevery colleague has the opportunity to share in the company's success. Together, well win as a team, striving to uphold our company values and powerful backing promise to provide the worlds best customer experience every day. And well do it with the utmost integrity, and in an environment where everyone is seen, heard and feels like they belong. Key Responsibilities : - SRE Strategy and Leadership : Develop and implement a comprehensive SRE strategy aligned with the company's goals and objectives. Lead a team of SRE professionals to drive the reliability, performance, and scalability of GRC technology solutions. - Observability and Monitoring : Establish observability practices to ensure real-time insights into system performance, availability, and customer experience. Implement monitoring tools, metrics, and dashboards to proactively identify and address potential issues. - Production Support Optimization : Lead all aspects of the end-to-end production support process, including incident management, problem resolution, and service-level agreement (SLA) compliance. Drive continuous improvement initiatives to enhance operational effectiveness and reduce mean time to resolution (MTTR). - GRC Customer Journeys : Collaborate with multi-functional teams to enhance customer journeys through seamless and reliable technology experiences. - Reliability Engineering Best Practices : Promote and implement standard methodologies, including error budgeting, chaos engineering, and disaster recovery planning. Cultivate a culture of resilience and reliability within technology. - Automation and Efficiency : Champion automation initiatives to streamline operational workflows, deployment processes, and incident response tasks. Leverage automation tools and orchestration to improve reliability and reduce manual : - 3 to 6 years of experience and degree or equivalent experience in Computer Science, Information Technology, or related field. Advanced certifications in SRE or related are a plus. - Deep understanding of observability tools and methodologies, including experience with logging, monitoring, tracing, and performance analysis platforms. - Strong leadership and people management skills, with the ability to inspire and empower successful SRE teams. Preferred Skills : - Hands-on coding and System Design of highly available distributed systems - Java/Golang/Javascript, Kubernetes, Docker - Knowledge on modern observability stack splunk, elastic search, Prometheus, Grafana - Knowledge of cloud-based SRE practices and experience with public cloud platforms such as AWS, Azure, or Google Cloud. - Familiarity with containerization technologies (e.g., Kubernetes, Docker) and microservices architecture. - Demonstrated expertise in driving culture change, DevOps practices, and continuous improvement in SRE and production support functions. (ref:hirist.tech)

Location: India, IN

Posted Date: 10/25/2024
Click Here to Apply
View More American Expressprivate limited Jobs

Contact Information

Contact Human Resources
American Expressprivate limited

Posted

October 25, 2024
UID: 4863174765

AboutJobs.com does not guarantee the validity or accuracy of the job information posted in this database. It is the job seeker's responsibility to independently review all posting companies, contracts and job offers.