SRE Architect

United States
Remote
Full-Time

Job Description

Role - SRE Architect

Location - Remote

Duration - Full Time

Responsibilities

Roles & Responsibilities:

  • 18+ years of Development and Operations experience in building and running applications in production that has uptime over 99%. Related experience and/or training; or equivalent combination of education and experience

  • 8+ years of experience as a SRE Architect in running large Reliability & Observability Programs for large, complex infrastructure deployments / distributed systems for major Banking customers.

  • Has a keen eye for industry trends, tries out newer tools/infrastructure to improve current systems in terms of execution and/or operability

  • Strong hands-on coding experience in one or more of programming languages such as Java etc.

  • Good understanding of Observability (monitoring, logging, tracing, metrics), Chaos engineering concepts.

  • Proficiency in using Application Performance Monitoring (APM) tool New Relic/Dynatrace for monitoring, logging, tracing and Splunk for Log monitoring.

  • Expert level hands on knowledge in cloud platforms like PCF.

  • should have implemented solutions around Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for services.

  • Should have supported Production Incidents (PIs) on mission critical applications of a company. Troubleshoot, debug, and diagnose operational issues and drive them to closure.

  • Understanding of software delivery life cycles, particularly Agile/Lean & DevOps

  • Proven experience in handling large scale and growing infrastructure across Data Centers and heterogeneous Cloud platforms

  • Experience as a service owner in managing large geographically diverse stakeholders

  • Ability to work with creative – fast growing engineering team and motivate them to deliver their best work

Qualifications