Job, Senior Site Reliability Engineer (Kubernetes) in Carlsbad or Menlo Park

See job description below. Contact me if interested or have someone you’d like to refer.

Senior Site Reliability Engineer (SRE)

As a Senior SRE, you will plan, design, and help to build and operate a cutting edge, cloud native FinTech SaaS application with an increasing number of users, features, business requirements, partners and new engineers. You will lead design, build, and delivery of software leveraging IaaS and PaaS to optimize the application infrastructure and to enhance the scalability, availability, and efficiency using tools including but not limited to AWS, Docker, Kubernetes, Kafka and Java based microservices.

Responsibilities:

Primary point of contact for developers, quality assurance and operations
Architectures and methods to configure, deploy and operate services
Production readiness review for service launch
Monitor and troubleshoot issues in the application stack
Design release management processes
Implement best practice workflows in an Agile environment
Provide application performance insights to development and product teams
Escalation support for operations staff to troubleshoot application stack issues
Demonstrate initiative in identifying best practices and proposing appropriate technologies to achieve organizational results.

Required Knowledge & Skills

Deep Docker and Kubernetes expertise (critical)
Experience operating a production cloud service
Hands-on knowledge of IaaS and PaaS, e.g., AWS (RDS, ElastiCache, SNS, SQS)
Understanding of CI/CD pipelines and related tools (GitHub, Jenkins or CircleCI, JFrog, etc.)
Automation experience
Knowledge of SaaS application design and architecture including real-time distributed web services
Understanding of availability, reliability, performance and scalability
Experience with agile development methodologies
Java or Scala development experience, e.g., microservices, WebSockets, RESTful APIs is a plus
Experience with reactive platforms, e.g., event streaming, Akka, Kafka, Lagom is a plus
Python or go programming knowledge is a plus
Secure coding and operational practices are a plus
Desire to work in a fast-paced dynamic environment

Leave a Reply

Your email address will not be published.