Senior Platform Systems Engineer
Serve as a subject matter expert on microservices and distributed application systems at scale.
Drive operational improvements using insights from metrics and customer feedback.
Lead incident response and post-incident reviews.
Take responsibility for maintaining services, platforms, and infrastructure to ensure smooth operation at a large production scale.
Monitor and optimize the health and performance of applications and platforms.
Debug problems reported by partners and end-users using in-depth log analysis and stack traces.
Create comprehensive documentation for operational procedures and environment setup.
Develop tools, operational enhancements, and automated solutions to streamline processes.
Be a member of a 24x7 shift rotation.
Bachelor’s degree in Information Technology (I.T.) or a related field.
Demonstrated ability in supporting critical production services.
At least 5 years of experience working with the following:
Logging and monitoring: Splunk, Grafana, and Prometheus.
Container Orchestration: Docker, Kubernetes.
Version/Config Management: Git, Ansible.
Linux administration: RHEL, CentOS, or other Unix-like systems.
At least 3 years of experience working with the following:
Scripting: Bash, Python, Ruby.
Incident management: PagerDuty, ServiceNow.
APIs, HTTP requests, and JSON/YAML formats.
Strong communication skills, both written and verbal.
Relevant certifications in any of the key skills (e.g., CKA or CKAD).
Familiarity with CI/CD tools (Jenkins, Spinnaker).
Operations experience in e-commerce, payments systems, or private cloud services.
Strong understanding of security concepts, including SSL and certificate renewal processes.
You'll be part of a driven and passionate Platform Systems Engineering team known for its excellent operational work. These are people who take on data center and platform operations challenges head-on and are not afraid of doing the dirty work, while continuously looking for ways to make things better and more efficient.
Our team is made up of individuals who are aligned with OpsWerks’ values. In the spirit of building a healthy community, which requires open and honest communication, here are our expectations for every one of us at OpsWerks:
To uphold OpsWerks’ Mission and Methods.
To know, believe, and execute each team’s mission plan.
To grow in the 4 areas of awareness (self, others, surroundings, and situation).
To take ownership of your personal growth for the team’s well-being.
To never give up, to never give in… to only give your best.
Ready to start your awesome journey and be part of OpsWerks?