Total-TECH Co.
"The Job Description"
- Triage and handle node health issues during working hours.
- Participate in firefighting alongside development engineers.
- Own the design, execution, and support of the product's deployment topology through infrastructure as code.
- Own and maintain the distribution, scaling, metrics collection, and monitoring of multiple clusters.
- As a stakeholder, support engineers in defining resourcing for the services they are building.
- Own the running of our CI/CD systems and work with the Testing Engineers to create a well-tested product.
- Improve and own operational processes.
- Know and focus on the security of the topologies that we have running in production.
- Plan the growth of the infrastructure based on business needs and inputs.
Requirements:
- Kubernetes, Docker, and Helm.
- Very comfortable operating in Linux, including knowledge of Bash.
- Experience with a cloud hosting platform (ideally GCP, but AWS or Azure also works).
- Able to write code in Python.
- Experience deploying and maintaining modern CI/CD systems (Zuul, CircleCI, Concourse, etc.).
- Knowledge of and passion for infrastructure as code.
