22 may
|
CTC
|
Heroica Puebla de Zaragoza
22 may
CTC
Heroica Puebla de Zaragoza
Postúlate en Kit Empleo: kitempleo.com.mx/empleo/5qas71
Site Reliability Engineer - 100% Remote in Mexico
Since its founding in 1996, CTC has grown into a trusted general partner in AI & ML, Enterprise Applications, Digital Services, Managed Services, and Business Services. With headquarters in Detroit, Michigan, CTC has a team of over 2,000 experts worldwide. We empower more than 100 organizations to tackle complex challenges and transform them into sustainable competitive strengths--driving innovation, efficiency, and growth every step of the way. Our strengths have always been Commitment to Customer, Commitment to Colleagues, and Commitment to Community (CTC).
We’re looking for a technical leader who thrives at the intersection of software engineering, automation, and systems thinking — someone who can influence strategy while still diving deep into hands‑on engineering.
What You’ll Do
Build & Evolve Reliable Infrastructure
Design and implement highly available, scalable, fault‑tolerant systems.
Partner with engineering teams to define and enforce reliability standards and best practices .
Automate Everything
Automate provisioning, configuration, and deployments using IaC and CI/CD pipelines .
Collaborate with developers to build frictionless, automated delivery workflows in GitHub.
Drive Observability & Performance Insights
Build and enhance full‑stack observability solutions.
Use analytics and monitoring tools to proactively detect issues and optimize performance.
Own the incident lifecycle: response, coordination, communication, and resolution.
Create and maintain incident response playbooks and escalation paths.
What You Bring
5+ years as an SRE or similar engineering role.
4+ years working with Azure in production environments.
4+ years supporting enterprise‑scale applications .
3+ years building observability solutions (Grafana, Elastic, Splunk, Prometheus, etc.).
2+ years building CI/CD pipelines in GitHub .
Strong programming skills in Python, Go, Java, or C# (automation‑focused).
Deep experience with Terraform and IaC principles.
Hands‑on experience with Kubernetes and Docker .
Familiarity with Ansible , Grafana, Elastic, Splunk, Prometheus, and other monitoring/alerting tools.
If you’re an SRE who loves solving complex problems, automating at scale, and building systems that just don’t go down, we’d love to meet you.
Apply now or reach out directly — let’s build something exceptional together.
#J-18808-Ljbffr
Postúlate en Kit Empleo: kitempleo.com.mx/empleo/5qas71
📌 Site Reliability Engineer (Heroica Puebla de Zaragoza)
🏢 CTC
📍 Heroica Puebla de Zaragoza