Site Reliability Engineering (SRE)
FICO
Guadalajara, Jalisco, México
hace 5 días
source : JobLeads

Estamos buscando el candidato adecuado para cubrir este puesto en una empresa apasionante.

  • A FICO SRE Manager is a multi-talented leader responsible for overall production services of highly distributed and complex systems.
  • Versed in both hardware and software, this individual will govern a global team of Site Reliability Engineers focused on building resiliency proactively into Company applications and infrastructure.
  • A deep understanding of service observability using saturation, traffic, latency, and error rate is required to ensure SLAs / SLOs / SLIs metrics are met and exceeded.
  • The SRE Manager must also comprehend the Software Delivery Lifecycle (SDLC) and incorporate SRE engineers into the workflow / communication channels along with tracking the effectiveness of the practice.
  • This role also requires strong communication skills that not only brings together different disciplines to the SRE organization, but is also well-connected with broader business, engineering, development, and IT teams.
  • Rigid incident management process adherence should be fundamental to this position as well as task prioritization and project planning.
  • Experience with developing operational automation and self-healing is beneficial. Collectively, these traits inherent to a FICO SRE Manager serves to add reliability to not only the overall architecture but build more resilient systems and teams.
  • What they are Seeking

  • Senior technical leader with breadth and depth. Multiple specializations over a career in the 5+ year range is preferable
  • Global Management Experience working as a technical manager with remote workers
  • Hands-on infrastructure and application software experience
  • Ability to analyze and solve issues for globally distributed systems
  • Technical experience in Java application servers (JBOSS), Windows / Linux Operating Systems, network fundamentals, cloud technology fundamentals
  • KPI driven expertise to quantify application and engineering performance against SLAs / SLOs / SLIs with emphasis on continual improvement and systems reliability / capacity
  • Full-stack software development experience with focus on automation Successful SRE Managers have come from multiple backgrounds, but must have Application / Systems Administration knowledge if not specialization
  • Foster and evangelize best practices for Site Reliability Engineering
  • Bachelor’s degree or equivalent or higher
  • Expertise in the practices and technologies of :

  • Operations / SRE Management Experience with at least 5 direct reports
  • Understanding of full-stack software development
  • Incident management
  • Large Unix / Linux distributed systems
  • Cloud implementations
  • Large scale traditional and NoSQL databases
  • Modern networking
  • Modern multi-tier large web systems
  • Reportar esta oferta
    checkmark

    Thank you for reporting this job!

    Your feedback will help us improve the quality of our services.

    Inscribirse
    Mi Correo Electrónico
    Al hacer clic en la opción "Continuar", doy mi consentimiento para que neuvoo procese mis datos de conformidad con lo establecido en su Política de privacidad . Puedo darme de baja o retirar mi autorización en cualquier momento.
    Continuar
    Formulario de postulación