Company DescriptionIt all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500. Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But this is just the beginning of our journey. Join us as we pursue our purpose to make the world work better for everyone.
Job DescriptionThe Site Reliability Engineering team is a group of highly technical engineers who are tasked with maintaining and developing the reliability, scalability and performance of the ServiceNow platform and infrastructure. The SRE is empowered to drive technical resolutions across the technology stack from application through to hardware and all stops in between. The ultimate goal of the SRE is to never have to escalate an issue to an engineering or development team and to completely own the resolution of incidents. They are also tasked with driving forward the operability of the platform to drive down incident numbers and to reduce MTTR. To accomplish this the team combines Software Development, Networking and Systems Engineering expertise with a strong desire to be challenged by problems of scale and complexity and to make services better for our customers.
What you get to do in this role: As an Engineer in the SRE team you will:
Provide relief and sustainable resolution to issues within our infrastructure.Use your experience in software development, systems engineering, and networking to proactively prevent repeatable issues.Drive initiatives with partner teams to improve the reliability and performance of the infrastructure through improved system design.Drive a culture of intolerance to manual activity which results in a highly automated environment delivering scalable solutions.Drive monitoring and automation initiatives.Please note this is a Swing shift role with a Wed-Sat working week, and includes a shift allowance to compensate. Candidates for this role must be based in Ireland. QualificationsTo be successful in this role you have: Deep knowledge of Linux systems.Experience working with relational databases: MySQL, MariaDB or PostgreSQL.Experience working with systems at scale - supporting critical services with focus on automation, observability, availability, and performance.Experience with Kubernetes to orchestrate the deployment, scaling, and management of containers.Experience coding in various languages; preferably Python, JavaScript, and Ruby.Networking skills, IP addressing and routing.Team-first attitude and an uncompromising attention to detail.An eye for proactively anticipating potential issues, expertise in performing root cause analysis, and a mindset focused on building effective solutions to prevent recurrence.Good collaboration and communication skills.Good to have: Expertise in Observability and Monitoring of applications, services, and networks at scale.Experience with DevOps automation, CI/CD pipeline and agile methodologies such as GitLab CI/CD.Experience writing test specifications and understand the fundamentals of test automation.Experience working with Cloud technologies such as Azure and AWS.Experience in configuration management of infrastructure using Ansible or Puppet.Experience developing on the ServiceNow Platform.
#J-18808-Ljbffr