Systems Development Engineer, Managed Operations Sector: Engineering, Operations and Facilities Management
Role: Professional
Contract Type: Permanent
Hours: Full Time
DESCRIPTION
Would you enjoy diving deep into, operating, and improving some of the largest software systems humanity has ever built? Do the challenges that come of driving technical, business, and cultural change to improve the reliability, performance, and efficiency excite you?
The AWS Managed Operations (MO) organization was founded in April 2023, with the objective to reduce operational load and toil through long-term engineering projects. MO is building the best-in-class engineering and operations team that will own the day-to-day operations for AWS Regions; improving the availability, reliability, latency, performance and efficiency to operate AWS regions.
Amazon is looking for highly motivated Systems Development Engineers who can balance the day-to-day operations of AWS' software systems with long-term software engineering to reduce operational toil. We need engineers who enjoy constantly learning and diving deep into the wide range of systems and technologies that make up one of the world's largest cloud providers.
A day in the life
You'll roughly spend 50% of your time operating production systems and 50% making long-term improvements to the reliability, availability, and performance of those software systems. Over the course of a week, this could look like; Monday morning you root caused why some deployments recently failed, and in the afternoon, you made fixes for those bugs. Tuesday and Wednesday you executed a highly sensitive time critical change to production. Thursday and Friday you were developing software with your team to remove humans from the loop on problems like you worked on over the previous two days, driving a common source of error out of the system and improving its reliability.
BASIC QUALIFICATIONS
- Able to participate in a 24x7 on-call rotation
- 3+ years of experience in software development or related field with proficiency in at least one modern programming language such as Java, Typescript, Python, or Ruby
- Experience operating and troubleshooting reliable, scalable software systems
- Able to troubleshoot at all levels, from network to operating systems to software applications
- Successful applicants must have the legal right to work in Ireland
PREFERRED QUALIFICATIONS
- Excellent communication and problem-solving skills across languages
- Experience operating 24x7 high-availability, distributed software applications and performance tuning software applications and optimizing fleet utilization
- Strong understanding of network fundamentals (DNS, DHCP, TCP/IP, routing, load balancing, load shedding) and experience with monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic or similar)
- Experience scripting operating system tasks in Bash, Python, etc. and with Infrastructure as Code, (such as CDK, CloudFormation, Puppet, Chef, Ansible, or similar)
- Experience operating services in AWS
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build.
#J-18808-Ljbffr