This site uses cookies. To find out more, see our Cookies Policy

Senior Site Reliability Engineer (RT3) in Columbus, OH at Fast Switch

Date Posted: 1/11/2019

Job Snapshot

Job Description

Job ID: 50385

Senior Site Reliability Engineer. The next generation of our clients' products are delivering engaging, adaptive, and personalized learning experiences to optimally support every learner. They are hiring a Senior Site Reliability Engineer contractor who will work with system and software engineers to build reliable, high capacity and high-performance systems in support of their mission to reimagine learning for millions of students and learners worldwide. This position will be located at their Columbus, OH facility.

Responsibilities:

  • As a Sr. Site Reliability Engineer, you will help design, analyze and resolve issues with infrastructure in collaboration with product development teams; you will design, deploy and manage automation tools that increase predictability as well as decrease time to market while reducing cost.
  • Hands-on design, analysis and troubleshooting of highly-distributed large-scale production systems.
  • Ownership of reliability, uptime, capacity- and performance-analysis thereof.
  • Ensuring the repeatability, traceability, and transparency of our infrastructure automation including alignment with client standards and best practices for operational excellence.
  • Identifying highest-impact opportunities to optimize existing systems.
  • System design consulting for teams seeking to leverage or improve their production infrastructure.
  • Anticipate, build and plan capacity for upcoming product/feature launches.

Required Skills:

  • Experience with programming in languages like Javascript, Python, PHP, Go, or Ruby
  • Strong skills in reading, understanding and writing code in the same
  • Mastery of infrastructure automation technologies (like Terraform, CodeDeploy, Puppet, Ansible, Chef)
  • Expertise in container/container-fleet-orchestration technologies (like Docker, Vagrant, Mesosphere, etcd, zookeeper)
  • Cloud and container native Linux administration/build/management skills (AWS AMIs, Packer, etc.)
  • Significant experience troubleshooting concurrent and distributed system interactions
  • Expertise with cloud- continuous-deployment- based software development lifecycles (e.g. CI/CD)
  • Cloud database operations and deployment experience (RDS MySQL/Postgres/Aurora), Caching operations & deployment experience (memcache, Redis)
  • Expertise with Lean/Agile deployment processes (Blue/Green, ZDT, Canary, load balancers/DNS strategies)
  • Familiarity with site and infrastructure monitoring systems (like Datadog, New Relic, Sumologic)
  • Strong problem solving, root cause analysis and systems engineering skills
  • Excellent presentation and communication skills
  • Ability to design and manage escalation response plans from monitoring, react, respond, remediate and retrospect in culturally aligned (proactive, customer focused, collaborative, data-driven) ways
  • Demonstrated expertise building and managing highly scaled production infrastructure in the cloud (AWS required; GCP, Azure, OpenStack a plus)
  • Expertise with SDLC branching, SCM, and code deployment systems (git/gitflow, Jenkins, CircleCI, TravisCI, etc.)
  • BS Degree in Computer Science (or related technical field and/or equivalent industry experience)

Nice to Have:

  • Being able to translate between development, operations, security, product, and management dialects is a highly-sought skill
  • Ability to translate knowledge and ideas into written-word as documentation
  • Being “conversational” in JavaScript/TypeScript, Python, PHP, Ruby, Golang, Java, Bash, Markdown, reStructuredText, HCL, JSON, YAML, and TOML would be valuable. Being fluent in 2-3 of them would be a huge plus
  • A non-trivial background in open source is a huge plus.

** Due to high volume of applicants, only applicants with the following information will have further consideration for the above open position: First and last name, Education (graduation date, degree), Authorization to work in the U.S. including C2C, or H1 sponsorship transfer request. 

To view all our open positions, please go to: http://www.jobs.net/jobs/fastswitch/en-us/all-jobs/