Site Reliability Engineer I, Cloud Computing Technologies Agile Development Labor Rate builds a meaningful engineering discipline, combines software and systems to develop creative engineering solutions to enhance operations and support development teams. Ensures the design and upgrades of automated software, changes management, and delivers management solutions to operations problems encountered by the organization. This may involve engaging the development team throughout the project lifecycle to design and develop software for reliability and scale in order to ensure minimal refactoring or changes. Troubleshoots priority incidents and participates in the creation of new designs, architectures, standards and methods supporting large scale distributed systems. May have 2-5 years of experience in this field, with a minimum of one year of satisfactory full-time professional experience in an Information Technology, software development or security operations environment in one or a combination of the following: Department of computer science, information security, cybersecurity, public health service, public safety services, engineering, information technology, and any other specialized area related to site reliability engineering field. Proficient in Perl, Shell, Java, Python, C++, and other relevant programming/scripting languages, as well as relational database languages such as SQL and no-SQL data stores. Possesses excellent organizational skills and ability to multitask and prioritize responsibilities according to order of importance. Have moderate experience working with at least one relevant orchestration and configuration management toolsets such as Ansible, Puppet, Chef, Terraform, Packer, OpenStack Heat, and/or CloudFormation. Demonstrates adequate knowledge of operational concepts (including change management, on call rotations, escalations, uptime, performance tuning, monitoring, log analysis, etc.) to ensure effective operations and support the development team. Have adequate knowledge of utilizing monitoring tools like Nagios, AppDynamics, and/or CloudDogs, as well as experience in design and development of new monitoring checks. Proficient in UNIX/Linux operating systems and showcases expertise in Java application containers like Tomcat and Apache web servers. Works in conjunction with the development and engineering teams to maintain and enhance internal tools. Operates under the direction and guidance of the Senior Site Reliability Engineer.
Further rates within this Site Reliability Engineer I category.