Waypoint Human Capital · 2 days ago
Site Reliability Engineer
Staffing and Recruiting
Responsibilities
Develop and implement automation tools to support monitoring and system administration, reducing the risk and labor associated with manual tasks.
Create sustainable tools that match or exceed the performance of manual methods, enhancing the efficiency and reliability of operations.
Provide technical direction for system monitoring, health checks, and performance optimization.
Work on tasks of varying complexity, including those that require developing GUIs for easier cluster management and automation.
Formulate and enforce Standard Operating Procedures to ensure consistency and reliability in system administration tasks.
Use knowledge of tools like Salt and Puppet to determine the best automation approach for different tasks.
Qualifications
Required
Bachelor's Degree in Computer Science or a related technical field (equivalent to two years of experience)
Master's Degree in a Technical Field (equivalent to four years of experience)
AWS Certified Solutions Architect - Professional
AWS Certified DevOps Engineer - Professional
Fourteen (14) years in software development/engineering, covering requirements analysis, software development, installation, integration, evaluation, enhancement, maintenance, testing, and problem diagnosis/resolution
Ten (10) years in system engineering/architecture
Ten (10) years with products supporting highly distributed, massively parallel computation (e.g., HBase, Hadoop, Accumulo, Bigtable, Cassandra, Scality)
Ten (10) years writing software scripts for automation using languages like Perl, Python, or Ruby
Four (4) years managing and monitoring large Cloud Systems
Experience in providing technical direction for the development, engineering, interfacing, integration, and testing of complete hardware/software systems, including postmortem analysis and incident management
Active TS/SCI security clearance with a current full scope polygraph
Preferred
Proficient in developing automation tools to improve system administration efficiency
Experienced in handling and optimizing large-scale, distributed systems
Skilled in writing and maintaining scripts for software automation
Capable of providing technical direction and improving organizational processes
Able to work effectively in a fast-paced, dynamic environment with shifting priorities