CompQsoft ยท 1 week ago
Site Reliability Engineer (SRE)
Wonder how qualified you are to the job?
Information TechnologyRobotics
Insider Connection @CompQsoft
Responsibilities
Possess and maintain active TS/SCI w/Poly Security Clearance.
Proficient with writing services/task automation in Python, Bash
Proficient with interpersonal skills (writing, organization, learning exchange)
Familiarity with core protocols (DNS, DHCP, HTTP, TCP)
Deep knowledge of Linux internals and host-based networking
Expert Linux/Unix performance and stability troubleshooting skills
Familiarity with configuration management solutions such as Chef, Puppet, etc.
Experience with devising, managing, and extending monitoring solutions for large scale environments.
Experience in database management (Oracle DB, MYSQL, Postgres)
Experience in shared file systems (Gluster, ZFS, etc.)
Systematic problem-solving approach, strong communication skills, a sense of ownership and drive
Deep understand of service metrics and alarms through the development of dashboards, service KPIs, alarming systems
Experience working in an operational environment with mission critical tier one services with associated pager duty
Managing large scale, highly distributed, services infrastructures.
Managing host virtualization technologies (KVM, Containers, Docker, etc.)
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
Possess and maintain active TS/SCI w/Poly Security Clearance
Degree in Computer Science or related technical field involving coding or equivalent practical experience
Proficient with writing services/task automation in Python, Bash
Proficient with interpersonal skills (writing, organization, learning exchange)
Familiarity with core protocols (DNS, DHCP, HTTP, TCP)
Deep knowledge of Linux internals and host-based networking
Expert Linux/Unix performance and stability troubleshooting skills
Familiarity with configuration management solutions such as Chef, Puppet, etc.
Experience with devising, managing, and extending monitoring solutions for large scale environments
Experience in database management (Oracle DB, MYSQL, Postgres)
Experience in shared file systems (Gluster, ZFS, etc.)
Systematic problem-solving approach, strong communication skills, a sense of ownership and drive
Deep understand of service metrics and alarms through the development of dashboards, service KPIs, alarming systems
Experience working in an operational environment with mission-critical tier one services with associated pager duty
15.5+ years managing large scale, highly distributed, services infrastructures
16.2+ years managing host virtualization technologies (KVM, Containers, Docker, etc.)
Preferred
Proficient in coding complex, distributed systems using Python or Java
Deep knowledge of Networking (TCP, UDP, DNS, DHCP, IPSec)
Deep focus on building secure Internet-facing systems and services
DevOps, Cloud experience
3+ years of experience in production software development with Agile methodologies
3+ years managing host, network, or storage virtualization technologies
Expert troubleshooting skills
Expert fleet automation and management solutions
Company
CompQsoft
CompQsoft is an ISO 9001:2015, ISO 27001:2013 & ISO 20000-1:2011 & CMMI Level 3 certified company was founded as a minority owned HUBZone small business in Houston, Texas in 1997 to provide SAP services to assist commercial companies to respond to the Year 2000 challenge.
Funding
Current Stage
Growth StageRecent News
2024-06-04
Company data provided by crunchbase