Altech Solutions · 16 hours ago
Site Reliability Engineer - El Segundo, Calio
Maximize your interview chances
Information TechnologySoftware
Insider Connection @Altech Solutions
Get 3x more responses when you reach out via email instead of LinkedIn.
Responsibilities
Lead real-time change management processes, ensuring minimal disruption to operations and adherence to client protocols.
Coordinate incident responses, troubleshoot issues, and provide root cause analysis to improve overall system stability and resilience.
Act as the primary interface with the organization’s web platform team to manage cross-functional workflows.
Design, create, and modify workflows for the deskside support team to streamline operations, enhance productivity, and improve response times.
Collaborate with the network and ISSO support teams to identify and address potential reliability issues proactively.
Integrate and manage knowledge base articles, ensuring they are accurate, up-to-date, and easily accessible for deskside and network support teams.
Develop processes for continual knowledge management, including tracking and updating software licenses, asset inventory, and documentation of support procedures.
Implement and maintain performance monitoring tools to ensure high availability and optimal system performance.
Conduct load testing and capacity planning to meet current and anticipated future demands.
Use site reliability engineering best practices to automate repetitive tasks, reduce manual effort, and improve service uptime.
Serve as a multi-skilled team member capable of working across service domains to address deskside support, inventory management, network operations, and security-related challenges.
Develop custom tools and scripts to automate processes and support the deskside and network teams.
Collaborate with stakeholders to understand requirements, design effective solutions, and continuously improve service quality.
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
5+ years of experience in a Site Reliability Engineer, Systems Engineer, or similar role with a focus on IT service delivery and infrastructure.
Proven expertise in real-time change management, with a solid understanding of ITIL practices.
Hands-on experience with workflow automation tools, configuration management (e.g., Ansible, Puppet), and scripting (e.g., Python, PowerShell).
Familiarity with knowledge base management, inventory management systems, and software license tracking.
Strong understanding of networking principles, including experience with network troubleshooting and security (ISSO) practices.
Excellent problem-solving skills with the ability to work in a fast-paced environment and handle multiple responsibilities.
Strong communication skills, able to interact effectively with technical and non-technical stakeholders.
Company
Altech Solutions
Altech Solution develops & manages platforms for Online Betting and all Related Technologies, offers UI/XD Design, Graphic Design, etc.
Funding
Current Stage
Early StageCompany data provided by crunchbase