Sharp Decisions · 16 hours ago
DevOps Engineer T3 (499)
Sharp Decisions is seeking a DevOps Engineer T3 to join their Cloud Infrastructure Platform Engineering team. The role involves designing and deploying robust infrastructure solutions while collaborating with product teams to deliver adaptable and scalable code.
Responsibilities
Write, modify, and run Terraform to deploy and maintain infrastructure across multiple cloud service providers
Be able to debug errors when deploying Terraform
Run Ansible playbooks to perform configuration and maintenance tasks
Be able to develop new Ansible roles and troubleshoot errors as they occur
Use GitLab with multiple repositories, and create and improve CI pipelines that use runners to maintain infrastructure
Configure, build, and deploy containerized services using Docker and/or Kubernetes
Develop standard operating procedures, knowledge base articles, technical bulletins, and other documents in support of the infrastructure
Operate in a security-first mindset, performing all other responsibilities with security in mind
Implement monitoring, configuration management, and logging capabilities to manage a multi-tenant cloud infrastructure across multiple cloud service providers
Use generative AI elements to increase efficiency and speed, improve accuracy and consistency, enhance security, and better manage resources where practical and within security boundary guidelines
Qualifications
Required
Must be a US citizen
Knowledge of AWS foundational technologies (EC2, S3, IAM, Route53, VPC)
Knowledge of Azure foundational technologies (Virtual Networks, Application Gateway, Storage Account, Virtual Machines, Load Balancer, Resource Groups)
Knowledge of GCP foundational technologies (Projects, Compute Engine, GKE, Cloud Storage, VPC)
Proficient in using Terraform to build modules for deploying infrastructure in AWS, Azure, and/or GCP
Proficient in CI/CD technologies including pipeline design and implementation using tools such as GitLab runners and ConcourseCI
Working knowledge of HashiCorp Vault for secret management, including secure storage and retrieval of credentials, API keys, and certificates
Experience with Ansible for automation, configuration management, and playbook development
Proficient in a scripting or programming language for automation (Bash, Python)
Experience with RBAC design and Single Sign-On (SSO) for identity management
Investigative skills to review detailed infrastructure and configurations
Experience with logging and log processing in a cloud environment
Working knowledge of networking: IP routing, subnetting, network troubleshooting
Strong Linux OS skills (deployment, configuration, and troubleshooting)
Knowledge of ITIL-compliant ticket management using ServiceNow, including incident resolution, change requests and sprint planning
Bachelor's degree in Computer Science or equivalent practical experience
5-8+ years of experience with the following technologies: Terraform, Ansible, CI/CD, Secrets Management, Identity Management (SSO)
Experience with Linux operating system internals and administration (e.g., filesystems, inodes, system calls, hardening) and networking (e.g., TCP/IP, routing, DNS, network topologies, SDN)
Preferred
Expertise in designing, analyzing and troubleshooting large-scale distributed systems
Ability to debug and optimize code and automate routine tasks
Systematic problem-solving approach coupled with strong communication skills and a sense of ownership and drive
Hands-on experience developing infrastructure as code (IaC)