Sr. AWS DevOps Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Finch AI · 3 months ago

Sr. AWS DevOps Engineer

Finch AI is a fast-growing software development organization focused on innovative ways to interact with information. They are seeking a seasoned Senior AWS DevOps Engineer to lead the design, automation, and maintenance of AWS environments, ensuring infrastructure robustness, security, and scalability while optimizing costs and mentoring engineers.

AnalyticsArtificial Intelligence (AI)Big DataBusiness IntelligenceMachine LearningNatural Language ProcessingPredictive AnalyticsSaaSSoftwareText Analytics
badNo H1BnoteSecurity Clearance RequirednoteU.S. Citizen Onlynote

Responsibilities

Design, build, and maintain AWS cloud environments with a focus on automation and security
Automate provisioning and management of AWS resources using Infrastructure-as-Code (IaC) with Terraform (familiarity with AWS CloudFormation and AWS CDK a plus)
Assist with implementing proactive security measures, including vulnerability remediation, patching, and firewall rule management
Maintain compliance with security standards and internal policies, supporting security audits and assessments
Monitor AWS cost usage and provide cost optimization recommendations, ensuring efficient resource scaling
Maintain and test backup and disaster recovery procedures to ensure business continuity
Support and participate in SES IT Service Continuity drills and report on system resilience
Implement performance monitoring and alerting using tools like Datadog, AWS CloudWatch, and Splunk
Tune AWS resources for optimal performance, ensuring reliability and scalability of cloud services
Respond to Severity 1 & 2 incidents according to predefined SLAs and ensure rapid resolution
Conduct Root Cause Analysis (RCA) and document mitigation strategies for major incidents
Improve system reliability by analyzing and addressing infrastructure bottlenecks
Manage CI/CD pipelines to automate software deployment and infrastructure updates
Collaborate in an Agile Scrum environment, participating in sprint planning, backlog grooming, and retrospectives
Work closely with developers to integrate infrastructure automation into the software development
Maintain comprehensive AWS infrastructure documentation, including architectural diagrams and firewall configurations
Securely manage access credentials and account privileges, ensuring adherence to security best practices
Ensure all changes follow compliance frameworks and are properly documented in Git and JIRA

Qualification

AWS servicesTerraformCloud securityDisaster recoveryCI/CD pipelinesCost optimizationLinux administrationAgile ScrumIncident responseRoot Cause AnalysisMonitoring toolsDocumentationCollaboration

Required

Must be eligible for a US Security Clearance (US Citizenship required)
Hands-on experience with Terraform for infrastructure automation
Deep expertise in AWS services
Strong background in cloud security, cost management, and disaster recovery
Experience migrating cloud workloads to AWS
Mentoring engineers
Improving DevOps processes
Collaborating with development teams in an Agile Scrum environment
Design, build, and maintain AWS cloud environments with a focus on automation and security
Automate provisioning and management of AWS resources using Infrastructure-as-Code (IaC) with Terraform
Assist with implementing proactive security measures, including vulnerability remediation, patching, and firewall rule management
Maintain compliance with security standards and internal policies, supporting security audits and assessments
Monitor AWS cost usage and provide cost optimization recommendations
Maintain and test backup and disaster recovery procedures to ensure business continuity
Support and participate in SES IT Service Continuity drills and report on system resilience
Implement performance monitoring and alerting using tools like Datadog, AWS CloudWatch, and Splunk
Tune AWS resources for optimal performance, ensuring reliability and scalability of cloud services
Respond to Severity 1 & 2 incidents according to predefined SLAs and ensure rapid resolution
Conduct Root Cause Analysis (RCA) and document mitigation strategies for major incidents
Improve system reliability by analyzing and addressing infrastructure bottlenecks
Manage CI/CD pipelines to automate software deployment and infrastructure updates
Collaborate in an Agile Scrum environment, participating in sprint planning, backlog grooming, and retrospectives
Work closely with developers to integrate infrastructure automation into the software development
Maintain comprehensive AWS infrastructure documentation, including architectural diagrams and firewall configurations
Securely manage access credentials and account privileges, ensuring adherence to security best practices
Ensure all changes follow compliance frameworks and are properly documented in Git and JIRA
Deep experience with AWS infrastructure and services such as EC2, VPC, ALB, Lambda, SSM, EKS, ECS, CF, etc
Virtualization and containerization using Docker, AWS ECS, Kubernetes, AWS EKS, and Fargate
6+ years' experience in a specifically-DevOps role supporting software development and distributed applications

Preferred

8+ years of AWS cloud engineering experience, with a strong background in DevOps & infrastructure automation
Hands-on experience and fluency with Terraform (familiarity with CloudFormation & AWS CDK a plus)
Extensive experience with containerization and orchestration tools (Docker, Dockerfile AWS ECS, Fargate) and with building secure containers and sidecars
Expertise in AWS security, including IAM, firewall management, vulnerability scanning, and compliance
Experience in cost monitoring and optimization of AWS resources
Strong knowledge of disaster recovery strategies, backup management, and system resilience planning
Experience with creating CI/CD pipelines and automation tools like Jenkins, or Code Pipeline
Proficiency in Linux administration (Amazon Linux, RedHat, Rocky, Ubuntu)
Familiarity with monitoring and logging tools such as Datadog, Splunk, CloudWatch, and AWS Config
Strong troubleshooting skills and experience with incident response & Root Cause Analysis (RCA)
AWS certifications such as AWS Certified Solutions Architect or AWS Certified DevOps Engineer
Experience working in a customer-facing role or a managed services environment
Prior experience migrating cloud workloads from on-premises or legacy systems to AWS
Bachelor's Degree in a related field or equivalent experience

Benefits

Health, dental, vision, long and short-term disability, 401k matching, basic and supplemental life insurance, employee assistance program
Competitive salary and benefits package.
Liberal leave policy
Ability to work remote/hybrid

Company

Finch AI

twittertwittertwitter
company-logo
At Finch AI, we think like analysts. So we build tools that accelerate their workflows, that never get tired, and that dramatically improve outcomes.

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
Scott Lightner
Chief Technology Officer
linkedin
Company data provided by crunchbase