Site Reliability Engineer @ Honeywell | Jobright.ai
Site Reliability Engineer jobs in Atlanta, GA
88 applicants
This job has closed.

Honeywell · 1 week ago

Site Reliability Engineer

Aerospace · Electronics
Growth Opportunities

Responsibilities

Hands-on design, analysis, development, and troubleshooting of highly distributed large-scale production systems and event-driven, cloud-based services
Primarily Linux administration: manage a fleet of Linux and Windows VMs as part of the application solutions
Participate in pull requests that support site reliability goals
Advocate IaC (Infrastructure as Code) and CaC (Configuration as Code) practices within Honeywell HCE
Own reliability, uptime, system security, cost, operations, capacity, and performance analysis
Monitor and report on service level objectives for given application services. Work with the business, Technology teams, and product owners to establish key service level indicators.
Ensure the repeatability, traceability, and transparency of infrastructure automation
Support on-call rotations for operational duties that have not been addressed with automation
Support healthy software development practices, including complying with the chosen software development methodology (Agile, or alternatives), building standards for code reviews, work packaging, etc.
Create and maintain monitoring technologies and processes that improve visibility into application performance and business metrics and keep operational workload in check.
Partner with security engineers to develop plans and automation that respond aggressively and safely to new risks and vulnerabilities.
Develop, communicate, collaborate, and monitor standard processes to promote the long-term health and sustainability of operational development tasks.
Participate in technical training events, game day scenarios, and professional conferences

Qualifications

System Administration, Application Development, Infrastructure Development, Code Writing, Infrastructure Automation, Terraform, CodeDeploy, Puppet, Ansible, Chef, Container Orchestration, Kubernetes, Openshift, AKS, EKS, Docker, Vagrant, Etcd, Zookeeper, Cloud Administration, Linux Administration, Container Management, Cloud Technologies, Continuous Deployment, Database Operations, Agile Deployment Processes, Infrastructure Monitoring, Systems Engineering, Escalation Response Plans, Production Infrastructure Management

Required

5+ years of experience in system administration, application development, infrastructure development, or related areas
3+ years of experience reading, understanding, and writing code in the same areas
3+ years of mastery of infrastructure automation technologies (such as Terraform, CodeDeploy, Puppet, Ansible, Chef)
3+ years of expertise in container and container-fleet orchestration technologies (such as Kubernetes, Openshift, AKS, EKS, Docker, Vagrant, etcd, ZooKeeper)
5+ years of cloud- and container-native Linux administration, build, and management skills

Preferred

Proficiency in Azure Databricks for designing, developing, and maintaining efficient data export pipelines and processes within our Azure Databricks environment.
Understanding of AI modeling, Python, database modeling, and Azure AI
Versatility with troubleshooting diverse sets of hosting technologies strongly desired. These include web server platforms, application platforms, operating systems, network components, virtualization technologies, storage, and database platforms.
Expertise with cloud-based, continuous-deployment software development lifecycles (e.g., CI/CD)
Cloud database operations and deployment experience (RDS MySQL/Postgres/Aurora); caching operations and deployment experience (memcache, Redis)
Expertise with Lean/Agile deployment processes (blue/green, ZDT, canary, load balancer/DNS strategies, A/B testing, feature flagging methodologies)
Familiarity with site and infrastructure monitoring systems (such as ELK, Datadog, AppDynamics, New Relic, Splunk, Sumo Logic, Grafana)
Strong problem-solving, root cause analysis, and systems engineering skills
Excellent presentation and communication skills
Ability to design and manage escalation response plans that cover monitoring, reaction, response, remediation, and retrospectives in culturally aligned (proactive, customer-focused, collaborative, data-driven) ways.
Demonstrated expertise building and managing highly scaled production infrastructure in the cloud (Azure required; AWS)
Expertise with SDLC branching, SCM, and code deployment systems (Bitbucket, git/gitflow, Jenkins, CircleCI, TravisCI, etc.)

Benefits

Medical
Vision
Dental
Mental Health
Paid Vacation
401k Plan/Retirement Benefits
Career Growth
Professional Development

Company

Honeywell

Honeywell International is a technology and manufacturing company that offers energy, safety, and security solutions and technologies.

Funding

Current Stage: Public Company
Total Funding: $11.4M
2017-10-11 · IPO · lse:HON
2009-10-27 · Grant · $11.4M

Leadership Team

Shane Tedjarati · President & CEO - High Growth Regions
Vimal Kapur · Chief Executive Officer
Company data provided by Crunchbase