Site Reliability Engineer @ NCR Atleos | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
Site Reliability Engineer jobs in Atlanta, GA
200+ applicants
company-logo

NCR Atleos · 4 days ago

Site Reliability Engineer

Wonder how qualified you are to the job?

ftfMaximize your interview chances
BankingFinTech

Insider Connection @NCR Atleos

Discover valuable connections within the company who might provide insights and potential referrals, giving your job application an inside edge.

Responsibilities

You will be responsible for maintaining and scaling production services and servers for complex and high throughput cloud services.
You will bridge and own the union between development, quality, security, and operations.
You will write automation code for provisioning and operating infrastructure at massive scale.
You will improve scalability, service reliability, capacity, and performance.
You are not just an operator, you’re an experienced software engineer focused on operations.
You will initiate and contribute to continuous improvement of software delivery processes and practices.
You will use automation extensively to design, configure, manage, and monitor systems to support product development teams.
You will participate in disaster recovery planning and execution.
You will be responsible for maintaining/patching servers supporting SaaS products.
You will work hand-in-hand with all teams to ship code to production using CI/CD and AppSec tooling.
You will collaborate with development teams to create SLIs, SLOs, and SLAs.
You will provide timely assistance and remediation solutions during critical situations and production incidents.
You will develop monitoring architecture, implement monitoring agents, build dashboards, manage escalations and alerts.
You will participate in incident management, driving root cause analysis and risk management processes.
You will participate in a rotating on-call schedule during off-hours.
You may periodically need to remote-in to systems if a production outage occurs.

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

DevOps/SREGCP/AWS/AzureJavaCC++.NETPythonRubyGoShellPerlJavaScriptCloud virtualizationPaaSDockerKubernetesOpenShiftLinuxShell ScriptingPKI TLS/SSLFirewallsLoad balancersBackupDesigning distributed systemsAnalyzing distributed systemsRunning distributed systemsOrchestration toolsAutomation toolsConfiguration management toolsGit

Required

BS degree in Computer Science or related technical field or 5 years prior relevant experience
Extensive experience in a DevOps / SRE role with demonstrable experience in deploying and managing large scale production environments in GCP, AWS or Azure and Multi Datacenter environment
Experience developing and debugging code (i.e. one or more of the following: Java, C, C++, .NET, Python, Ruby, Go, Shell, Perl, JavaScript)
2+ years deploying and supporting high traffic, scalable web applications/services
2+ years with cloud virtualization and PaaS
2+ years with AWS/GCP/Azure
2+ years with Docker, Kubernetes and early versions of OpenShift
Experience with Linux, Shell Scripting, PKI TLS/SSL, Network, firewalls, load balancers and backup
Experience in designing, analyzing and running large-scale distributed systems
Experience hosting and solving problems with public-facing services securely in Azure, AWS or GCP
Experience with orchestration, automation, and configuration management tools like git, Fabric and Ansible (or Puppet, Chef, Terraform, Helm or related technology)
Excellent analysis, debugging, root-cause identification, and troubleshooting skills
Experience with Kubernetes, system virtualization, on-prem and/or hybrid cloud computing, cloud Identity and security system, cloud monitoring and logging, and/or local/cloud storage
Experience with one or more CI tools Jenkins, Artifactory, Harness, CloudBuild
Experience with application disaster recovery, migration, roll-back plans, expansion, routine deployments, and system upgrades
Experience with log management, including aggregation, alerting, and graphing (i.e Sensu/StackDriver/Prometheus/ELK/TICK stacks)

Preferred

Bonus points for experience with Cassandra, Elasticsearch or Kafka
Extra bonus points for Cloud certifications and exposure to Harness

Benefits

Medical Insurance
Dental Insurance
Life Insurance
Vision Insurance
Short/Long Term Disability
Paid Vacation
401k

Company

NCR Atleos

twittertwittertwitter
company-logo
NCR Atleos provides banks and retailers with technological solutions to reduce operational complexity and increase customer experiences.

Funding

Current Stage
Public Company
Total Funding
$1.35B
2023-10-17IPO· nyse:NATL
2023-09-18Post Ipo Debt· $1.35B

Leadership Team

leader-logo
Tim Oliver
Chief Executive Officer and President
linkedin
leader-logo
Paul Gardiner
Chief Technology Officer
linkedin
Company data provided by crunchbase
logo

Orion

Your AI Copilot