Nextlink Internet · 3 months ago
Development Operations Engineer
Nextlink Internet is a Texas-based Internet Service Provider delivering high-speed internet and voice services across multiple states. They are seeking a Development Operations Engineer to design, build, and operate CI/CD, GitOps, container, and infrastructure-as-code platforms, while collaborating with various teams to automate workflows and improve service delivery.
Wireless
Responsibilities
Develop, test, and maintain up‑to‑date device models for Nextlink monitoring systems (SNMP, API, streaming telemetry)
Collaborate with Engineering, Field, and NOC to ensure correct monitoring, data collection, thresholds, and alert standards
Own platform architecture for Zabbix (server, proxies, and backing database, e.g., PostgreSQL with a time-series extension), including HA/failover, housekeeping, and retention policies
Create and maintain device templates (SNMPv3, API, JMX/IPMI/SSH as applicable) with low-level discovery (LLD), item/trigger prototypes, macros, preprocessing, and escalation logic that match Nextlink standards
Design proxy placement and discovery to cover POPs/datacenters and edge sites; ensure secure comms (TLS, PSKs/certs) and reliable buffering
Implement trigger dependencies, event correlation, maintenance windows, and SLA/service maps; tune thresholds from SLOs and NOC feedback
Manage templates, host onboarding, actions, and maintenance via the Zabbix API and Git-based workflows; integrate with CI/CD to promote monitoring changes through environments
Connect Zabbix to ChatOps (Teams/Slack), ticketing, and paging; publish dashboards for NOC/leadership; export metrics/events to the observability stack where useful
Enforce RBAC, SNMPv3, secret rotation, and least-privilege API tokens; document and test upgrades and rollbacks for zero/minimal downtime
Identify, develop, and maintain scripts/tools to automate processes and network/device changes (Python, Bash, PowerShell)
Enforce configuration baselines, drift detection, and golden‑config rollouts; integrate change control and approvals
Build and maintain CI/CD pipelines (reusable templates, quality gates, artifact/versioning, blue/green & canary)
Implement GitOps for Kubernetes and network automation (Argo CD/Flux) using Helm/Kustomize and policy controls
Support ephemeral environments, infrastructure testing, and progressive delivery with feature flags as applicable
Plan, deploy, and maintain physical servers and datacenter assets (capacity, ordering, lifecycle, firmware)
Provision cloud resources (Azure, AWS, GCP) using Terraform with least‑privilege identities and tagging/FinOps standards
Implement secure networking (VNet/VPC, private endpoints, peering, DNS/TLS, load balancing, WAF)
Own metrics, logs, traces, and profiling via Prometheus/Grafana and ELK; leverage eBPF where appropriate
Define SLIs/SLOs, manage error budgets, and lead incident response/post‑incident reviews alongside the NOC
Embed DevSecOps: secret rotation, workload identity federation (OIDC), and least privilege across platforms
Establish software supply‑chain controls: SBOM (CycloneDX), image signing (Sigstore cosign), provenance (SLSA), and policy‑as‑code (OPA/Kyverno)
Automate vulnerability management, patching, and CIS/NIST-aligned hardening
Integrate AIOps for anomaly detection, noise reduction, and incident summarization; apply LLMs to enhance runbooks and root‑cause hypotheses
Implement ChatOps for deployments, rollbacks, and diagnostics via Teams/Slack bots with guardrails
Create standards for device configuration and proper use; test/qualify new devices before production per Nextlink LaunchPad
Document architectures, runbooks, and SOPs; provide training to stakeholders and track/report on projects
Qualifications
Required
Bachelor's degree in CS/IT/Engineering or equivalent experience
Experience designing and operating Zabbix at scale (server, proxies, HA, PostgreSQL/time-series), building templates/LLD with macros and trigger logic, and automating changes via the Zabbix API
3+ years in DevOps/SRE/Platform Engineering supporting production systems
Strong coding/scripting in Python and one additional language (e.g., Bash or PowerShell)
Hands-on with CI/CD (GitHub/GitLab), IaC (Terraform), and configuration management (Ansible)
Proficient with Linux, containers (Docker), and Kubernetes (cluster operations, Helm/Kustomize, GitOps)
Solid networking fundamentals and operations in ISP contexts (routing basics, SNMP, NetFlow/sFlow)
Experience with observability stacks (Prometheus/Grafana, ELK) and incident response
DevOps: 3 years (Required)
Ability to Commute: Weatherford, TX 76087 (Required)
Work Location: In person
Preferred
Proven Zabbix architecture work (multi-proxy, distributed sites), API-driven onboarding, and integrations with ChatOps/ticketing
Experience with AKS/EKS/GKE and GitOps controllers (Argo CD/Flux)
Knowledge of zero-trust patterns, workload identity with OIDC, and secrets management (Key Vault, Vault)
Familiarity with SBOMs, SLSA, image signing, and policy-as-code frameworks (OPA/Kyverno, Conftest)
Exposure to AIOps tooling and ChatOps automation
Understanding of network automation APIs and telemetry for vendor devices common to ISPs
Benefits
401(k) matching
Dental insurance
Health insurance
Health savings account
Life insurance
Paid time off
Retirement plan
Vision insurance
Company
Nextlink Internet
Nextlink is a high-speed Internet service provider with next-generation phone services.
Funding
Current Stage: Late Stage
Total Funding: $100M
Key Investors: Cable ONE, CanAm Enterprises
2024-08-15 · Series Unknown · $20M
2024-08-15 · Debt Financing · $80M
Company data provided by Crunchbase