CLOVEHITCH
Senior Backend & Data Engineer
CLOVEHITCH is a Service-Disabled Veteran-Owned Small Business focused on providing professional solutions in training and talent acquisition. They are seeking a Senior Backend and Data Engineer to architect a new Reporting Portal, managing data ingestion, modeling, and workflow orchestration while integrating machine learning models for compliance reviews.
Industrial Machinery Manufacturing
Responsibilities
Data Architecture & Pipeline Development
Medallion Architecture: Design and implement a Medallion-style data architecture (Bronze/Silver/Gold) to manage raw transaction data, immutable snapshots, and curated analytics
Pipeline Engineering: Build and maintain data pipelines that ingest, clean, and transform data from the Digital Transformation Bridge (DTB) and other sources
Immutable Audit Trails: Ensure all files and snapshots are stored in a non-destructive environment with strict versioning to preserve original evidence
Performance Optimization: Implement high-speed querying (e.g., AWS Athena) to enable complex ad-hoc trend analyses against raw data in seconds
Backend & Infrastructure Engineering
API Development: Build and maintain a high-concurrency backend using Node.js and Express
Secure Access: Implement robust Identity & Access Control layers, including mandatory Multi-Factor Authentication (MFA) and Role-Based Access Control (RBAC)
Infrastructure as Code: Provision and manage cloud infrastructure using Terraform within an AWS Commercial Cloud environment (EKS, RDS, S3, MSK)
Submission Engine: Develop a robust engine for multi-format uploads (PDF, DOCX, JPEG) featuring draft persistence and real-time validation logic
AI, ML & Data Science Support
LLM & NLP Integration: Build systems to parse and index unstructured text within uploaded documents using Large Language Models and NLP tools
Automated Analysis: Develop and operationalize machine learning models to assign compliance "confidence scores" and flag high-risk items for manual review
Data Science Enablement: Support the internal Data Science Workbench by providing Python notebook access and direct data hooks for risk analysts
Cross-Team Collaboration
Frontend Partnership: Work daily with the Frontend Engineer to architect data flows, manage APIs, and ensure the UI can handle complex list management and in-browser file rendering
Stakeholder Communication: Present technical tradeoffs and implementation paths to both technical and non-technical team members
Agile Ownership: Participate in technical discussions and architectural decisions in an iterative, startup-style environment
Qualifications
Required
4+ years of professional experience in data engineering or data-focused backend engineering, ideally in a startup or fast-paced context
Strong proficiency in Node.js/Express and writing Python for data workflows
Deep experience with SQL, data modeling, and workflow orchestration tools
Comfort provisioning infrastructure and deploying workflows to cloud platforms (specifically AWS)
Proficiency across modern storage technologies (PostgreSQL, S3, Lakehouses) and handling varied file formats
Ability to manage multiple projects, adapt to shifting priorities, and deliver high-quality work on a Fall/Winter 2026 deployment schedule
Experience building and deploying Large Language Models (LLMs) or automated scoring algorithms in a production environment
Experience with event-streaming architecture (e.g., Kafka/MSK) and API management tools (e.g., Kong or Apigee)
Company
CLOVEHITCH
CLOVEHITCH's owners come from diverse backgrounds but share one thing in common: each has direct, hands-on CONUS and OCONUS experience leading our intelligence, linguistic, IT, and administrative professionals.