MARA · 1 month ago
Lead Software Engineer – ML & Agentic Workloads
MARA is redefining the future of sovereign, energy-aware AI infrastructure. They are seeking a Lead Software Engineer to design, build, and scale systems that power agentic and intelligent workloads across their product ecosystem.
Computer Software
Responsibilities
Lead architecture and development of agentic platforms that integrate multiple models, tools, and knowledge sources into dynamic reasoning systems
Evaluate and deploy foundation and open-source models (LLMs, vision, multimodal) using efficient inference strategies and fine-tuning where applicable
Design and maintain prompt lifecycle pipelines with version control, testing, and CI/CD integration (“PromptOps”)
Build and optimize RAG systems—vector database configuration, retriever-generator orchestration, and embedding quality improvement
Implement guardrail frameworks for content safety, hallucination control, and policy enforcement across agentic workflows
Integrate and extend agentic frameworks (LangChain, LangGraph, CrewAI, AutoGen, or equivalent), both in code-based and visual orchestration environments
Collaborate with data, product, and infrastructure teams to design scalable APIs and services that enable model-driven applications
Define observability and evaluation metrics for model performance, latency, and behavior drift in production
Drive best practices for secure AI development, privacy-preserving data handling, and governance of third-party model integrations
Mentor engineers across ML, backend, and platform domains; champion continuous learning and experimentation
Qualification
Required
8+ years of professional software engineering experience, including 3+ years in ML application development or AI platform engineering
Proficiency in Python, with strong understanding of ML toolchains (PyTorch, Hugging Face, LangChain, MLflow, Ray, etc.)
Proven experience with model evaluation, fine-tuning, and deployment across cloud and on-prem environments
Hands-on experience with RAG architectures and vector databases (Weaviate, Milvus, pgvector, LanceDB, FAISS)
Deep understanding of prompt design, orchestration, and versioning using CI/CD workflows and automated testing frameworks
Familiarity with agentic systems, both code-driven and visual-builder interfaces (LangGraph Studio, Dust, Flowise, Relevance AI, etc.)
Strong knowledge of guardrail techniques (rule-based filters, policy evaluators, toxicity detection, grounding validation)
Experience deploying ML systems on Kubernetes and serverless environments with observability (Prometheus, Grafana, OpenTelemetry)
Solid understanding of API design, microservice architecture, and data pipeline integration
Excellent communication and leadership skills, with ability to translate complex ML concepts into actionable engineering outcomes
Preferred
Background in HPC, ML infrastructure, or sovereign/regulated environments
Familiarity with energy-aware computing, modular data centers, or ESG-driven infrastructure design
Experience collaborating with European and global engineering partners
Strong communicator who can bridge engineering, business, and vendor ecosystems seamlessly
Company
MARA
MARA (NASDAQ: MARA) deploys digital energy technologies to advance the world's energy systems.
H1B Sponsorship
MARA has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2024 (1)
2023 (1)
Funding
Current Stage
Growth StageRecent News
TradingView
2024-05-27
2024-05-26
2024-05-18
Company data provided by crunchbase