Apply on Employer Site

A1 · 1 day ago

Founding Machine Learning Engineer

United States

Full-time

Remote

Senior Level

A1 is a self-funded, independent AI group focused on building a new consumer AI application with global impact. The Founding Machine Learning Engineer will shape the core technical direction of the company, including model selection, training strategy, and infrastructure, while having full autonomy to experiment with frontier models and design systems for scalable deployment architectures.

Computer Software

Responsibilities

Build end-to-end training pipelines: data → training → eval → inference

Design new model architectures or adapt open-source frontier models

Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation)

Architect scalable inference systems using vLLM / TensorRT-LLM / DeepSpeed

Build data systems for high-quality synthetic and real-world training data

Develop alignment, safety, and guardrail strategies

Design evaluation frameworks across performance, robustness, safety, and bias

Own deployment: GPU optimization, latency reduction, scaling policies

Shape early product direction, experiment with new use cases, and build AI-powered experiences from zero

Explore frontier techniques: retrieval-augmented training, mixture-of-experts, distillation, multi-agent orchestration, multimodal models

Qualification

Deep learningTransformer architecturesPyTorchDistributed training frameworksGPU optimizationSoftware engineeringLLM fine-tuningModel architecture designData systems developmentEvaluation frameworksOpen-source contributionsScientific computingRLHF pipelinesLarge-scale data processingSoft skills

Required

Strong background in deep learning and transformer architectures

Hands-on experience training or fine-tuning large models (LLMs or vision models)

Proficiency with PyTorch, JAX, or TensorFlow

Experience with distributed training frameworks (DeepSpeed, FSDP, Megatron, ZeRO, Ray)

Strong software engineering skills — writing robust, production-grade systems

Experience with GPU optimization: memory efficiency, quantization, mixed precision

Comfortable owning ambiguous, zero-to-one technical problems end-to-end

Preferred

Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer)

Contributions to open-source ML libraries

Background in scientific computing, compilers, or GPU kernels

Experience with RLHF pipelines (PPO, DPO, ORPO)

Experience training or deploying multimodal or diffusion models

Experience in large-scale data processing (Apache Arrow, Spark, Ray)

Prior work in a research lab (Google Brain, DeepMind, FAIR, Anthropic, OpenAI)

Company

A1

A1 is research and product group focused on building essential, next-gen applications that benefits the wider society, not the exclusive few.

Founded in 2025

Malaysia, MY

11-50 employees

Funding

Current Stage

Early Stage

Company data provided by crunchbase