Apply on Employer Site

A1 · 1 day ago

Founding AI/ML Research Engineer

United States

Full-time

Remote

Senior Level

A1 is a self-funded, independent AI group dedicated to creating impactful consumer AI applications. The Founding AI/ML Research Engineer will define the technical direction of the company, focusing on model selection, training strategies, and system architecture while building innovative AI solutions.

Computer Software

Responsibilities

Build end-to-end training pipelines: data → training → eval → inference

Design new model architectures or adapt open-source frontier models

Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation)

Architect scalable inference systems using vLLM / TensorRT-LLM / DeepSpeed

Build data systems for high-quality synthetic and real-world training data

Develop alignment, safety, and guardrail strategies

Design evaluation frameworks across performance, robustness, safety, and bias

Own deployment: GPU optimization, latency reduction, scaling policies

Shape early product direction, experiment with new use cases, and build AI-powered experiences from zero

Explore frontier techniques: retrieval-augmented training, mixture-of-experts, distillation, multi-agent orchestration, multimodal models

Qualification

Deep learningTransformer architecturesPyTorchLarge model trainingDistributed training frameworksGPU optimizationSoftware engineeringOpen-source contributionsScientific computingRLHF pipelinesMultimodal modelsLarge-scale data processingResearch lab experience

Required

Strong background in deep learning and transformer architectures

Hands-on experience training or fine-tuning large models (LLMs or vision models)

Proficiency with PyTorch, JAX, or TensorFlow

Experience with distributed training frameworks (DeepSpeed, FSDP, Megatron, ZeRO, Ray)

Strong software engineering skills — writing robust, production-grade systems

Experience with GPU optimization: memory efficiency, quantization, mixed precision

Comfortable owning ambiguous, zero-to-one technical problems end-to-end

Preferred

Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer)

Contributions to open-source ML libraries

Background in scientific computing, compilers, or GPU kernels

Experience with RLHF pipelines (PPO, DPO, ORPO)

Experience training or deploying multimodal or diffusion models

Experience in large-scale data processing (Apache Arrow, Spark, Ray)

Prior work in a research lab (Google Brain, DeepMind, FAIR, Anthropic, OpenAI)

Company

A1

A1 is research and product group focused on building essential, next-gen applications that benefits the wider society, not the exclusive few.

Founded in 2025

Malaysia, MY

11-50 employees

Funding

Current Stage

Early Stage

Company data provided by crunchbase