Aldea · 2 months ago
Senior Research Scientist (LLMs)
Aldea is a multi-modal foundational AI company focused on advancing large-language-model architectures. The role involves researching and prototyping efficient transformer variants and attention mechanisms to enhance the scalability of language models, collaborating with product and engineering teams to integrate these models into production systems.
Artificial Intelligence (AI)SoftwareSpeech Recognition
Responsibilities
Research and prototype sub-quadratic attention architectures to unlock efficient scaling of large language models
Design and evaluate efficient attention mechanisms including state-space models (e.g., Mamba), linear attention variants, and sparse attention patterns
Lead pre-training initiatives across a range of model scales from 1B to 100B+ parameters
Conduct rigorous experiments measuring the efficiency, performance, and scaling characteristics of novel architectures
Collaborate closely with product and engineering teams to integrate models into production systems
Stay at the forefront of foundational research and help shape Aldea's long-term model roadmap
Qualification
Required
Requires a Ph.D. in Computer Science, Engineering, or related field
3+ years of relevant industry experience
Deep understanding of modern sequence modeling architectures including State Space Models (SSMs), Sparse Attention mechanisms, Mixture of Experts (MoE), and Linear Attention variants
Hands-on experience pre-training large language models across a range of scales (1B+ parameters)
Expertise in PyTorch, Transformers, and large-scale deep-learning frameworks
Proven ability to design and evaluate complex research experiments
Demonstrated research impact through patents, deployed systems, or core-model contributions
Preferred
Experience with distributed training frameworks and multi-node optimization
Knowledge of GPU acceleration, CUDA kernels, or Triton optimization
Publication record in top-tier ML venues (NeurIPS, ICML, ICLR) focused on architecture research
Experience with model scaling laws and efficiency-performance tradeoffs
Background in hybrid architectures combining attention with alternative sequence modeling approaches
Familiarity with training stability techniques for large-scale pre-training runs
Benefits
Competitive base salary
Performance-based bonus aligned with research and model milestones
Equity participation
Flexible Paid Time Off
Comprehensive health, dental, and vision coverage
Company
Aldea
Aldea builds AI voice and language technology with speech-to-text, text-to-speech, and conversational interfaces.
H1B Sponsorship
Aldea has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (2)
2022 (1)
2021 (1)
2020 (1)
Funding
Current Stage
Early StageCompany data provided by crunchbase