Achira · 3 months ago
SWE - Distributed
Achira is a company dedicated to reshaping the future of drug discovery through advanced machine learning technologies. They are seeking a Software Engineer to architect and build distributed computing infrastructure for machine learning data generation and model training, ensuring efficiency and reliability across large-scale systems.
Artificial Intelligence (AI)BiotechnologyMedical
Responsibilities
Architect & Build: Design, implement, and optimize distributed compute infrastructure for ML data processing, training, and fine-tuning
Optimize & Monitor: Improve cluster observability, scheduling, and resource utilization (CPU/GPU/TPU)
Compute Efficiency: Research and implement cost-efficient compute solutions (spot instances, auto-scaling, multi-cloud strategies)
Tooling: Develop tools for monitoring, debugging, and performance tuning of large-scale ML workloads
Collaboration: Collaborate with ML engineers to accelerate training pipelines and reduce bottlenecks
Innovation: Stay current with emerging technologies in distributed computing (e.g., Ray, Kubernetes, Spark, Slurm) and apply them strategically
Qualification
Required
Experience in building or working with distributed computing frameworks (e.g., Ray, Dask, Celery)
Good grasp of parallel computing, job scheduling, and resource management
Comfortable identifying and resolving performance issues in distributed systems (profiling, bottlenecks, network overhead)
Implemented solutions using cloud compute platforms (AWS, GCP, Azure) and cluster orchestration (Kubernetes, Slurm)
Familiar with popular ML frameworks (PyTorch, TensorFlow, or JAX) and MLOps best practices such as model deployment and GPU performance monitoring
Company
Achira
Achira is a startup company that combines AI and physics-based methods for drug discovery.
Funding
Current Stage
Early StageTotal Funding
$33M2025-02-24Seed· $33M
Leadership Team
Recent News
2025-10-16
Company data provided by crunchbase