Melwy · 3 hours ago
Senior Machine Learning Engineer - Language Models
Maximize your interview chances
Insider Connection @Melwy
Get 3x more responses when you reach out via email instead of LinkedIn.
Responsibilities
Develop and implement state-of-the-art LLM architectures and training methods
Design and execute experiments to push the boundaries of LLM capabilities
Create efficient, scalable code for LLM training and inference
Build tools and infrastructure to support rapid prototyping and research
Bridge the gap between research concepts and practical applications
Stay current with the latest advancements in LLM research and contribute to the field
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
PhD or Postdocs in Machine Learning, Computer Sciences, Numerical Analysis, Functional Analysis, Signal Processing, Control Theory, Statistics, Dynamical Systems, Mathematics, Statistical Physics, Neurosciences, or other quantitative fields, or equivalent experiences in industry
Strong background in deep learning, natural language processing, and LLMs
Experience with LLM frameworks (e.g., PyTorch, TensorFlow)
Proficiency in CUDA (for Nvidia GPUs), XLA (for Google TPUs), or AWS Neuron (for AWS Inferentia/Trainium)
Familiarity with distributed computing and large-scale model training
Track record of contributions to research projects or publications in the field of AI/ML
Self-motivated with the ability to work independently in a remote environment with minimal supervision
Strong online communication skills and a collaborative mindset
Passion for pushing the boundaries of AI technology
Preferred
Big Tech experience is a plus (Google, Microsoft, Meta, Baidu...)
Benefits
Opportunity to work on groundbreaking LLM technology
Fully remote work environment with flexible hours
Competitive salary and equity package
Regular virtual team-building events and knowledge-sharing sessions
Support for continued learning and professional development
Chance to make a significant impact in a rapidly growing field
Company
Melwy
Melwy (ex-Startcrowd) is an online lab in artificial intelligence and data science, for precision medicine and drug discovery.
Funding
Current Stage
Early StageCompany data provided by crunchbase