Sr Engineer, Cloud AI Architect jobs in United States
cer-icon
Apply on Employer Site
company-logo

Lenovo · 11 hours ago

Sr Engineer, Cloud AI Architect

Lenovo is a global technology powerhouse focused on delivering Smarter Technology for All. The Sr Engineer, Cloud AI Architect will be responsible for designing and implementing advanced AI architecture, specifically for large language models and generative AI solutions, while collaborating with cross-functional teams to drive innovation.

ComputerConsumer ElectronicsElectronicsHardwareMobileWearables
check
H1B Sponsor Likelynote

Responsibilities

Produce high quality architecture specifically on AI, including LLMs, Inference Engineering and Prompt Engineering and design specifications
Experience finetuning an Opensource LLM
Architect and design end to end Generative AI products, applications and solutions for specific business needs and provide implementation guidance during delivery
Designing and implementing autonomous AI agents capable of reasoning, planning, acting, and adapting to achieve complex objectives
Integrating Large Language Models (LLMs) with memory, tool-use, and multi-step planning architectures, leveraging their natural language understanding and generation capabilities as the agent's "brain"
Utilizing MCP servers or similar mechanisms to enable AI agents to interact with external enterprise applications, databases, and APIs
Designing and implementing agentic workflows where multiple agents communicate and collaborate using the A2A protocol to achieve shared goals
Analyze and evaluate the performance of Gen AI systems and provide design recommendations
Research, design, and implement machine learning models and algorithms, with a focus on LLM and Deep Learning techniques
Collaborate with cross-functional teams to identify business problems and opportunities where machine learning solutions can add value
Develop and deploy scalable and efficient machine learning pipelines for processing and analyzing large volumes of structured and unstructured data
Perform data preprocessing, feature engineering, and model training/validation using state-of-the-art machine learning frameworks and libraries
Evaluate model performance and interpret results to derive actionable insights and recommendations
Estimate cost of using LLMs in different forms for business use cases and the viability, develop cost models for different usage patterns
Design and prototype reusable components for LLM based solution patterns
Architect components of an LLM solution to address Responsible AI Security
Collaborate seamlessly with diverse, cross-functional teams to accurately identify and prioritize requirements, ensuring that the language model meets the needs and expectations of various stakeholders
Create and maintain comprehensive technical documentation that comprehensibly captures the intricate details of the language model, facilitating seamless understanding, efficient troubleshooting, and future development
Harness the power of transformer architecture, a cutting-edge deep learning model widely employed in natural language processing and computer vision, to optimize the language model's performance and efficiency

Qualification

Cloud AI ArchitectureLarge Language Models (LLMs)Generative AI SolutionsMachine Learning FrameworksAgentic AI FrameworksModel Context Protocol (MCP)Agent-to-Agent (A2A) ProtocolsData PreprocessingDockerGit/SVN/CVSLinux ServersCommunication Skills

Required

BA/BS degree in Computer Science or related software engineering field, or equivalent experience
Minimum of 6 years of experience in designing deploying AI / ML solutions using at least one cloud vendor
Minimum of 1 year of experience in the LLM and Generative AI space
Minimum of 1 year of experience architecting and operationalizing LLM driven application architecture patterns
Deep understanding of AI/ML concepts, including LLMs, transformers, and prompt engineering
Experience with agentic AI frameworks such as LangChain, AutoGPT, Agentforce, LangGraph, or similar
Familiarity with the Model Context Protocol (MCP) standard and its role in providing context to AI agents
Experience with Agent-to-Agent (A2A) communication protocols and frameworks, enabling multi-agent collaboration and task delegation
2+ Experience with specific AI/ML frameworks and tools (e.g., TensorFlow, PyTorch, scikit-learn, MLflow)
Bachelor's degree or equivalent work experience (minimum of 12 years), or an associate degree with a minimum of 6 years of equivalent work experience
Demonstrable experience implementing and maintaining globally distributed, highly redundant, scalable cloud-hosted solutions
Ability to demonstrate knowledge of a container technology such as Docker
Proficient technical knowledge of current tools and best practices at scale
Demonstrable experience working with distributed teams 3rd-party vendors
Experience with monitoring and logging cloud services and infrastructure
Experience using code management tooling such as Git/SVN/CVS
Significant experience working with Linux servers and command lines
Strong written and verbal communication skills

Company

Lenovo Group is a computer technology company that manufactures personal computers, smartphones, televisions, and wearable devices.

H1B Sponsorship

Lenovo has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (76)
2024 (52)
2023 (75)
2022 (82)
2021 (58)
2020 (67)

Funding

Current Stage
Public Company
Total Funding
$3.35B
Key Investors
Alat
2025-01-08Post Ipo Debt· $2B
2024-04-01Post Ipo Debt· $500M
2017-10-03Post Ipo Equity· $500M

Leadership Team

leader-logo
Yang Yuanqing
Chairman & CEO
linkedin
leader-logo
Greg Huff
CTO, CSO, and SVP of Development, Quality, and Customer Care, Infrastructure Solutions Group
linkedin
Company data provided by crunchbase