AI/ML Infrastructure Engineer jobs in United States

NetSuite · 3 months ago

AI/ML Infrastructure Engineer

NetSuite is a leading provider of cloud-based business management software. The company is seeking an AI/ML Infrastructure Engineer to design, implement, and manage infrastructure for AI/ML workloads, keeping that infrastructure performant and working closely with other teams.

Cloud Computing, Computer, CRM, iOS, SaaS, Software

Qualifications

AI/ML infrastructure management, Containerization technologies, Linux system administration, Scripting, Automation, Networking concepts, Programming languages, DevOps practices, Problem-solving skills, Communication skills, Documentation skills

Required

Experience in scripting and automation using tools like Ansible, Terraform, and/or Kubernetes
Experience with containerization technologies (e.g., Docker, Kubernetes) and orchestration tools for managing distributed systems
Solid understanding of networking concepts, security principles, and best practices
Excellent problem-solving skills, with the ability to troubleshoot complex issues and drive resolution in a fast-paced environment
Strong communication and collaboration skills, with the ability to work effectively in cross-functional teams and convey technical concepts to non-technical stakeholders
Strong documentation skills with experience documenting infrastructure designs, configurations, procedures, and troubleshooting steps to facilitate knowledge sharing, ensure maintainability, and enhance team collaboration
Strong Linux skills with hands-on experience in Oracle Linux/RHEL/CentOS, Ubuntu, and Debian distributions, including system administration, package management, shell scripting, and performance optimization

Preferred

Strong proficiency in at least one programming language such as Python, Rust, Go, Java, or Scala
Proven experience designing, implementing, and managing infrastructure for AI/ML or HPC workloads
Understanding of machine learning frameworks and libraries such as TensorFlow, PyTorch, or scikit-learn, and of their deployment in production environments, is a plus
Familiarity with DevOps practices and tools for continuous integration, deployment, and monitoring (e.g., Jenkins, GitLab CI/CD, Prometheus)
Strong experience with High-Performance Computing systems

Company

NetSuite

NetSuite is a cloud computing company dedicated to delivering business applications over the internet.

Funding

Current Stage: Public Company
Total Funding: $157.79M
Key Investors: Meritech Capital Partners, Tako Ventures, StarVest Partners
2016-07-28: Acquired
2007-12-20: IPO
2007-02-05: Secondary Market · $17.87M

Leadership Team

Brian Chess
SVP Technology and AI
Eli Johnson
Vice President, Global Sales Productivity
Company data provided by Crunchbase