Lead High Performance Computing Architect @ Edify Technologies | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
Lead High Performance Computing Architect jobs in United States
32 applicants
company-logo

Edify Technologies · 2 days ago

Lead High Performance Computing Architect

Wonder how qualified you are to the job?

ftfMaximize your interview chances
Real EstateSoftware
check
Comp. & Benefits
Hiring Manager
Latha Pai
linkedin

Insider Connection @Edify Technologies

Discover valuable connections within the company who might provide insights and potential referrals, giving your job application an inside edge.

Responsibilities

Lead the technical operations including the architect, design, expansion, monitoring, support, and maintenance for Scientific Computing’s computational and data science ecosystem consistent with best practices. Key components include a 50,000+ core and 30+ petabyte usable high-performance computing cluster, clinical data warehouse, and software development environment.
Lead the troubleshooting, isolation, and resolution of all technical issues.
Lead the design, development, implementation, and management of all system administration tasks, including hardware and software configuration, configuration management, system monitoring (including the development and maintenance of regression tests), usage reporting, system performance (file systems, scheduler, interconnect, high availability, etc.), security, networking and metrics, etc.
Ensures that the design and operation of the HPC ecosystem is productive for research.
Collaborates effectively with research and hospital system IT, compliance, HIPAA, security, and other departments to ensure compliance with all regulations and Sinai policies.
Partners with other peers regionally, nationally, and internationally to discover, propose, and deploy a world-class research infrastructure for Mount Sinai.
Prepares and manages budgets for hardware, software, and maintenance. Participates in chargeback/fee recovery analysis and provides suggestions to make operations sustainable.
Lead the integration of HPC resources with laboratory equipment such as genomic sequencers, etc.
Researches, deploys, and optimizes resource management and scheduling software and policies and actively monitoring.
Designs, tunes, manages, and upgrades parallel file systems, storage, and data-oriented resources.
Researches, deploys, and manages security infrastructure, including development of policies and procedures.
Lead and assist the team to resolve user support requests from researchers.
Assists in developing and writing system design for research proposals.
Lead the development of a framework for effective system documentation.
Works effectively and productively with other team members within the group and across Organization.
Provide after-hours support in case of a critical system issue.

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

HPC system administrationRedhat/CentOS Linux administrationBatch HPC cluster environmentTroubleshootingConfiguration management systemsXCATPuppetAnsibleSecurityInfinibandGigabit EthernetLSFGPFS Spectrum Scale parallel file systemsStorageTechnical operations leadershipDisparate tasks managementTechnology problem troubleshootingResearch teamsScriptingProgrammingProblem-solvingTeam playerCustomer focusedAttention to detailTime managementProject managementCommunication skillsAnalytical abilityJudgmentManagement skills

Required

Bachelor’s degree in computer science, engineering or another scientific field
8 years of progressive HPC system administration and operations (preferably in a Redhat/CentOS Linux administration, Batch HPC cluster environment)
Must be an expert troubleshooter; Must be a team player and customer focused
Strong experience with configuration management systems such as xCAT, Puppet and/or Ansible
Strong experience with networking and security
Strong experience with Infiniband and Gigabit Ethernet
Experience with LSF and GPFS Spectrum Scale parallel file systems and storage
Experience with providing technical operations leadership
Ability to manage a variety of disparate tasks and priorities independently and troubleshoot complex technology problems
Attention to detail; time and project management skills
Excellent communication skills, analytical ability, strong judgment and management skills, and the ability to work effectively as a liaison between both research and technology teams
Strong written, oral, and interpersonal communication skills
Script and programming experience

Preferred

Experience with archival storage and tape libraries (TSM) is highly preferred
Experience with databases and web services is highly preferred
Compliance, HIPAA, GDPR, FISMA
Experience with managing web access to HPC resources (such as Open OnDemand)
Experience in a research environment is highly preferred
Experience with financial budgets and providing cost benefit analysis is preferred
Cloud Technology

Company

Edify Technologies

twittertwitter
company-logo
Edify Technologies, headquartered in Naperville, IL, is a beacon of innovation with a two-decade legacy of success and enduring partnerships.

Funding

Current Stage
Early Stage
Total Funding
$0.15M
2014-10-01Seed· $0.15M
Company data provided by crunchbase
logo

Orion

Your AI Copilot