Software Engineering Institute | Carnegie Mellon University ยท 11 hours ago
Data Scientist - 2024033
Carnegie Mellon University's Software Engineering Institute is seeking a Data Scientist to tackle cybersecurity challenges using advanced statistics and machine learning. The role involves collaborating with clients to develop solutions, co-authoring research proposals, and contributing to significant cybersecurity research projects.
Responsibilities
Use advanced statistics, data analytics, machine learning, and artificial intelligence to help our government and industry clients research and solve cybersecurity challenges
Work with customers to identify areas where advanced statistical techniques can help tackle problems
Plan and develop prototype solutions
Build out final products
Co-author research proposals
Execute studies
Present findings to DoD sponsors and at academic conferences
Craft metrics and experimental designs for large-scale cybersecurity research programs
Develop human-in-the-loop machine learning solutions
Build classifiers to identify security vulnerabilities
Qualification
Required
BS in data science, machine learning, computer science, statistics, or related highly-quantitative discipline with eight (8) years of experience or equivalent combination of training or experience; or MS in data science, machine learning, computer science, statistics, or related highly-quantitative discipline with five (5) years of experience; or PhD in data science, machine learning, computer science, statistics, or related highly-quantitative discipline with two (2) years of experience
Willingness to complete modest travel to various locations to support the SEI's overall mission
You will be subject to a background check and must be able obtain and maintain a U.S. Department of Defense security clearance
Experience in predictive modeling, data science, and/or AI & machine learning
Deep understanding of statistical modeling techniques and advanced data analytics
Proficient with at least one mathematical/statistical programming package (e.g., R, python numpy/scipy/pandas/polars, MATLAB, etc.)
Innovative and inquisitive with ability to imagine novel analytical solutions to problems
Thrives in a multi-disciplinary environment
Strong communication skills
Expertise in one or more of the following: Recommendation systems, Time-series forecasting (Prophet, NeuralProphet, Chronos, Lag-Llama, etc.), NLP / LLMs (fine-tuning, RAG, evaluation, prompt engineering), Causal inference / uplift modeling / synthetic controls, Modern ML frameworks: LightGBM/XGBoost, CatBoost, PyTorch, JAX, TensorFlow), LLMs / agentic workflows (LangChain/LlamaIndex/Haystack), Experience deploying models (FastAPI, Triton, KServe, SageMaker, Vertex AI, or similar), Experience working with big data (Spark, Trino, Snowflake, BigQuery, Databricks)
Preferred
Experience in cybersecurity and privacy is a plus
Experience in U.S. Government work and/or with FFRDCs, UARCs and National Labs is a plus
Demonstrated ability to learn new concepts and grow into new areas of work
Company
Software Engineering Institute | Carnegie Mellon University
At the SEI, we research complex software engineering, cybersecurity, and AI engineering problems; create and test innovative technologies; and transition maturing solutions into practice.
Funding
Current Stage
Late StageLeadership Team
Recent News
Seattle TechFlash
2025-06-25
2025-04-30
2025-04-10
Company data provided by crunchbase