General Dynamics Information Technology · 1 day ago
Senior Data Scientist
General Dynamics Information Technology (GDIT) is seeking Senior Data Scientists for a federal client in the Washington D.C. area. The role involves designing and managing infrastructure for data collection, storage, and analysis, ensuring data quality and security, and automating data workflows.
Artificial Intelligence (AI), Cloud Computing, Consulting, Cyber Security, Information Technology
Responsibilities
Designing, building, and managing the infrastructure and tools needed to collect, store, process, and analyze large volumes of data
Data Collection: Gathering data from various sources, such as databases, APIs, and Internet of Things (IoT) devices
Data Storage: Using scalable storage solutions like data lakes and distributed file systems to handle vast amounts of data
Data Processing: Transforming raw data into a usable format through batch processing (e.g., Hadoop) or real-time processing (e.g., Apache Kafka)
Data Integration: Combining data from different sources to create a unified view
Data Quality: Ensuring the accuracy, consistency, and reliability of data
Data Security: Implementing measures to protect data from unauthorized access and breaches
Data Pipeline Management: Automating and orchestrating data workflows to ensure smooth data flow from source to destination, with subsequent training for any super-users once pipelines are set up
Storing and analyzing large datasets utilizing advanced techniques such as statistical analysis, econometrics, Machine Learning (ML), and predictive modeling with multiple scripting options such as R, Python, SAS, Stata, and SQL
Support of varying transfer methods (direct cloud upload, secure FTP, or physical media) from diverse sources (transactional DB, operational data stores, external SaaS, flat files, legacy mainframe)
Preprocessing that may include decompression, deduplication, batch-based ingestion, and near-real-time streaming
Qualifications
Required
8+ years of related experience
AWS Big Data
Data Pipelines
Data Visualization
Machine Learning Algorithms
Deep knowledge of big data and other COTS statistical and analytical tools (R, SAS, Stata and data lake tools), database management, and ETL processes
Expertise in data architecture, data science tools, AI, and data lakes to facilitate successful project execution
Strong background in statistics and mathematics
Proficiency in programming (e.g., Python, Java)
Experience with machine learning algorithms
Experience with data visualization tools (e.g., Tableau, Matplotlib)
Comparative understanding of leading models (e.g., Claude Code, ChatGPT, xAI), including their capabilities, limitations, and trade-offs (e.g., latency, cost, fine-tuning, context window size)
Experience deploying and managing LLMs in FedRAMP-authorized environments, including GCC, GovCloud, or other secure cloud infrastructures
Current Certified Analytics Professional (CAP) Certification
Current Principal Data Scientist (PDS) Certification
Security clearance level: Candidates must be eligible to obtain a Public Trust level clearance
Ability to obtain and maintain a Public Trust or higher and authorization to work in the United States. Work visa sponsorship will not be provided for this position
Preferred
US citizenship preferred
Benefits
Comprehensive benefits and wellness packages
401K with company match
Variety of medical plan options, some with Health Savings Accounts
Dental plan options
A vision plan
Variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave
Short and long-term disability benefits
Life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance
Company
General Dynamics Information Technology
General Dynamics Information Technology is an IT consulting company that specializes in cyber security, AI, and quantum computing. It is a sub-organization of General Dynamics.
Funding
Current Stage
Late Stage
Company data provided by Crunchbase