
CACI International Inc · 4 months ago

Data Engineer/Machine Learning Developer

CACI International Inc is currently looking for a motivated, career- and customer-oriented mid-level Data Engineer and Machine Learning Developer to join its Customs and Border Protection team in Northern Virginia. The role involves designing, developing, and deploying machine learning algorithms, collaborating with data teams, and solving complex big data problems to enhance national security efforts.

Information Technology · Service Industry · Software
Comp. & Benefits
No H1B · Security Clearance Required · U.S. Citizen Only

Responsibilities

Develop an understanding of the customer’s data environment through data profiling, data pipeline, and machine learning/statistical analyses
Deliver ML software models and components that solve real-world business problems, while working in collaboration with our Product and Data Science teams
Solve complex problems by writing and testing application code, developing and validating ML models, and automating tests and deployment
Collaborate as part of a cross-functional Agile team to create and enhance software that enables state-of-the-art, next generation big data and machine learning applications
Leverage cloud-based architectures and technologies to deliver optimized ML models at scale
Construct optimized data pipelines to feed ML models
Use programming languages like Python, Scala, or Java
Leverage continuous integration and continuous deployment (CI/CD) best practices, including test automation and monitoring, to ensure successful deployments of ML models and application code
Advocate for software and machine learning engineering best practices
Function as the engineering tech lead for large-scale initiatives
Perform statistical analysis and tune models using test results
Study appropriate datasets and transform data science prototypes
Train data-driven learning models
Maintain and work with data pipelines that transfer and process large volumes of heterogeneous structured and unstructured data using Spark, Scala, Python, Apache Kafka, TensorFlow, PyTorch, and/or other data analytics tools
Design, build and support pipelines of data transformation, conversion, validation
Build data manipulation, processing, and data visualization tools and share these tools across the team
Apply data analysis, data mining and data engineering to present data clearly and develop experiments
Ensure high-quality data and understand how data is generated through experimental design and how these experiments can produce actionable, trustworthy conclusions
Work with development teams to build tools for data logging and repeatable data tasks that will accelerate and automate processes

Qualifications

Machine Learning · Data Engineering · Cloud Computing · Python · Agile Methodology · Data Pipeline · Statistical Analysis · Big Data · Problem Solving · Communication Skills · Team Collaboration · Time Management

Required

Must be a U.S. citizen with the ability to pass a CBP background investigation; criteria include, but are not limited to: 3-year check for felony convictions, 1-year check for illegal drug use, 1-year check for misconduct such as theft or fraud
Applicable combination of education and experience:
Data Engineer/Machine Learning Developer T3 - Bachelor's Degree in computer science, electrical engineering, mathematics, or a similar field and 5+ years of on-the-job experience with an industry-recognized ML framework such as scikit-learn, PyTorch, Dask, Spark, or TensorFlow, and applications related to data gathering and preparation for ML models and building production-ready data pipelines that feed ML models
Data Engineer/Machine Learning Developer T4 - Bachelor's Degree in computer science, electrical engineering, mathematics, or a similar field and 7+ years of on-the-job experience with 5+ years data gathering and preparation for ML models
Proven previous experience as a Machine Learning Engineer or similar role
Experience developing and deploying ML solutions in a public cloud such as AWS, Azure, or Google Cloud Platform
Familiarity with machine learning frameworks (like Keras, PyTorch, or TensorFlow) and libraries (like Pandas or scikit-learn) and ability to write robust code in Python, Java, and R
Proven ML industry impact through conference presentations, papers, blog posts, open-source contributions, or patents
Essential knowledge of neural networks, deep learning, and computer vision algorithms for object detection, feature engineering, and hyperparameter tuning
Understanding of data structures, data modeling and software architecture
Deep knowledge of math, probability, statistics and algorithms
Outstanding analytical and problem-solving skills
Experience developing predictive models using large data sets for high-transaction-volume environments
Experience with evaluating and measuring performance of models
Strong algorithmic problem-solving skills. Experience with statistics, modeling, and machine learning techniques, including but not limited to hypothesis testing, regression, clustering, classification, and optimization
Ability to understand and analyze data models – how the data is stored in relational databases
Ability to understand system integration aspects of integrating model input and output in transactional systems to help real time decision making
Good understanding of software application architecture and ability to develop integration approaches for predictive models
Possess the ability to perform with little direct supervision as a self-starter. Be a self-motivated, creative, and inquisitive problem solver with a strong work ethic and data integrity
Have effective oral, visual, and written communication skills
Ability to work effectively in a high-energy, fast-paced, dynamic environment and interact effectively with team members, senior managers, and clients

Preferred

Strong organization and time management skills – prior experience in leading a small team is preferred
Working knowledge of the CBP Port of Entry systems and/or their operational requirements
Experience automating business processes using RPA technologies

Benefits

Healthcare
Wellness
Financial
Retirement
Family support
Continuing education
Time off benefits

Company

CACI International Inc

At CACI International Inc (NYSE: CACI), our 25,000 talented and dynamic employees are ever vigilant in delivering distinctive expertise and technology to meet our customers’ greatest challenges in national security.

Funding

Current Stage
Public Company
Total Funding
$1B
2025-05-21 · Post-IPO Debt · $1B
2003-01-10 · IPO

Leadership Team

John Mengucci
President & CEO
Darryl W Burke
Senior Vice President / Air Force Client Executive
Company data provided by crunchbase