Codoxo · 8 hours ago
Jr. Data Engineer
Codoxo is a leading provider of artificial intelligence-driven solutions in the healthcare sector, focused on reducing risks from fraud, waste, and abuse. The Junior Data Engineer will support the design and maintenance of scalable data pipelines, collaborating with data scientists and analysts to ensure timely and secure data delivery.
Artificial Intelligence (AI)Big DataHealthcareSoftwareProperty & Casualty InsuranceAnalyticsFraud DetectionGenerative AIHealth CareMachine Learning
Responsibilities
Assist in designing, building, and maintaining scalable ETL/ELT data pipelines
Develop and optimize batch and streaming workflows using tools such as AWS Glue, Spark, and Airflow
Support data integration across multiple structured and unstructured data sources
Write clean, efficient, and maintainable code in Python and SQL
Monitor, troubleshoot, and improve pipeline reliability and performance
Optimize database performance, particularly in PostgreSQL and cloud-based environments
Maintain and support AWS-based infrastructure (EC2, S3, Glue, etc.)
Implement data validation, quality checks, and monitoring processes
Ensure compliance with data governance, security, and regulatory standards
Collaborate with data scientists and analysts to translate data requirements into scalable engineering solutions
Document data flows, architecture decisions, and technical processes
Use AI-assisted development tools to improve speed, testing coverage, and code quality
Qualification
Required
Bachelor's degree in Computer Science, Data Engineering, Information Systems, or a related technical field (or equivalent practical experience)
0–2 years of experience in data engineering, software engineering, or related technical roles (internships included)
Proficiency in Python, PySpark and SQL
Familiarity with ETL/ELT concepts and data pipeline architecture
Experience working with relational databases such as PostgreSQL
Basic understanding of cloud computing concepts, preferably AWS
Exposure to distributed data processing frameworks such as Spark
Experience working in Linux environments and basic shell scripting
Strong analytical and problem-solving skills
Ability to work collaboratively in a team environment under mentorship
Strong written and verbal communication skills
Preferred
Experience working with medical claims data strongly preferred
Hands-on experience with AWS services such as EC2, S3, Glue, and IAM
Experience with workflow orchestration tools such as Apache Airflow
Exposure to data warehousing concepts and dimensional modeling
Familiarity with CI/CD pipelines and version control (e.g., Git)
Understanding of data security, governance, and compliance best practices
Experience supporting machine learning pipelines or analytics platforms
Demonstrated use of AI tools (e.g., code assistants, automation platforms) to improve development efficiency
Benefits
Health, Dental, and Vision insurance with 100% employee premium coverage (Starts Day 1)
Unlimited PTO
Annual Professional Development stipend
Annual home office stipend
401K Match (after 90 days)
Company
Codoxo
Codoxo leverages advanced generative and self-learning AI to enhance payment integrity in healthcare.
Funding
Current Stage
Growth StageTotal Funding
$49.66MKey Investors
CVS Health VenturesQED InvestorsGRA Venture Fund,QED Investors
2025-12-17Series C· $35M
2025-03-01Debt Financing
2022-02-03Series B· $5.25M
Leadership Team
Recent News
2025-12-20
Crowdfund Insider
2025-12-20
vcnewsdaily.com
2025-12-18
Company data provided by crunchbase