OP · 3 hours ago
Data Analyst I
OP is a technology consulting and solutions company that helps harness the power of technology for maximum impact. They are seeking a Data Analyst to develop and refine data curation and evaluation strategies to improve models across key quality metrics.
EnterpriseInformation ServicesInformation Technology
Responsibilities
Data Curation: Manage data labeling workflows, including data enqueueing for labeling, UI for labeling, and extracting labels into datasets for the modeling team
Data Engineering (Pipelines): Maintain large-scale, efficient, and reliable data processing pipelines (billions of images). This includes data sourcing, running machine learning models to understand content, and using LLMs to clean data
Data Engineering (Governance): Maintain our portfolio of datasets, ensuring governance of access, retention, and privacy compliance
Annotations: Spend time manually annotating training data based on modeling team requirements. Use of LLMs and other models to annotate training data or to evaluate generated content. Then apply auditing to understand these model performance
Analysis: Collaborate with engineers to identify and summarize model gaps based on evaluations. Utilize these findings to identify necessary data, and then mine and prepare that data for subsequent model training iterations
Auditing: Scale validated evaluation protocols with PDO teams, including coordination and auditing. Also, audit and correct human-labeled data
Participate in OP monthly team meetings and participate in team-building efforts
Contribute to OP technical discussions, peer reviews, etc
Contribute content and collaborate via the OP-Wiki/Knowledge Base
Provide status reports to OP Account Management as requested
Qualification
Required
Verbal and written communication skills
Problem-solving skills
Interpersonal skills
Attention to detail
Aptitude for experimental investigations
Basic ability to work independently and manage one's time
Basic knowledge of Python
Basic knowledge of SQL
Basic knowledge of computer vision and generative models
Basic knowledge of data ETL workflows & pipelines
Usage of LLM for data labeling-related work
Associate's degree or equivalent training required in Computer Science, Electronic Engineering, Physics, Bioinformatics, or other STEM subjects
Preferred
Prior industrial experience in software development and testing
Research experience in human-computer interaction
Benefits
401(k)
Dental Insurance
Health insurance
Vision insurance
Company
OP
OP is one of the fastest-growing technology consulting and solutions companies in the U.S.
H1B Sponsorship
OP has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (8)
2024 (5)
2023 (1)
2022 (8)
2021 (6)
2020 (11)
Funding
Current Stage
Growth StageCompany data provided by crunchbase