Data Analyst I Apply
Are you passionate about harnessing data to drive innovation? We're on the lookout for a talented Data Analyst to develop and refine data curation and evaluation strategies that improve our models across key quality metrics, visual fidelity, prompt adherence, identity preservation, naturalness, and visual text generation. Using billions of images, manual annotations, and machine learning signals, we identify quality gaps and data needs through both manual and automated assessments. If you're eager to make an impact in a cutting-edge environment, we'd love to hear from you!
Main Responsibilities:
- Data Curation: Manage data labeling workflows, including data enqueueing for labeling, UI for labeling, and extracting labels into datasets for the modeling team.
- Data Engineering (Pipelines): Maintain large-scale, efficient, and reliable data processing pipelines (billions of images). This includes data sourcing, running machine learning models to understand content, and using LLMs to clean data.
- Data Engineering (Governance): Maintain our portfolio of datasets, ensuring governance of access, retention, and privacy compliance.
Additional Responsibilities:
- Annotations:
- Spend time manually annotating training data based on modeling team requirements.
- Use of LLMs and othe2r models to annotate training data or to evaluate generated content. Then apply auditing to understand these model performance.
- Analysis:
- Collaborate with engineers to identify and summarize model gaps based on evaluations. Utilize these findings to identify necessary data, and then mine and prepare that data for subsequent model training iterations.
- Auditing: Scale validated evaluation protocols with PDO teams, including coordination and auditing. Also, audit and correct human-labeled data.
Skills:
- Verbal and written communication skills, problem-solving skills, and interpersonal skills.
- Attention to detail and an aptitude for experimental investigations
- Basic ability to work independently and manage one's time.
- Basic knowledge of Python and SQL.
- Basic knowledge of computer vision and generative models.
- Basic knowledge of data ETL workflows & pipelines.
- [New] Usage of LLM for data labeling-related work.
Education/Experience:
- Associate's degree or equivalent training required in Computer Science, Electronic Engineering, Physics, Bioinformatics, or other STEM subjects.
- Prior industrial experience in software development and testing, and/or research experience in human-computer interaction are preferred.
Benefits:
- 401(k).
- Dental Insurance.
- Health insurance.
- Vision insurance.
- We are an equal-opportunity employer and value diversity, equality, inclusion, and respect for people.
- The salary will be determined based on several factors, including, but not limited to, location, relevant education, qualifications, experience, technical skills, and business needs.
Additional Responsibilities:
- Participate in OP monthly team meetings and participate in team-building efforts.
- Contribute to OP technical discussions, peer reviews, etc.
- Contribute content and collaborate via the OP-Wiki/Knowledge Base.
- Provide status reports to OP Account Management as requested.
About us:
At OP, we help you harness the power of technology for maximum impact. A technology consulting and solutions company, we offer advisory and managed services, innovative platforms, and staffing solutions across a wide range of fields including AI, cyber security, enterprise architecture, and beyond. For nearly two decades, we've been challenging the status quo of the consulting industry, serving up fresh, ingenious thinking through a radically lean structure. Together, this strategy delivers unprecedented performance at an unparalleled pace for faster results that propel your business forward.
At OP, we help you harness the power of technology for maximum impact. A technology consulting and solutions company, we offer advisory and managed services, innovative platforms, and staffing solutions across a wide range of fields including AI, cyber security, enterprise architecture, and beyond. For nearly two decades, we've been challenging the status quo of the consulting industry, serving up fresh, ingenious thinking through a radically lean structure. Together, this strategy delivers unprecedented performance at an unparalleled pace for faster results that propel your business forward.

