Consumer Edge · 2 weeks ago
Senior Machine Engineer (NLP)
Consumer Edge is a data innovation and AI company transforming how professionals interpret consumer and business behavior. They are seeking an experienced Senior Machine Learning Engineer with a specialization in Natural Language Processing (NLP) to solve critical data challenges by designing, building, and deploying production-scale systems for entity resolution.
Information Services
Responsibilities
Design & Build: Lead the end-to-end development of machine learning pipelines for large-scale entity resolution, record linkage, and data matching
NLP Modeling: Apply and customize advanced NLP techniques (e.g., entity extraction, semantic similarity, text vectorization, fuzzy matching) to compare and match entities from structured and unstructured text
System Architecture: Engineer scalable and efficient data processing and model inference systems designed to handle terabyte scale datasets using cloud-native tools
Deployment: Deploy, monitor, and maintain ML models and data pipelines in production on GCP (e.g., Vertex AI, BigQuery, Dataflow)
Project Leadership: Collaborate closely with product managers, data engineers, and business stakeholders to scope new projects, define data requirements, and establish success metrics
Communication & Documentation: Create clear, comprehensive design documents and effectively communicate complex technical concepts, trade-offs, and results to both technical and non-technical audiences
Qualification
Required
3+ years of hands-on experience building and deploying machine learning models in a production environment
Proven, demonstrable experience in Natural Language Processing (NLP) with a specific focus on entity resolution, record linkage, or data matching projects
Strong proficiency in Python and common ML/data science libraries (e.g., scikit-learn, pandas, spaCy, Hugging Face Transformers)
Hands-on experience with ML deployment and data processing services on public cloud providers (GCP, AWS, or Azure)
Solid software engineering fundamentals, including version control (Git), testing, and CI/CD practices
Excellent written and verbal communication skills, with a proven ability to document design decisions and present complex information clearly
Preferred
Experience building data-intensive applications and working with very large datasets using distributed computing frameworks (e.g., Apache Beam, Apache Spark, Dask, Ray)
Experience building NLP applications with an LLM based component
Familiarity with MLOps principles and tools (e.g., MLflow, Kubeflow, TFX)
Experience deploying AI/ML systems to production and integrating with data pipelines (e.g., ETL tools, Airflow, Dagster)
Publications in relevant conferences (e.g., ACL, EMNLP, KDD) or contributions to open-source projects
Benefits
Performance-based bonus
Company equity
401(k) matching
Paid parental leave
Flexible and generous time off
Work-from-home flexibility
Subsidized health benefits
Company
Consumer Edge
Consumer Edge (CE) is a consumer alternative data firm providing insights as a service solutions to the global investor community, hedge funds, private equity, venture capital, corporates and data partners.