BTI360 · 13 hours ago
Data Scientist - Polygraph Required
BTI360 is a company dedicated to developing software engineers and has been recognized as a top workplace. They are seeking a Data Scientist to transform raw data into meaningful insights, leading machine learning projects and collaborating with teams to solve complex business problems.
Information Technology · Software · Software Engineering
Responsibilities
Lead end-to-end machine learning projects from problem definition to production, leveraging pre-trained models and LLMs for rapid prototyping, then using data-driven evaluation to determine when to invest in custom solutions such as fine-tuned models or purpose-built small language models
Design and conduct rigorous model evaluations using industry-standard methodologies, including developing custom evaluation frameworks for clustering, classification, embeddings, and entity resolution systems
Translate business requirements into quantitative problems and communicate technical findings to both technical and non-technical stakeholders through reports, presentations, and direct customer engagement
Drive technical decision-making for model selection, including evaluating embeddings models, LLMs, and other ML systems against business requirements and performance benchmarks
Optimize existing systems by identifying data quality issues, implementing performance improvements, and developing novel approaches that deliver measurable improvements in key metrics
Stay current with ML advances and introduce new techniques and frameworks to the team, including participating in or presenting at industry conferences
Apply statistical methods to validate findings and support data-driven decisions
Develop reports and whitepapers that evaluate solution alternatives based on impact, cost, technical feasibility, and alignment with strategic goals
Collaborate across teams to align on strategy, provide data science expertise, and contribute to proposals and strategic initiatives
Mentor junior data scientists by providing technical guidance, defining project direction, and sharing best practices in model development and evaluation
Qualifications
Required
Active TS/SCI with Polygraph
3+ years of experience in data science or machine learning roles with demonstrated impact on production systems
Deep expertise in NLP and modern ML techniques including embeddings, clustering, classification, and working with LLMs
NLP experience including feature engineering and modeling for text data
Knowledge of ML evaluation metrics appropriate for various use cases
Proven track record of model evaluation using appropriate metrics and methodologies for different problem types (precision/recall, silhouette scores, benchmarks, etc.)
Strong scripting skills with experience in AI/ML frameworks (scikit-learn, PyTorch/TensorFlow, transformers) and data manipulation libraries, including proficiency with Jupyter notebooks for exploratory analysis and experimentation
Familiarity with basic software engineering practices such as Git, CI/CD, code reviews, and documentation, and the ability to access and use remote APIs
Familiarity with cloud platforms (AWS, Azure, GCP)
Excellent communication skills with experience translating technical metrics into business value and presenting findings to leadership and customers
Ability to work independently
Preferred
Strong Python programming skills
Experience integrating ML models into production systems
Experience customizing or fine-tuning LLMs (LoRA, distillation, etc.)
Experience with large or multimodal datasets
Proficiency in SQL and Elasticsearch/Elastic Stack for building evaluation datasets and querying production data systems
Experience leading technical initiatives
Benefits
Fully paid healthcare premiums
Competitive salaries and performance bonuses
Career development and in-house training
Continuing Education: $5,250 annually toward education
Up to 5 weeks PTO plus 2 weeks of federal holidays
401(k) dollar-for-dollar matching up to 6% annually, vested immediately on day 1
Giving Back: Serving communities locally and across the globe
Social Events (happy hours, golf tournament, BTI360 Family Festival and more)
Company
BTI360
BTI360 develops and delivers big data software solutions that reduce the time spent on research, leaving more time for providing insights.
Funding
Current Stage
Growth Stage
Recent News
2023-10-03
Company data provided by Crunchbase