Simpson Thacher & Bartlett LLP · 16 hours ago

Data Scientist

Simpson Thacher & Bartlett LLP is seeking a Data Scientist who will deliver insights to the Firm’s leadership and legal practices. The role involves utilizing statistics, machine learning, and natural language processing to derive meaning from both structured and unstructured data, while contributing to the Firm's AI initiatives.

Law Practice
No H1B

Responsibilities

Support legal teams and relevant operational staff in delivering on opportunities to use data to drive decision-making and improve the efficiency and effectiveness of the Firm’s client representations
Collaborate with Firm functional departments (e.g., Finance, Talent, Business Development, IT) to analyze data and develop solutions to support operational objectives
Develop regression and classification models using established and emerging data science methodologies
Chain, fine-tune and deploy pre-trained language models (e.g., BERT, Llama, Qwen) to optimize performance on a range of NLP tasks, including text classification, named entity recognition, and generative tasks such as summarization, clause and document generation, and question-answer exchanges
Design and deploy document segmentation and embedding approaches to facilitate information retrieval and retrieval augmented generation (RAG)
Conduct advanced quantitative research, using machine learning (ML) and natural language processing (NLP) techniques to understand patterns in large volumes of data, identify relationships, detect data anomalies and classify data
Configure practice-specific AI workflows and language technologies, which may require complex pipelines, prompt engineering, prompt chaining, and text operations
Design and deploy highly visual reports and user interfaces that surface quantitative insights in forms that are fit-for-purpose, modern and easily accessible to non-technical business professionals
Stay current with the latest advancements in LLMs, NLP, Deep Learning and ML research, implementing cutting edge techniques and incorporating them into production models as appropriate
Document development processes, codebase, and best practices to facilitate knowledge sharing and maintain a well-organized, reproducible environment
Partner with other technical resources to refine data pipelines for recurring classes of analysis and data-driven solutions
Handle projects on request under the direction of the CKIO, Director of Applied Analytics + AI, and other executive staff

Qualifications

Machine Learning, Natural Language Processing, Deep Learning, Statistical Programming, Data Visualization, SQL, Python, R, Technical Knowledge, Communication Skills, Problem Solving, Collaboration

Required

Bachelor's degree required, preferably in data science, mathematics, statistics, computer science, engineering, finance or a related field
2+ years in a data science, machine learning engineering, artificial intelligence or equivalent role, or a PhD in a related field
Highly proficient with statistical programming (e.g., Python, R) and databases (e.g., SQL, Pinecone)
Proven experience developing and validating linear and non-linear regression and classification models
Expertise in data transformation, data science and visualization libraries (e.g., pandas, scikit-learn, matplotlib, Seaborn)
Ability to design and develop object-oriented machine learning systems beyond Jupyter notebooks
Solid understanding of deep learning frameworks such as TensorFlow or PyTorch
Proficiency with version control systems such as Git or equivalent tools for code management and collaboration
Able to translate business problems to technical logic and practical solutions
Able to communicate complex results clearly to a non-technical audience
Proactively develops and maintains technical knowledge in emerging data science areas

Preferred

Master's degree in data science, computer science, statistics, computational linguistics or engineering preferred
Prior coursework in deep learning, natural language processing, or information retrieval a significant plus
Experience with natural language processing and related libraries (e.g., Hugging Face's Transformers, spaCy, NLTK)
Experience in the legal field is a significant plus

Company

Simpson Thacher & Bartlett LLP

Simpson Thacher & Bartlett LLP is one of the world’s leading international law firms.

Funding

Current Stage
Late Stage

Leadership Team

Kelly Stevens
Chief Operating Officer
Alan Turner
Partner
Company data provided by Crunchbase