Tykhe Inc · 2 days ago
Senior Data Scientist
Staffing & Recruiting
Responsibilities
Design, develop, manage, and maintain systems, and handle large datasets.
Lead AI strategy by delivering solutions that combine software engineering, statistics, and machine learning effectively for complex clinical applications.
Execute deep, thoughtful, analytical experiments that utilize the most appropriate techniques and lead to substantial incremental improvements.
Qualifications
Required
Minimum 4-5 years' experience working in a data scientist capacity.
PhD or MS in Computer Science/Machine Learning/AI, or related work experience with the design, building and evaluation of Machine Learning systems.
Experience with Machine Learning ecosystem tools, including PyTorch/TensorFlow, scikit-learn, XGBoost or equivalents.
Statistical Analysis: Proficiency in statistical techniques and tools.
Machine Learning: Knowledge of machine learning algorithms and their applications.
Programming: Strong programming skills (e.g., Python, R, or SQL).
Data Visualization: Ability to create meaningful visualizations.
Ability to design, implement, test and deploy Machine Learning models.
Proficiency in accessing and handling databases via SQL, Azure Data Factory or similar.
Alternatively, familiarity with data storage/management systems or Big Data frameworks (such as Hadoop or Spark).
Proficiency in Software Development best practices such as Continuous Integration, Unit/Integration Testing, and Code Reviews.
Understanding of MLOps fundamentals.
Preferred
Experience working with and evaluating LLM-based pipelines, with an emphasis on retrieval-augmented generation and prompting techniques, is a plus.
Familiarity with LangChain/LlamaIndex/Haystack/Azure AI Studio, vector databases and retrieval techniques, or equivalent common tools in the emerging LLM-enabled tech stack, is a plus.