CBC · 18 hours ago

Machine Learning Engineer (PhD is required)

United States

Contract

Remote

Mid Level

3+ years exp

Maximize your interview chances

BlockchainCloud Data Services

H1B Sponsor Likely

Insider Connection @CBC

Discover valuable connections within the company who might provide insights and potential referrals.
Get 3x more responses when you reach out via email instead of LinkedIn.

Responsibilities

Work or educational background in one or more of the following areas: operations research, computer science, Mathematics, data science, business analytics, or knowledge management.

Demonstrated experience programming with R/Python, Linux, and Spark in AWS cloud environment, or knowledge and algorithmic design experience in Python/C#/C++ (3+ years)

Proficient with Amazon AWS Sagemaker, Jupyter Notebook and Python Scikit, Deep Learning, Machine Learning tools such as TensorFlow

Experience building Vector DB, NLP, LLM and GenAI tools. Experience with LoRA, LangChain, RAG, LLM Fine Tuning and PEFT are preferred.

Demonstrated experience with SQL and relational database technologies, such as Oracle, PostgreSQL, MySQL, RDS, Redshift, Hadoop EMR, Hive, etc.

Demonstrated experience processing structured and unstructured data sources, data cleansing, data normalization and prep for analysis

Demonstrated experience with machine learning techniques including natural language processing, BERT, RoBERT, GPT and Large language Models.

Demonstrated experience with code repositories and build/deployment pipelines, specifically Jenkins and/or Git.

Demonstrated experience using Apache Hadoop and/or Apache Spark stack for big data processing, or comparable distributed computing platforms.

Demonstrated experience using data streaming technologies such as Kafka, Rabbit MQ, NiFi, Kinesis or comparable tools

Demonstrated experience using Tableau, Kibana, Quicksights or other similar data visualizations tools.

Ability to handle terabytes of time-series and cross-sectional data and extract well defined alpha from the underlying relationships

Very comfortable working with ambiguity (e.g. imperfect data, loosely defined concepts, ideas, or goals)

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

Generative AILLMNLPDeep LearningMachine LearningPythonAWS SagemakerJupyter NotebookSQLApache SparkData VisualizationRLinuxSpark in AWSC#C++TensorFlowVector DBLoRALangChainRAGLLM Fine TuningPEFTOraclePostgreSQLMySQLRDSRedshiftHadoop EMRHive

Required

PhD background in related field

Work or educational background in one or more of the following areas: operations research, computer science, Mathematics, data science, business analytics, or knowledge management.

Demonstrated experience programming with R/Python, Linux, and Spark in AWS cloud environment, or knowledge and algorithmic design experience in Python/C#/C++ (3+ years)

Proficient with Amazon AWS Sagemaker, Jupyter Notebook and Python Scikit, Deep Learning, Machine Learning tools such as TensorFlow

Demonstrated experience building Vector DB, NLP, LLM and GenAI tools.