CBC · 22 hours ago
Machine Learning Engineer (PhD is required)
Blockchain · Cloud Data Services
H1B Sponsor Likely
Responsibilities
Work or educational background in one or more of the following areas: operations research, computer science, mathematics, data science, business analytics, or knowledge management.
Demonstrated experience programming with R/Python, Linux, and Spark in an AWS cloud environment, or knowledge and algorithmic design experience in Python/C#/C++ (3+ years)
Proficient with Amazon SageMaker, Jupyter Notebook, scikit-learn, and deep learning and machine learning tools such as TensorFlow
Experience building vector database, NLP, LLM, and GenAI tools. Experience with LoRA, LangChain, RAG, LLM fine-tuning, and PEFT is preferred.
Demonstrated experience with SQL and relational database technologies, such as Oracle, PostgreSQL, MySQL, RDS, Redshift, Hadoop EMR, Hive, etc.
Demonstrated experience processing structured and unstructured data sources, data cleansing, data normalization and prep for analysis
Demonstrated experience with machine learning techniques including natural language processing, BERT, RoBERTa, GPT, and large language models.
Demonstrated experience with code repositories and build/deployment pipelines, specifically Jenkins and/or Git.
Demonstrated experience using Apache Hadoop and/or Apache Spark stack for big data processing, or comparable distributed computing platforms.
Demonstrated experience using data streaming technologies such as Kafka, RabbitMQ, NiFi, Kinesis, or comparable tools
Demonstrated experience using Tableau, Kibana, QuickSight, or other similar data visualization tools.
Ability to handle terabytes of time-series and cross-sectional data and extract well-defined alpha from the underlying relationships
Very comfortable working with ambiguity (e.g. imperfect data, loosely defined concepts, ideas, or goals)
Qualifications
Required
PhD background in related field
Work or educational background in one or more of the following areas: operations research, computer science, mathematics, data science, business analytics, or knowledge management.
Demonstrated experience programming with R/Python, Linux, and Spark in an AWS cloud environment, or knowledge and algorithmic design experience in Python/C#/C++ (3+ years)
Proficient with Amazon SageMaker, Jupyter Notebook, scikit-learn, and deep learning and machine learning tools such as TensorFlow
Demonstrated experience building vector database, NLP, LLM, and GenAI tools.
Demonstrated experience with SQL and relational database technologies, such as Oracle, PostgreSQL, MySQL, RDS, Redshift, Hadoop EMR, Hive, etc.
Demonstrated experience processing structured and unstructured data sources, data cleansing, data normalization and prep for analysis
Demonstrated experience with machine learning techniques including natural language processing, BERT, RoBERTa, GPT, and large language models.
Demonstrated experience with code repositories and build/deployment pipelines, specifically Jenkins and/or Git.
Demonstrated experience using Apache Hadoop and/or Apache Spark stack for big data processing, or comparable distributed computing platforms.
Demonstrated experience using data streaming technologies such as Kafka, RabbitMQ, NiFi, Kinesis, or comparable tools
Demonstrated experience using Tableau, Kibana, QuickSight, or other similar data visualization tools.
Ability to handle terabytes of time-series and cross-sectional data and extract well-defined alpha from the underlying relationships
Very comfortable working with ambiguity (e.g. imperfect data, loosely defined concepts, ideas, or goals)
Education: MS in Computer Science, Statistics, Math, Engineering, or related field, PhD preferred
3+ years of relevant experience in building large-scale machine learning or deep learning models and/or systems
1+ year of experience specifically with deep learning (e.g., CNN, RNN, LSTM)
1+ year of experience building NLP, LLM and GenAI tools.
Demonstrated skills with Jupyter Notebook, AWS SageMaker, Domino Data Lab, or comparable environments
Passion for solving complex data problems and generating cross-functional solutions in a fast-paced environment
Knowledge of Python or C++/C#, SQL, object-oriented programming, and service-oriented architectures
Strong scripting skills in shell and SQL
Strong coding skills and experience with Python (including SciPy, NumPy, and/or PySpark) and/or Scala.
Knowledge and implementation experience with statistical and machine learning models (regression, classification, clustering, graph models, etc.)
Preferred
Experience with LoRA, LangChain, RAG, LLM fine-tuning, and PEFT
Hands-on experience building models with deep learning frameworks like MXNet, TensorFlow, Keras, Caffe, PyTorch, Theano, or similar
Experience with search architecture (e.g., Solr, Elasticsearch)
Experience building and querying ontologies with technologies such as Zeno, OWL, RDF, SPARQL, or comparable
Knowledge and implementation experience with NLP techniques (LDA, TF-IDF, sentiment analysis) and NLP technologies such as NLTK, spaCy, or comparable technologies
Knowledge & experience with microservices, service mesh, API development and test automation.
Demonstrated experience using Docker, Kubernetes, and/or other similar container frameworks.
Company
CBC
Strategy & Advisory | Data Ecosystem | Modelling | Insights & Outcomes | Adoption
H1B Sponsorship
CBC has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2023 (38)
2022 (74)
2021 (49)
2020 (14)
Funding
Current Stage
Growth Stage
Company data provided by Crunchbase