Bayer · 11 hours ago
Data Engineer
Bayer is a visionary company committed to solving the world’s toughest challenges in health and agriculture. They are seeking a highly motivated Data Engineer to join their Science team within Digital Farming Solutions, where the role involves developing reliable and scalable machine learning solutions and collaborating with data engineers and scientists.
BiotechnologyChemicalHealth CareLife SciencePharmaceutical
Responsibilities
Write robust, well-documented code
Engage with colleagues with diverse technical backgrounds and expertise to understand feature, model and business requirements, and develop solutions to meet automation and scaling needs
Evaluate existing data and model pipelines, and provide recommendations to support business and technical requirements
Utilize tools and/or infrastructure developed by technical delivery partners to ensure proper data architectures and computational infrastructure is available to support rapid development and testing on machine and deep learning models that create value for our farmers
Collaborate across the Science organization to create high-quality, reproducible tools (e.g. templates, packages and dashboards) that will accelerate delivery of customer value
Undertake written & verbal communication with stakeholders in various parts of the organization in the form of detailed documentation and presentations
Chunk, clean, and format text into LLM-ready formats such as JSONL
Contribute to community guidelines for prompt/data hygiene, provenance, and safe data handling for LLMs
Qualification
Required
Bachelor's degree in data science, computer science, or related quantitative field
A minimum of 3 years' experience is required for the following
Strong Python coding skills for development and visualizations, including experience with standard packages (numpy, pandas, matplotlib, seaborn)
Familiar with Extract, Transform, and Load (ETL) processes and simple pipeline creation
Proficient use of SQL or Spark to extract and manipulate data from databases, data warehouses, and data lakes
Experience utilizing Git for collaborative code development
Experience with cloud-based platforms (AWS, GCP, Azure) and basic services (ie. S3, Lambda, Dataproc)
Experience with oral presentation and technical writing
Practical experience with newline-delimited data formats
Familiarity with text encoding (UTF-8) and common cleaning issues
Familiarity with tokenization concepts and context-window tradeoffs
Experience with prompt templates and basic prompt telemetry for debugging
Preferred
Excitement to learn new topics, develop new skills and use new tools
Experience with web services, api and database design
Experience applying data quality concepts
Familiarity with NOSQL tools like Hadoop (hive, pig, presto) and PySpark
Experience with virtual machines and cloud computing
Interdisciplinary communication and collaboration skills
Experience with application of statistics to environmental or agricultural problems
Expertise in data interoperability and analytics
Familiarity with NLP libraries and tokenizers
Familiarity with embeddings, vector DBs, and data-labeling workflows
Benefits
Health care
Vision
Dental
Retirement
PTO
Sick leave
Company
Bayer
Bayer is a life science company that specializes in the areas of health care and agriculture.
H1B Sponsorship
Bayer has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (62)
2024 (71)
2023 (76)
2022 (141)
2021 (138)
2020 (117)
Funding
Current Stage
Public CompanyTotal Funding
$9.34BKey Investors
Bank of AmericaBill & Melinda Gates FoundationTemasek Holdings
2025-09-26Post Ipo Debt· $331.5M
2024-12-06Post Ipo Debt· $5.29B
2022-11-08Grant· $12M
Leadership Team
Recent News
Morningstar.com
2026-01-20
Company data provided by crunchbase