Apply on Employer Site

Feuji · 1 day ago

Virtualization Engineer

Los Altos, CA

Contract

Onsite

Mid Level

Feuji is seeking a contractor to help build and evolve their internal data platform that supports vehicle testing, experimentation, and machine learning workflows. The role involves implementing data ingestion pipelines, automated processing workflows, and web-based visualization tools.

ConsultingInformation Technology

No H1B

Hiring Manager

Rehan Qureshi

Responsibilities

Implement and extend data ingestion and processing workflows for large, heterogeneous datasets collected from vehicle tests and ML pipelines

Contribute to improving orchestration, scheduling, and reliability of long-running data workflows operating under real-world constraints

Integrate downstream automation such as metric computation, plotting, and LLM-based postprocess tooling

Implement backend services and APIs that support data indexing, metadata management, and experiment tracking

Build user-facing web-based tools and dashboard that allow users to browse datasets, inspect results, and understand experimental progress over time

Work with a SQL-backed database to store metrics, experiment metadata, and summaries, ensuring the data can be queried and accessed consistently across systems

Contribute to data traceability and provenance mechanisms that capture how datasets are generated, transformed, and consumed in ML workflows

Own and extend an existing data ingestion system responsible for uploading vehicle test data to Amazon S3

Improve ingestion orchestration to support: Upload prioritization for small datasets, Deferred upload scheduling for large datasets during off-hours, Automatic discarding of data explicitly marked as trash, Persistent queueing and resumability across server restarts or failures

Maintain ingestion reliability under constrained network bandwidth

Extend the current web interface for clarity, reliability and extendability

Integrate ingestion workflows with post-processor, such as: Existing LLM-based automatic annotation module, Automating plot generation, Metric computation pipelines

Package and deploy the annotation system as a service (e.g., EC2-based)

Implement orchestration logic to trigger annotation jobs opportunistically when ingestion resources are idle

Store metrics, experiment metadata, plots and summaries in SQL-backed database layer

Implement and extend a SQL-backed metrics database using schemas defined by the team

Define schemas to support: Multiple projects, Baselines vs experimental runs, Historical comparisons

Build automated pipelines to compute and register metrics after ingestion

Implement project-level leaderboard functionality to track: Best performance per metric, Accepted baselines vs rejected experiments

Develop a web-based visualization interface to: Display time-series progress, Visualize metric tradeoffs, Summarize experimental outcomes

Design and implement a data provenance system for ML datasets

Track: Source S3 URIs, Post-processing operations applied to datasets

Implement a registry of post-processing functions with support for: Easy addition and removal, Versioning and configuration tracking

Generate human-readable dataset identifiers

Enable lookup and inspection of dataset lineage via API and/or web interface

Qualification

PythonSQLAWSData visualizationWeb-based toolsLinuxMachine LearningSoft skills

Required

Experience with Python for backend services, data pipelines, and automation

Working knowledge of SQL, including writing queries and understanding database schemas

Experience building web-based tools, including: Backend APIs (e.g., FastAPI, Flask, or similar), Frontend applications using React or other modern frameworks

Familiarity with AWS and cloud-based storage or services

Comfortable working in Linux environments

Preferred

Interest in autonomous racing and vehicle dynamics research

Prior internship or project experience involving data pipelines, dashboards, or analytics tools

Exposure to data visualization libraries, ML workflows, or experiment tracking systems

Company

Feuji

Feuji is a technology company delivering Digital Transformation, Cloud, Data and Insights, Cyber Security, and Strategic Staffing solutions.

Founded in 2014

Irving, Texas, USA

501-1000 employees

https://www.feuji.com/

Funding

Current Stage

Late Stage

Leadership Team

Amar Naga

President & Chief Growth Officer

Recent News

PRLog

Feuji Appoints Business & Technology Leader Amar Naga as President and Chief Growth Officer

2023-10-26

Company data provided by crunchbase