Wall Street Consulting Services LLC · 8 hours ago
Senior Generative AI Engineer
Wall Street Consulting Services LLC is looking for a GenAI / Small Language Model (SLM) Engineer to design, deploy, and maintain agentic AI solutions. The role involves end-to-end delivery of AI features, including data preparation, model fine-tuning, and ensuring compliance with security standards.
Responsibilities
Collect, clean, and preprocess domain-specific datasets for SLM training and fine-tuning
Ensure data quality, diversity, and compliance with privacy and security standards
Fine-tune small language models on curated datasets using techniques like LoRA, adapters, or parameter-efficient tuning
Optimize hyperparameters for performance, latency, and resource efficiency
Help design and implement agent orchestration (single and multi‑agent) and function/tool use strategies
Craft, version, and optimize prompts and system instructions for accuracy, coherence, and domain alignment
Integrate external tools/APIs and establish content‑safety guardrails (e.g., policy enforcement, PII redaction, jailbreak prevention)
Build resilient agent workflows and services; harden reliability with retries, fallbacks, circuit breakers
Develop automated tests for prompts, tools, and agent behaviors; maintain regression suites and golden datasets
Operate AI services in production: performance tuning, cost optimization, incident response, and iterative improvement
Design and manage data pipelines for fine‑tuning and retrieval (RAG), including cleansing, labeling, and governance
Monitor drift, quality, latency, and safety signals; implement model/agent observability and alerting
Run structured evaluations of agent outputs (functional, coherence, safety, bias); track precision/recall and hallucination rates
Perform risk assessments for agent behaviors and tool actions; document mitigations and approval workflows
Collaborate with security/compliance to meet regulatory, privacy, and usage‑policy requirements
Qualification
Required
4–8+ years in software/ML engineering, with 2+ years building LLM/SLM/GenAI solutions in production
Proficiency in Python (and/or TypeScript) and modern AI orchestration frameworks (e.g., Microsoft Agent Framework, Google Agent Development Kit, LangChain, Semantic Kernel)
Hands‑on with retrieval‑augmented generation (RAG), function calling, prompt optimization, and agent design patterns
Experience building data pipelines (batch/stream), and managing datasets for training/fine‑tuning and evaluation
Practical understanding of AI guardrails: content filtering, safety policies, redaction, rate limiting, and misuse prevention
Strong willingness to learn advanced agent orchestration and MLOps practices
Preferred
MLOps fluency: model packaging, CI/CD, experiment tracking (e.g., MLflow), deployment on cloud/container platforms
IaC (e.g., Terraform/Bicep) and DevOps tooling (e.g., GitHub Actions/Azure DevOps); strong grasp of observability
Experience with multi‑agent systems, toolformer patterns, and complex orchestration graphs
Knowledge of vector databases and retrieval systems; evaluation frameworks (e.g., Ragas, DeepEval) and custom metrics
Familiarity with privacy, compliance, and model risk management practices for AI
Background in tuning open‑source and hosted models; comfort with hybrid cloud environments
Company
Wall Street Consulting Services LLC
“Why wait for change? Drive it yourself!” At Wall Street the next generation IT solutions meet the next generation career opportunities.
H1B Sponsorship
Wall Street Consulting Services LLC has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (23)
2024 (28)
2023 (22)
2022 (11)
2021 (10)
2020 (5)
Funding
Current Stage
Growth StageCompany data provided by crunchbase