
Tri-Force Consulting Services Inc. | IT Recruitment & Staffing Agency · 11 hours ago

Software Developer/Engineer (Mid Level experience)

Tri-Force Consulting Services Inc. is an IT recruitment and staffing agency specializing in software development and IT solutions. The agency is seeking a mid-level Software Developer/Engineer to implement on-prem LLM and vector database solutions for its client, Philadelphia Gas Works, with a focus on deploying and integrating these technologies.
Consulting · Enterprise Software · Information Technology · Software
Growth Opportunities
H1B Sponsor Likely

Responsibilities

Consultant Requirements – On-Prem LLM & Vector DB Implementation
Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
Strong proficiency in Python for LLM inference, prompt engineering, and integration
Experience with CPU-based inference, model quantization, and performance tuning
Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector
Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
Experience generating and managing embeddings and metadata filtering
Understanding of data privacy, air-gapped deployments, and enterprise security requirements
Experience implementing access controls and audit logging
Deliverables: reference architecture and deployment guidance; working prototype (LLM + vector DB + RAG); documentation and knowledge transfer to internal teams

Qualifications

Open-source LLM deployment · Python proficiency · Vector databases experience · RAG pipeline implementation · Data privacy understanding · Access controls implementation · Docker familiarity · Kubernetes familiarity · Rust exposure · Go exposure · C++ exposure · Inference frameworks knowledge · Enterprise environment experience

Required

Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
Strong proficiency in Python for LLM inference, prompt engineering, and integration
Experience with CPU-based inference, model quantization, and performance tuning
Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector
Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
Experience generating and managing embeddings and metadata filtering
Understanding of data privacy, air-gapped deployments, and enterprise security requirements
Experience implementing access controls and audit logging

Preferred

Experience with LangChain or LlamaIndex
Exposure to Rust, Go, or C++ for high-performance services
Familiarity with Docker and Kubernetes for on-prem deployments
Knowledge of inference frameworks (e.g., vLLM, llama.cpp, Hugging Face Transformers)
Prior work in regulated or enterprise environments

Company

Tri-Force Consulting Services Inc. | IT Recruitment & Staffing Agency

Tri-Force Consulting Services, Inc.

H1B Sponsorship

Tri-Force Consulting Services Inc. | IT Recruitment & Staffing Agency has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data provided by the US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship (chart)
Trends of Total Sponsorships: 2024 (1)

Funding

Current Stage: Growth Stage

Leadership Team

Manish Gorawala
President and CEO
Company data provided by Crunchbase