Tri-Force Consulting Services Inc. | IT Recruitment & Staffing Agency · 17 hours ago
Software Developer\/Engineer (Mid Level experience)
Tri-Force Consulting Services Inc. is an IT recruitment and staffing agency specializing in software development and IT solutions. They are seeking a Mid Level Software Developer/Engineer to implement on-prem LLM and Vector DB solutions for their client, Philadelphia Gas Works, focusing on deployment and integration of advanced technologies.
ConsultingEnterprise SoftwareInformation TechnologySoftware
Responsibilities
Consultant Requirements – On-Prem LLM & Vector DB Implementation
Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
Strong proficiency in Python for LLM inference, prompt engineering, and integration
Experience with CPU-based inference, model quantization, and performance tuning
Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector
Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
Experience generating and managing embeddings and metadata filtering
Understanding of data privacy, air-gapped deployments, and enterprise security requirements
Experience implementing access controls and audit logging
Deliverables: Reference architecture and deployment guidance, Working prototype (LLM + vector DB + RAG), Documentation and knowledge transfer to internal teams
Qualification
Required
Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
Strong proficiency in Python for LLM inference, prompt engineering, and integration
Experience with CPU-based inference, model quantization, and performance tuning
Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector
Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
Experience generating and managing embeddings and metadata filtering
Understanding of data privacy, air-gapped deployments, and enterprise security requirements
Experience implementing access controls and audit logging
Preferred
Experience with LangChain or LlamaIndex
Exposure to Rust, Go, or C++ for high-performance services
Familiarity with Docker and Kubernetes for on-prem deployments
Knowledge of inference frameworks (e.g., vLLM, llama.cpp, Hugging Face Transformers)
Prior work in regulated or enterprise environments
Company
Tri-Force Consulting Services Inc. | IT Recruitment & Staffing Agency
Tri-Force Consulting Services, Inc.
H1B Sponsorship
Tri-Force Consulting Services Inc. | IT Recruitment & Staffing Agency has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2024 (1)
Funding
Current Stage
Growth StageCompany data provided by crunchbase