Tech Army, LLC · 7 hours ago

Developer/Engineer (Mid Level experience)

Tech Army is seeking a mid-level Developer/Engineer for a hybrid role based in Philadelphia. The role involves hands-on deployment of open-source LLMs and vector databases, as well as implementing security measures for data privacy and governance.
Consulting · IT Management · Software · Staffing Agency
H1B Sponsor Likely

Responsibilities

Deliver a reference architecture and deployment guidance
Build a working prototype (LLM + vector DB + RAG)
Provide documentation and knowledge transfer to internal teams

Qualifications

Open-source LLM deployment, Python proficiency, Vector databases experience, Retrieval-Augmented Generation, Data privacy understanding, Model quantization, Performance tuning, Access controls implementation, Audit logging, LangChain experience, Rust exposure, Go exposure, C++ exposure, Docker familiarity, Kubernetes familiarity, Inference frameworks knowledge, Regulated environments experience

Required

Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
Strong proficiency in Python for LLM inference, prompt engineering, and integration
Experience with CPU-based inference, model quantization, and performance tuning
Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector
Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
Experience generating and managing embeddings and metadata filtering
Understanding of data privacy, air-gapped deployments, and enterprise security requirements
Experience implementing access controls and audit logging

Preferred

Experience with LangChain or LlamaIndex
Exposure to Rust, Go, or C++ for high-performance services
Familiarity with Docker and Kubernetes for on-prem deployments
Knowledge of inference frameworks (e.g., vLLM, llama.cpp, Hugging Face Transformers)
Prior work in regulated or enterprise environments

Company

Tech Army, LLC

Tech Army, LLC is an 8(a) and DBE certified industry leader with over 30 years of success in providing IT consulting and end-to-end IT staff augmentation services.

H1B Sponsorship

Tech Army, LLC has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data provided by the US Department of Labor)
Trends of Total Sponsorships
2025 (2)
2024 (1)
2022 (2)
2021 (1)

Funding

Current Stage: Growth Stage

Leadership Team

Jay Narang
Chief Executive Officer
Company data provided by Crunchbase