Tech Army, LLC · 9 hours ago
Developer/Engineer (Mid Level experience)
Tech Army is seeking a mid-level Developer/Engineer for a hybrid role in Philadelphia. The role involves deploying open-source LLMs and vector databases and implementing security measures for data privacy and governance.
Consulting · IT Management · Software · Staffing Agency
Responsibilities
Deploy open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
Use Python for LLM inference, prompt engineering, and integration
Run CPU-based inference, apply model quantization, and tune performance
Work with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector
Implement Retrieval-Augmented Generation (RAG) pipelines
Generate and manage embeddings and apply metadata filtering
Address data privacy, air-gapped deployment, and enterprise security requirements
Implement access controls and audit logging
Deliver a reference architecture and deployment guidance
Deliver a working prototype (LLM + vector DB + RAG)
Provide documentation and knowledge transfer to internal teams
Qualifications
Required
Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
Strong proficiency in Python for LLM inference, prompt engineering, and integration
Experience with CPU-based inference, model quantization, and performance tuning
Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector
Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
Experience generating and managing embeddings and metadata filtering
Understanding of data privacy, air-gapped deployments, and enterprise security requirements
Experience implementing access controls and audit logging
Preferred
Experience with LangChain or LlamaIndex
Exposure to Rust, Go, or C++ for high-performance services
Familiarity with Docker and Kubernetes for on-prem deployments
Knowledge of inference frameworks (e.g., vLLM, llama.cpp, Hugging Face Transformers)
Prior work in regulated or enterprise environments
Company
Tech Army, LLC
Tech Army, LLC is an 8(a) and DBE certified industry leader with over 30 years of success in providing IT consulting and end-to-end IT staff augmentation services.
H1B Sponsorship
Tech Army, LLC has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (2)
2024 (1)
2022 (2)
2021 (1)
Funding
Current Stage
Growth Stage
Company data provided by Crunchbase