Software Developer/Engineer (Mid Level experience) jobs in United States
This job has closed.

Jobs via Dice · 8 hours ago

Software Developer/Engineer (Mid Level experience)

Dice, a career destination for tech experts at every stage of their careers, is seeking a Software Developer/Engineer on behalf of Tri-Force Consulting Services Inc. The role involves deploying open-source LLMs and vector databases, ensuring data privacy and security, and delivering prototypes and documentation.
Computer Software

Responsibilities

Deliverables:
Reference architecture and deployment guidance
Working prototype (LLM + vector DB + RAG)
Documentation and knowledge transfer to internal teams

Qualifications

Open-source LLMs, Python, Vector Databases, Retrieval-Augmented Generation, Data privacy, Access controls, Docker, Kubernetes, Rust, Go, C++, Inference frameworks

Required

Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
Strong proficiency in Python for LLM inference, prompt engineering, and integration
Experience with CPU-based inference, model quantization, and performance tuning
Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector
Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
Experience generating and managing embeddings and metadata filtering
Understanding of data privacy, air-gapped deployments, and enterprise security requirements
Experience implementing access controls and audit logging

Preferred

Experience with LangChain or LlamaIndex
Exposure to Rust, Go, or C++ for high-performance services
Familiarity with Docker and Kubernetes for on-prem deployments
Knowledge of inference frameworks (e.g., vLLM, llama.cpp, Hugging Face Transformers)
Prior work in regulated or enterprise environments

Company

Jobs via Dice

Welcome to Jobs via Dice, the go-to destination for discovering the tech jobs you want.

Funding

Current Stage
Early Stage
Company data provided by Crunchbase