Technical Architect (AI) jobs in United States

Saransh Inc · 3 months ago

Technical Architect (AI)

Saransh Inc is seeking a highly skilled Technical Architect (AI) to join their team. This role focuses on leveraging expertise in Generative AI and machine learning to design and implement innovative AI-driven solutions that enhance automation and efficiency for clients.

Employment, Information Technology, Staffing Agency
H1B Sponsor Likely

Responsibilities

Lead the design and implementation of end-to-end AI solutions ensuring scalability, robustness, and efficiency aligned with business needs
Architect RAG pipelines using frameworks like LangChain, LlamaIndex, or custom-built stacks
Design Agentic AI architectures, including task-based agents, stateful memory, planning-execution workflows, and tool augmentation
Define and execute data strategies for collection, cleaning, transformation, and integration
Fine-tune pre-trained models (e.g., GPT, BERT) and optimize prompt engineering techniques to drive high-quality, actionable outputs for diverse business use cases
Generate embeddings, evaluate outputs, and incorporate human and automated feedback loops
Apply advanced NLP techniques such as tokenization, prompt engineering, and query optimization
Build, train, and deploy machine learning models, including deep learning models, for complex AI applications across various domains
Build and enforce guardrails for model safety and compliance, including prompt validation, output moderation, and access controls
Ensure solutions meet data governance, compliance, and security standards
Collaborate with teams to deploy solutions in AWS cloud-native environments (Bedrock, Lambda, ECS, SageMaker, CDK)
Oversee CI/CD pipelines, API integrations, and scalable production deployments
Lead LLM provisioning on AWS, balancing performance and cost-effectiveness
Oversee the deployment of AI models, ensuring smooth integration with production systems, and perform rigorous evaluation of LLMs for accuracy, efficiency, and scalability
Contribute to system observability
Support post-deployment monitoring, optimization, and retraining cycles for LLM-driven systems

Qualifications

Generative AI, LLMs, NLP, Machine Learning, Architectural Design, RAG Pipelines, Python, AWS, CI/CD, Leadership Capabilities, Solutioning, Communication

Required

Proven production experience with RAG pipelines (LangChain, LlamaIndex, or custom stacks)
Strong understanding of Agentic AI patterns: task agents, memory/state tracking, orchestration
Expertise in LLM fine-tuning, embeddings, evaluation strategies, and feedback integration
Hands-on experience with AI guardrails (moderation, filtering, prompt validation)
Proficiency in Python, vector DBs, and LLM APIs
Familiarity with CI/CD, API integration, and cloud-native deployments
Strong database management skills (SQL & NoSQL)
Excellent communication, solutioning, and leadership capabilities

Preferred

Experience with agent orchestration frameworks
Knowledge of machine learning and deep learning models beyond NLP
Exposure to data strategy at enterprise scale, including cost-optimized LLM provisioning
Hands-on experience with observability tools for monitoring AI systems

Company

Saransh Inc

We provide recruitment, consulting and IT services for our clients, which focus on maximizing their revenue generation, enhancing business productivity and improving cost management.

H1B Sponsorship

Saransh Inc has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data Powered by US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships
2025 (35)
2024 (51)
2023 (45)
2022 (30)
2021 (22)
2020 (39)

Funding

Current Stage
Growth Stage

Leadership Team

Sridhar Chimaladinne
FOUNDER & CEO
Company data provided by Crunchbase