Data Scientist jobs in United States
cer-icon
Apply on Employer Site
company-logo

Pyramid Consulting, Inc · 4 hours ago

Data Scientist

Pyramid Consulting, Inc is a leading Banking and Financial Industry, seeking a talented Data Scientist for a contract opportunity. The role involves architecting and implementing scalable AI solutions, optimizing lightweight LLMs, and collaborating with cross-functional teams to build full-stack Gen AI experiences.

ConsultingInformation TechnologyLegalProfessional ServicesSoftwareStaffing Agency
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Architect and implement scalable AI Agents, Agentic Workflows and GenAI applications to address diverse and complex business use cases
Develop, fine-tune, and optimize lightweight LLMs; lead the evaluation and adaptation of models such as Claude (Anthropic), Azure OpenAI, and open-source alternatives
Design and deploy Retrieval-Augmented Generation (RAG) and Graph RAG systems using vector databases and knowledge bases
Curate enterprise data using connectors integrated with AWS Bedrock's Knowledge Base/Elastic
Implement solutions leveraging MCP (Model Context Protocol) and A2A (Agent-to-Agent) communication
Build and maintain Jupyter-based notebooks using platforms like SageMaker and MLFlow/Kubeflow on Kubernetes (EKS)
Collaborate with cross-functional teams of UI and microservice engineers, designers, and data engineers to build full-stack Gen AI experiences
Integrate GenAI solutions with enterprise platforms via API-based methods and GenAI standardized patterns
Establish and enforce validation procedures with Evaluation Frameworks, bias mitigation, safety protocols, and guardrails for production-ready deployment
Design & build robust ingestion pipelines that extract, chunk, enrich, and anonymize data from PDFs, video, and audio sources for use in LLM-powered workflows—leveraging best practices like semantic chunking and privacy controls
Orchestrate multimodal pipelines using scalable frameworks (e.g., Apache Spark, PySpark) for automated ETL/ELT workflows appropriate for unstructured media
Implement embeddings drives—map media content to vector representations using embedding models, and integrate with vector stores (AWS KnowledgeBase/Elastic/Mongo Atlas) to support RAG architectures

Qualification

Generative AIPythonRAGLLM-based solutionsAWS SageMakerVector databasesPrompt engineeringAI governanceCI/CD practicesCross-functional collaboration

Required

Generative AI, RAG, Python
Architect and implement scalable AI Agents, Agentic Workflows and GenAI applications to address diverse and complex business use cases
Develop, fine-tune, and optimize lightweight LLMs; lead the evaluation and adaptation of models such as Claude (Anthropic), Azure OpenAI, and open-source alternatives
Design and deploy Retrieval-Augmented Generation (RAG) and Graph RAG systems using vector databases and knowledge bases
Curate enterprise data using connectors integrated with AWS Bedrock's Knowledge Base/Elastic
Implement solutions leveraging MCP (Model Context Protocol) and A2A (Agent-to-Agent) communication
Build and maintain Jupyter-based notebooks using platforms like SageMaker and MLFlow/Kubeflow on Kubernetes (EKS)
Collaborate with cross-functional teams of UI and microservice engineers, designers, and data engineers to build full-stack Gen AI experiences
Integrate GenAI solutions with enterprise platforms via API-based methods and GenAI standardized patterns
Establish and enforce validation procedures with Evaluation Frameworks, bias mitigation, safety protocols, and guardrails for production-ready deployment
Design & build robust ingestion pipelines that extract, chunk, enrich, and anonymize data from PDFs, video, and audio sources for use in LLM-powered workflows—leveraging best practices like semantic chunking and privacy controls
Orchestrate multimodal pipelines using scalable frameworks (e.g., Apache Spark, PySpark) for automated ETL/ELT workflows appropriate for unstructured media
Implement embeddings drives—map media content to vector representations using embedding models, and integrate with vector stores (AWS KnowledgeBase/Elastic/Mongo Atlas) to support RAG architectures
BA or MS in AI/Data Science
Experience in AI/ML, with 3+ years in applied GenAI or LLM-based solutions
Deep expertise in prompt engineering, fine-tuning, RAG, GraphRAG, vector databases (e.g., AWS KnowledgeBase / Elastic), and multi-modal models
Proven experience with cloud-native AI development (AWS SageMaker, Bedrock, MLFlow on EKS)
Strong programming skills in Python and ML libraries (Transformers, LangChain, etc.)
Deep understanding of Gen AI system patterns and architectural best practices, Evaluation Frameworks
Demonstrated ability to work in cross-functional agile teams
Need Github Code Repository Link for each candidate. Please thoroughly vet the candidates
Published contributions or patents in AI/ML/LLM domains
Hands-on experience with enterprise AI governance and ethical deployment frameworks
Familiarity with CI/CD practices for ML Ops and scalable inference APIs

Benefits

Health insurance (medical, dental, vision)
401(k) plan
Paid sick leave (depending on work location)

Company

Pyramid Consulting, Inc

company-logo
Pyramid Consulting, a global leader in workforce and technology solutions, empowers individuals and organizations to transform and thrive in the most challenging and competitive markets.

H1B Sponsorship

Pyramid Consulting, Inc has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (178)
2024 (112)
2023 (95)
2022 (62)
2021 (50)
2020 (117)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Ramesh Maturu
President and Co-Founder
linkedin
leader-logo
Manish Kaushik
Chief Financial Officer
linkedin
Company data provided by crunchbase