SIGN IN
Senior Full Stack AI Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Smart Folks Inc · 13 hours ago

Senior Full Stack AI Engineer

Smart Folks Inc is seeking a highly skilled and motivated AI Engineer with a strong focus on Generative AI and Natural Language Processing (NLP). The ideal candidate will be responsible for designing, developing, and deploying AI use cases that involve searching, summarizing, and creating themes from extensive document repositories.
Information Technology & Services
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Apply depth of knowledge and expertise to all aspects of the software development lifecycle, as well as partner continuously with stakeholders on a regular basis
Develop and engineer solutions within an Agile software delivery team, working to collaboratively deliver sprint goals, write code, and participate in the broader Citi technical community and team-level Agile and Scrum processes
Contribute to the design, documentation, and development of world-class enterprise applications leveraging the latest technologies and software design patterns
Leverage technical knowledge of concepts and procedures within own area and basic knowledge of other areas to resolve issues, as necessary
Design and Development:  Lead the design and implementation of end-to-end AI/ML pipelines for document understanding, summarization, and theme extraction. This includes data preprocessing, feature engineering, model training, evaluation, and deployment
Generative AI Application:  Develop and optimize LLM-based solutions for text summarization, content generation, and knowledge extraction from structured/unstructured data
RAG System Implementation:  Build and maintain robust Retrieval-Augmented Generation (RAG) pipelines, leveraging vector databases and advanced indexing strategies to ensure accurate and contextually relevant information retrieval
Model Tuning and Optimization:  Apply advanced GenAI tuning techniques such as QLORA, LORA, and PEFT to fine-tune pre-trained LLMs for specific use cases, optimizing for performance, efficiency, and accuracy
Vector Search and Embeddings:  Implement and optimize vector search capabilities and embedding pipelines to enhance the efficiency and relevance of document searches and information retrieval
Prompt Engineering:  Develop and refine prompts to maximize the performance and accuracy of language models
Collaboration:  Work closely with cross-functional teams, including product managers, AI model teams and other engineers, to understand business requirements and translate them into scalable AI/ML solutions
Deployment and MLOps:  Deploy and monitor AI models in production environments, ensuring scalability, reliability, and maintainability. Contribute to MLOps practices for model versioning, continuous deployment, and monitoring
Research and Innovation:  Stay abreast of the latest advancements in Generative AI, NLP, and machine learning, and actively identify opportunities to integrate new techniques and tools into our products and services
AI-Driven Development : Leverage AI tools, such as GitHub Copilot, to enhance development efficiency, accelerate delivery timelines, and optimize software solutions
Problem Solving and Troubleshooting : Possess the expertise to analyze and effectively troubleshoot complex coding, application performance, and design challenges
Root Cause Analysis : Capable of conducting thorough research to identify the root causes of development and performance issues, as well as devising and implementing effective defect resolutions
Technical Acumen : Demonstrate a profound understanding of the technical requirements pertinent to the solutions under development
Containerization and Orchestration : Utilize Docker for application containerization and Kubernetes for efficient service orchestration
Communication and Risk Management : Effectively communicate progress, proactively anticipate bottlenecks, provide skilled escalation management, and adeptly identify, assess, track, and mitigate issues and risks across various levels
Process Optimization : Streamline, automate, or eliminate redundant processes within architecture, build, delivery, production operations, or business areas where similar efforts or issues recur annually

Qualification

Generative AINatural Language ProcessingLarge Language ModelsRetrieval-Augmented GenerationPythonMicroservicesAI toolsVector databasesNLP techniquesCI/CDAgile methodologiesProcess OptimizationTechnical AcumenProblem SolvingCommunicationCollaboration

Required

Minimum of 8 years of proven software development experience
In-depth knowledge of modern application architecture principles
Clear understanding of Data Structures and Object Oriented Principles
Practical experience with Artificial Intelligence (AI) tools for enhancing development workflows
Proficiency in Microservices frameworks Event-Driven Services, and Cloud-Native Application Development
Multiple years of experience on Service Oriented and Microservices architectures, including REST and GraphQL implementations
Expert-level proficiency in Python and relevant libraries (e.g., FastAPI, Pydantic, PyTorch, HuggingFace Ecosystem)
Proven experience in building and deploying applications using Large Language Models
Hands-on experience with vector databases and developing embedding pipelines
Strong understanding and practical experience with Retrieval-Augmented Generation (RAG) frameworks (e.g., LangChain, LlamaIndex)
Experience with generative AI tuning techniques such as QLORA, LORA, and PEFT
Practical experience with Agentic Workflows, and Model Context Protocol (MCP) for enhancing development workflows
Strong hands-on experience with NLP techniques such as text classification, summarization, and topic modeling
Demonstrated ability to design, develop, and maintain both front-end and back-end components of robust web applications
Strong expertise in developing intuitive user interfaces using contemporary JavaScript frameworks (e.g., React), HTML5, and CSS
Solid experience in developing server-side logic and APIs using languages Python, Java, or similar
Comprehensive knowledge of SQL and PL/SQL, with a deep understanding of Relational Database Management Systems (RDBMS), particularly Oracle
Proven capability in designing, developing, and implementing high-performance RESTful APIs leveraging appropriate frameworks and technologies
Proficiency with Continuous Integration/Continuous Deployment (CI/CD) pipelines and tools for building (e.g., Maven, Gradle) and deploying code (e.g., Docker, Jenkins, OpenShift)
Experience with AWS is considered a significant advantage
Practical experience working within Agile development methodologies and utilizing project management tools such as JIRA
Ability to develop and automate comprehensive unit, integration, and end-to-end tests to ensure code quality
Solid understanding and practical experience with code versioning tools, including GitHub Enterprise
Bachelor's or Master's degree in Computer Science, Artificial Intelligence, Machine Learning, Data Science, or a related field

Company

Smart Folks Inc

twitter
company-logo
Smart Folk’s Inc established in early 2011 is a strategic consulting, technical staffing services company headquartered in McKinney Texas.

H1B Sponsorship

Smart Folks Inc has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9)
2024 (13)
2023 (7)
2022 (4)
2021 (21)
2020 (17)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Lalitha Sneha Nandyala
Founder and President
linkedin
Company data provided by crunchbase