Apply on Employer Site

Smart Folks Inc · 13 hours ago

Senior Full Stack AI Engineer

Tampa, FL

Full-time

Onsite

Senior Level, Lead/Staff

8+ years exp

Smart Folks Inc is seeking a highly skilled and motivated AI Engineer with a strong focus on Generative AI and Natural Language Processing (NLP). The ideal candidate will be responsible for designing, developing, and deploying AI use cases that involve searching, summarizing, and creating themes from extensive document repositories.

Information Technology & Services

Growth Opportunities

H1B Sponsor Likely

Responsibilities

Apply depth of knowledge and expertise to all aspects of the software development lifecycle, as well as partner continuously with stakeholders on a regular basis

Develop and engineer solutions within an Agile software delivery team, working to collaboratively deliver sprint goals, write code, and participate in the broader Citi technical community and team-level Agile and Scrum processes

Contribute to the design, documentation, and development of world-class enterprise applications leveraging the latest technologies and software design patterns

Leverage technical knowledge of concepts and procedures within own area and basic knowledge of other areas to resolve issues, as necessary

Design and Development: Lead the design and implementation of end-to-end AI/ML pipelines for document understanding, summarization, and theme extraction. This includes data preprocessing, feature engineering, model training, evaluation, and deployment

Generative AI Application: Develop and optimize LLM-based solutions for text summarization, content generation, and knowledge extraction from structured/unstructured data

RAG System Implementation: Build and maintain robust Retrieval-Augmented Generation (RAG) pipelines, leveraging vector databases and advanced indexing strategies to ensure accurate and contextually relevant information retrieval

Model Tuning and Optimization: Apply advanced GenAI tuning techniques such as QLORA, LORA, and PEFT to fine-tune pre-trained LLMs for specific use cases, optimizing for performance, efficiency, and accuracy

Vector Search and Embeddings: Implement and optimize vector search capabilities and embedding pipelines to enhance the efficiency and relevance of document searches and information retrieval

Prompt Engineering: Develop and refine prompts to maximize the performance and accuracy of language models

Collaboration: Work closely with cross-functional teams, including product managers, AI model teams and other engineers, to understand business requirements and translate them into scalable AI/ML solutions

Deployment and MLOps: Deploy and monitor AI models in production environments, ensuring scalability, reliability, and maintainability. Contribute to MLOps practices for model versioning, continuous deployment, and monitoring

Research and Innovation: Stay abreast of the latest advancements in Generative AI, NLP, and machine learning, and actively identify opportunities to integrate new techniques and tools into our products and services

AI-Driven Development : Leverage AI tools, such as GitHub Copilot, to enhance development efficiency, accelerate delivery timelines, and optimize software solutions

Problem Solving and Troubleshooting : Possess the expertise to analyze and effectively troubleshoot complex coding, application performance, and design challenges

Root Cause Analysis : Capable of conducting thorough research to identify the root causes of development and performance issues, as well as devising and implementing effective defect resolutions

Technical Acumen : Demonstrate a profound understanding of the technical requirements pertinent to the solutions under development

Containerization and Orchestration : Utilize Docker for application containerization and Kubernetes for efficient service orchestration

Communication and Risk Management : Effectively communicate progress, proactively anticipate bottlenecks, provide skilled escalation management, and adeptly identify, assess, track, and mitigate issues and risks across various levels

Process Optimization : Streamline, automate, or eliminate redundant processes within architecture, build, delivery, production operations, or business areas where similar efforts or issues recur annually

Qualification

Generative AINatural Language ProcessingLarge Language ModelsRetrieval-Augmented GenerationPythonMicroservicesAI toolsVector databasesNLP techniquesCI/CDAgile methodologiesProcess OptimizationTechnical AcumenProblem SolvingCommunicationCollaboration

Required

Minimum of 8 years of proven software development experience

In-depth knowledge of modern application architecture principles

Clear understanding of Data Structures and Object Oriented Principles

Practical experience with Artificial Intelligence (AI) tools for enhancing development workflows

Proficiency in Microservices frameworks Event-Driven Services, and Cloud-Native Application Development

Multiple years of experience on Service Oriented and Microservices architectures, including REST and GraphQL implementations

Expert-level proficiency in Python and relevant libraries (e.g., FastAPI, Pydantic, PyTorch, HuggingFace Ecosystem)

Proven experience in building and deploying applications using Large Language Models

Hands-on experience with vector databases and developing embedding pipelines

Strong understanding and practical experience with Retrieval-Augmented Generation (RAG) frameworks (e.g., LangChain, LlamaIndex)

Experience with generative AI tuning techniques such as QLORA, LORA, and PEFT

Practical experience with Agentic Workflows, and Model Context Protocol (MCP) for enhancing development workflows

Strong hands-on experience with NLP techniques such as text classification, summarization, and topic modeling

Demonstrated ability to design, develop, and maintain both front-end and back-end components of robust web applications

Strong expertise in developing intuitive user interfaces using contemporary JavaScript frameworks (e.g., React), HTML5, and CSS

Solid experience in developing server-side logic and APIs using languages Python, Java, or similar

Comprehensive knowledge of SQL and PL/SQL, with a deep understanding of Relational Database Management Systems (RDBMS), particularly Oracle

Proven capability in designing, developing, and implementing high-performance RESTful APIs leveraging appropriate frameworks and technologies

Proficiency with Continuous Integration/Continuous Deployment (CI/CD) pipelines and tools for building (e.g., Maven, Gradle) and deploying code (e.g., Docker, Jenkins, OpenShift)

Experience with AWS is considered a significant advantage

Practical experience working within Agile development methodologies and utilizing project management tools such as JIRA

Ability to develop and automate comprehensive unit, integration, and end-to-end tests to ensure code quality

Solid understanding and practical experience with code versioning tools, including GitHub Enterprise

Bachelor's or Master's degree in Computer Science, Artificial Intelligence, Machine Learning, Data Science, or a related field

Company

Smart Folks Inc

Smart Folk’s Inc established in early 2011 is a strategic consulting, technical staffing services company headquartered in McKinney Texas.

Founded in 2011

McKinney, Texas, US

501-1000 employees

https://smartfolksinc.com/

H1B Sponsorship

Smart Folks Inc has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2025 (9)

2024 (13)

2023 (7)

2022 (4)

2021 (21)

2020 (17)

Funding

Current Stage

Late Stage

Leadership Team

Lalitha Sneha Nandyala

Founder and President

Company data provided by crunchbase