Artificial Intelligence Engineer
Job Title: Artificial Intelligence (AI) Engineer
Location: Remote.
Employment Type: W2 on ExpediteInfoTech's payroll. This position requires a Permanent resident or a U.S. citizen. The selected candidate will go through a Public Trust Clearance process.
Position Overview:
A backend-focused AI engineer responsible for developing secure, scalable, and production-grade AI applications, with deep experience in LLM integration, retrieval-augmented generation (RAG) pipelines, and cloud-based LLM Ops workflows. The role emphasizes Amazon SageMaker Studio, Pipelines, and Model Registry for operationalizing large language models within FedRAMP-compliant AWS environments.
Required Qualifications:
- 8+ years of IT experience.
- 3+ years of experience as an AI Engineer
- 3+ years of experience in AWS
- AWS Services: EC2 (GPU-enabled), SageMaker (Studio, Pipelines, Endpoints, Model Registry), Bedrock, OpenSearch Vector DB, Systems Manager, Load Balancers, Amazon - Bedrock, OpenSearch Vector Database, PineCone, knowledgebase, lambda, API Gateway, FASTAPIs or Flask, SQS, SNS, Step functions, DynamoDB, RDS/Postgres SQL, and EKS or Fargate
- Programming: Advanced Python (async, FastAPI, LangChain, Transformers)
- DevSecOps: Docker, GitHub, GitHub Actions, CI/CD pipelines
- Frontend Prototyping: Streamlit, Figma, or similar frameworks for AI interaction demos
- Cloud-Native Development: Infrastructure-as-Code, cloud monitoring, and security policies
Key Responsibilities
AI Solution Development:
- Expert hands-on knowledge in RAG architectures to handle multiple complex data formats (PDF, images, tables, Word documents, Excel, acronyms, attachments, etc.) to create cleansed standardized data for hydration into a vector database.
- Expert hands-on knowledge on text embeddings, image embeddings, chunking logic, metadata creation, and embedding vectors indexing.
- Expert hands-on knowledge in creating a highly accurate RAG retrieval system with knowledge on reranking, semantic search, similarity search, hybrid search, etc. to search by text or images.
- Implement secure, scalable, highly accurate RAG pipelines using LlamaIndex, Haystack frameworks, or AWS-native services like Bedrock, OpenSearch Vector Database, and Knowledgebase.
- Create backend infrastructure for chatbot applications with long-term and short-term memory capabilities to improve user experience.
- Hands-on knowledge of creating APIs, RAGGraph, develop agentic AI workflows, and intelligent automation solutions.
AI/ML Skills:
- Experience operationalizing AI/ML pipelines in SageMaker Studio with model governance
- Experience with Amazon - Bedrock, OpenSearch Vector Database, PineCone, knowledgebase, lambda, API Gateway, FASTAPIs or Flask, SQS, SNS, Step functions, DynamoDB, RDS/Postgres SQL, and EKS or Fargate.
- Prompt engineering, LLM evaluation methodologies, bias detection, and hallucination detection.
LLM Integration & LLM Ops:
- Integrate multiple LLMs via APIs (AWS Bedrock: Anthropic - Claude, Titan, Llama, Stability Diffusion models)
- Deploy self-hosted open-source LLMs (e.g., Llama, Falcon, Mistral) on GPU-enabled EC2 or SageMaker Endpoints
- Implement structured prompt engineering frameworks, response evaluation tools, and feedback loops
- Build model optimization layers, including prompt selectors, model switchers, and cache layers
Cloud Infrastructure & Deployment:
- Deploy AI services using SageMaker, EC2, Systems Manager, and Elastic Load Balancers
- Containerize backend systems with Docker and deploy to scalable environments using ECS/EKS
- Implement CI/CD pipelines via GitHub Actions integrated with AWS Systems Manager and CodePipeline
- Architect solutions for VPC isolation, IAM hardening, and FedRAMP High compliance
About: Headquartered in Rockville, MD, since 2012, ExpediteInfoTech, Inc. (EIT) provides specialized technical, cybersecurity, IT, and financial advisory solutions to the Federal, State, and County governments. Our clients include the US Department of Education, US Department of Transportation, US Department of Justice, US Department of Health & Human Services, Montgomery County government, Prince George's County Government, the State of Maryland, and the District of Columbia. EIT is appraised at level 3 for CMMI Services & CMMI Development, as well as ISO 9001:2015, ISO 20000-1:201,8 and ISO 27001:2013.
EIT offers a competitive benefits package that includes medical, dental, vision, and prescription drug coverage, paid time off, federal holidays, a matching 401 (k) plan, and tuition/professional development reimbursement benefits.
EIT is an equal opportunity employer, and all qualified applicants will be considered for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by applicable law.
ExpediteInfoTech, Inc. is an Equal Opportunity Employer. Please review the position details, including location, work authorization status, and other government contractual requirements.
Your application will be best considered if you have your full legal name, current location, phone number, email address, and work authorization status.