Experience
Galileo Financial Technologies
AI Engineer (Contract) · Remote · Apr 2025 – Sep 2025
- Owned end-to-end MLOps architecture on AWS (SageMaker, ECR, ECS), defining deployment topology and CI/CD strategy for a 6-person engineering team - reduced operational cost by 25% through automated experiment tracking via Weights and Biases and DVC.
- Built and evaluated ML services on Amazon Bedrock and SageMaker, fine-tuning foundation models on proprietary financial data with evaluation pipelines that improved fraud detection precision by 40%.
LPL Financial
Software Engineer – (Applied AI Engineer) · Remote · Jan 2024 – Dec 2024
- Designed and led system architecture for an enterprise agentic RAG platform (LangGraph, LangChain, CrewAI) with short/long-term memory, making tradeoff decisions across latency, retrieval quality, and cost that cut manual document research time by 70% across 200+ financial advisors.
- Mentored 2 engineers on LLM integration patterns, prompt engineering, and evaluation frameworks establishing the team’s first structured code review process for AI-specific pull requests.
- Engineered production inference pipelines on SageMaker and Bedrock, fine-tuning Claude 3.5, Llama 3 (70B), and Gemma 3 with quantization and structured output, reduced end-to-end latency by 25%.
- Deployed cloud-native AI microservices on AWS (Lambda, S3, DynamoDB, ECR, ECS) with Terraform/Docker and full observability via OpenTelemetry, Grafana, and Prometheus—achieving 99.9% uptime on production inference endpoints.
AI VIET NAM
AI Researcher & Educator (Part-time) · Remote, Vietnam · Jan 2024 – Present
- Led curriculum design for end-to-end MLOps program (50+ students), covering data versioning, experiment tracking, SageMaker Pipelines, RAG with LangChain, vector databases, Bedrock, FastAPI, and Docker CI/CD.
- Researched agentic RAG architectures using LangGraph, investigating multi-agent collaboration and tool-use patterns for complex reasoning in financial and technical domains.
- Evaluated LLM serving optimization (FP8/INT4 quantization, speculative decoding), measuring impact on inference cost and perplexity; implemented semantic chunking strategies to maximize retrieval quality.
Ford Credit
Software Engineer · Remote · Feb 2022 – Dec 2023
- Designed high-availability backend microservices on GCP with Java, Spring Boot, and PostgreSQL serving 50K+ daily transactions; modernized Angular frontend dashboards improving data accessibility by 75%.
- Automated multi-environment infrastructure with Terraform and Tekton CI/CD; built SQL-driven dashboards for real-time KPI visibility, achieving 99.9% measured uptime across production services.
AeroVironment
Software Engineer · Simi Valley, CA · Jan 2020 – Feb 2022
- Architected autonomous UAV flight control system with optimized routing and real-time decision logic, reduced mission execution time by 25%—presenting design to cross-functional stakeholders for production approval.
- Built cross-platform control interface (C#/.NET backend, React/JS UI) with low-latency telemetry for real-time situational awareness.
California State University, Northridge
Research Assistant · Los Angeles, CA · Jan 2018 – May 2019
- Built modular automation framework on Raspberry Pi (Linux/Python) for autonomous robotics; engineered deployment pipelines reduced manual configuration time by 40%.
- Optimized lightweight computer vision algorithms for 1–4GB RAM edge hardware, enabling real-time robot tracking and object recognition.
