About me
Tuan Quang
AI Engineer | Machine Learning | AI-Driven Solutions
Senior AI Engineer with 7+ years of experience across software engineering, production LLM systems, multi-agent pipelines, and cloud-native MLOps infrastructure. Architected agentic RAG platforms processing 500K+ documents that cut research time by 70%, designed inference pipelines serving 10K+ daily financial API requests at sub-second latency, and published two peer-reviewed papers in multimodal AI. Proven track record of owning system architecture end-to-end, mentoring engineering teams, and driving build-vs-buy decisions across LangGraph, LangSmith, AWS Bedrock/SageMaker, and modern orchestration tooling.
My core strengths include:
- Core — LangGraph, LangChain, PyTorch, AWS Bedrock/SageMaker, FAISS, Neo4j, FastAPI, Docker, Kubernetes, Terraform
- LLM & AI Systems — CrewAI, AutoGen, LlamaIndex, RAG (Agentic/Corrective/Self-RAG), Prompt Engineering, Function Calling/Tool Use, Structured Output, Guardrails, LLM Evaluation, MCP Server
- ML & Deep Learning — TensorFlow, JAX, Hugging Face (Transformers, PEFT, Accelerate), Scikit-learn, OpenCV, Fine-tuning (QLoRA/LoRA), Quantization (GPTQ/AWQ/GGUF), ONNX, W&B, MLflow
- Data & Infrastructure — Pinecone, Chroma, Weaviate, PostgreSQL, MongoDB, DynamoDB, Kafka, PySpark, Airflow, DVC, Feast, OpenTelemetry, Grafana, Prometheus, GitHub Actions, CI/CD
- Languages — Python (Proficient), Java, JavaScript/TypeScript, SQL, C++, C#

Galileo Financial Technologies
