About me

Tuan Quang

AI Engineer | Machine Learning | AI-Driven Solutions

Senior AI Engineer with 7+ years in software engineering and production LLM systems, multi-agent pipelines, and cloud-native MLOps infrastructure. Architected agentic RAG platforms processing 500K+ documents that cut research time by 70%, designed inference pipelines serving 10K+ daily financial API requests at subsecond latency, and published two peer-reviewed papers in multimodal AI. Proven track record of owning system architecture end-to-end, mentoring engineering teams, and driving build-vs-buy decisions across LangGraph, LangSmith, AWS Bedrock/SageMaker, and modern orchestration tooling.

My core strengths include:

Core — LangGraph, LangChain, PyTorch, AWS Bedrock/SageMaker, FAISS, Neo4j, FastAPI, Docker, Kubernetes, Terraform
LLM & AI Systems — CrewAI, AutoGen, LlamaIndex, RAG (Agentic/Corrective/Self-RAG), Prompt Engineering, Function Calling/Tool Use, Structured Output, Guardrails, LLM Evaluation, MCP Server
ML & Deep Learning — TensorFlow, JAX, Hugging Face (Transformers, PEFT, Accelerate), Scikit-learn, OpenCV, Fine-tuning (QLoRA/LoRA), Quantization (GPTQ/AWQ/GGUF), ONNX, W&B, MLflow
Data & Infrastructure — Pinecone, Chroma, Weaviate, PostgreSQL, MongoDB, DynamoDB, Kafka, PySpark, Airflow, DVC, Feast, OpenTelemetry, Grafana, Prometheus, GitHub Actions, CI/CD
Languages — Python (Proficient), Java, JavaScript/TypeScript, SQL, C++, C#

Galileo Financial Technologies