Jae Choi

“Every good gift and every perfect gift is from above, coming down from the Father of lights.” — James 1:17

Hi, I’m Jae 👋

Machine Learning Engineer & AI Specialist with 8+ years of experience delivering production-grade ML systems. I specialize in LLM fine-tuning, RAG architectures, and MLOps at scale.

Currently at Walmart Global Tech, I lead initiatives that have:

Impact	Metric
💰 Cost Savings	$250K+ annually across 15+ projects
⚡ Efficiency	38% reduction in development time
📈 Revenue	$1.5M+ additional annual revenue
🎯 Accuracy	15% improvement in predictive models

What I Do

🤖 LLM & RAG Systems

Building production multimodal RAG pipelines with Llama, GPT, and Gemini. Designing data-centric chatbots that scale.

⚙️ MLOps & Platform Engineering

Architecting Kubernetes-based ML platforms with automated CI/CD, model monitoring, and continuous learning feedback loops.

🎙️ Voice AI & Agents

Engineering low-latency voice agents with streaming ASR-to-LLM pipelines, achieving sub-4s response times at $0.05/conversation.

Experience

Machine Learning Engineer | AI Engineer

Walmart Global Tech · Oct 2023 – Present

Led design and rollout of standardized MLOps template integrating multimodal RAG and data-centric chatbots (Llama, GPT, Gemini-Pro)
Established continuous learning feedback layer to monitor metrics, capture inference data, and automate retraining
Impact: Cut development time by 38%, improved team efficiency by 30%, achieved $250K+ annual savings across 15+ production projects

Data Scientist

Walmart Global Tech · May 2021 – Oct 2023

Architected Kubernetes-based automated pipeline integrating model logic, monitoring, and CI/CD with testing and code reviews
Transformed model adoption from 8% to 14%, improved predictive accuracy by 15%, driving $1.5M additional annual revenue
Eliminated critical downtimes, reduced manual effort by 40%, cut cloud costs by 20% — saving $120K annually

Machine Learning Engineer Lead

Mozzine · Sept 2020 – Feb 2021

Built large-scale, high-quality dataset via custom scraping and cleaning pipelines with novel reconstruction algorithms
Improved core model accuracy by 20%, driving 5% boost in customer conversions and enabling higher-value pricing tier

Featured Projects

🎙️ Voice-to-Voice Health Insurance Agent

Real-time voice AI for health insurance inquiries

Engineered streaming ASR-to-LLM pipeline with multimodel routing (GPT, Llama3, Qwen, Gemma) and LoRA-based domain adaptation
Results: p95 latency of 2-4s, 92% QA accuracy, 80-88% multi-turn task completion
Cost: Only $0.05-$0.35 per five-turn conversation

🧠 KB Agent (Knowledge Base Assistant)

Enterprise multimodal knowledge agent

Designed modular RAG architecture with dual embeddings (text + CLIP), HNSW + IVF vector search, and validity-gated answer generation
Results: 60% reduction in lookup time, answer accuracy improved from 32% to 92%
Scale: Supported 10K daily users, driving $53K in revenue

Technical Skills

Languages & Frameworks

Python · C++ · Java · Flask · FastAPI

AI / ML / Deep Learning

GPT · Llama · Gemini · Qwen · LangChain · LlamaIndex
TensorFlow · PyTorch · Keras · Scikit-Learn · OpenCV

MLOps & Cloud Platforms

Kubeflow · Vertex AI · MLflow · Databricks · Airflow
Azure · GCP · AWS · Kubernetes · Docker · Git

Data & Storage

PostgreSQL · MySQL · BigQuery · Redis · Cosmos DB
Spark · Dask · Ray · Milvus · Chroma

Education

🎓 M.S. Robotics & Autonomous Systems (AI & CS concentration)
Arizona State University · GPA 4.0/4.0 · Dec 2020

Awards & Recognition

Award	Year
🥇 1st Place — AWS \| State Farm Hackathon	2020
🥇 1st Place — MESCON	2014
🎖️ Engineering Graduate Fellowship — ASU	2019-2020
📚 Academic Scholarship — UE	2010-2015

Publications

📄 Scan & Go — US Patent No. 20240029119 (USPTO)

📄 Production And Measurement Of Four Degree Of Freedom Photonic States — Semantic Scholar

Let’s Connect

I’m always open to discussing ML engineering roles, AI architecture consulting, or collaboration opportunities.

📬 Get in Touch

📧 jaehyuk0325@gmail.com
💼 LinkedIn
🐙 GitHub

Support This Blog ☕

If you’ve found my articles, tutorials, or insights helpful in your ML journey, consider buying me a coffee! Your support helps me continue creating quality content.

💝 Ways to Support

☕ Buy Me a Coffee

Every contribution, no matter how small, is deeply appreciated and goes directly toward creating more helpful content for the ML community.

About This Blog

“It is easily forgettable that we are running into death from birth.”

This blog is my space for reflections, technical deep-dives, and lessons learned from 8+ years in the trenches of production ML. I share what I wish I had known earlier — the failures, the wins, and everything in between.

Topics I write about:

🧠 LLM fine-tuning & prompt engineering
🔧 MLOps best practices & war stories
📊 System design for ML at scale
💡 Career advice for ML engineers

Happy to be criticized to mature. Iron sharpens iron.