Jae Choi

“Every good gift and every perfect gift is from above, coming down from the Father of lights.” — James 1:17


Hi, I’m Jae 👋

Machine Learning Engineer & AI Specialist with 8+ years of experience delivering production-grade ML systems. I specialize in LLM fine-tuning, RAG architectures, and MLOps at scale.

Currently at Walmart Global Tech, I lead initiatives that have:

Impact Metric
💰 Cost Savings $250K+ annually across 15+ projects
Efficiency 38% reduction in development time
📈 Revenue $1.5M+ additional annual revenue
🎯 Accuracy 15% improvement in predictive models

What I Do

🤖 LLM & RAG Systems

Building production multimodal RAG pipelines with Llama, GPT, and Gemini. Designing data-centric chatbots that scale.

⚙️ MLOps & Platform Engineering

Architecting Kubernetes-based ML platforms with automated CI/CD, model monitoring, and continuous learning feedback loops.

🎙️ Voice AI & Agents

Engineering low-latency voice agents with streaming ASR-to-LLM pipelines, achieving sub-4s response times at $0.05/conversation.


Experience

Machine Learning Engineer | AI Engineer

Walmart Global Tech · Oct 2023 – Present

  • Led design and rollout of standardized MLOps template integrating multimodal RAG and data-centric chatbots (Llama, GPT, Gemini-Pro)
  • Established continuous learning feedback layer to monitor metrics, capture inference data, and automate retraining
  • Impact: Cut development time by 38%, improved team efficiency by 30%, achieved $250K+ annual savings across 15+ production projects

Data Scientist

Walmart Global Tech · May 2021 – Oct 2023

  • Architected Kubernetes-based automated pipeline integrating model logic, monitoring, and CI/CD with testing and code reviews
  • Transformed model adoption from 8% to 14%, improved predictive accuracy by 15%, driving $1.5M additional annual revenue
  • Eliminated critical downtimes, reduced manual effort by 40%, cut cloud costs by 20% — saving $120K annually

Machine Learning Engineer Lead

Mozzine · Sept 2020 – Feb 2021

  • Built large-scale, high-quality dataset via custom scraping and cleaning pipelines with novel reconstruction algorithms
  • Improved core model accuracy by 20%, driving 5% boost in customer conversions and enabling higher-value pricing tier

🎙️ Voice-to-Voice Health Insurance Agent

Real-time voice AI for health insurance inquiries

  • Engineered streaming ASR-to-LLM pipeline with multimodel routing (GPT, Llama3, Qwen, Gemma) and LoRA-based domain adaptation
  • Results: p95 latency of 2-4s, 92% QA accuracy, 80-88% multi-turn task completion
  • Cost: Only $0.05-$0.35 per five-turn conversation

🧠 KB Agent (Knowledge Base Assistant)

Enterprise multimodal knowledge agent

  • Designed modular RAG architecture with dual embeddings (text + CLIP), HNSW + IVF vector search, and validity-gated answer generation
  • Results: 60% reduction in lookup time, answer accuracy improved from 32% to 92%
  • Scale: Supported 10K daily users, driving $53K in revenue

Technical Skills

Languages & Frameworks

Python · C++ · Java · Flask · FastAPI

AI / ML / Deep Learning

GPT · Llama · Gemini · Qwen · LangChain · LlamaIndex
TensorFlow · PyTorch · Keras · Scikit-Learn · OpenCV

MLOps & Cloud Platforms

Kubeflow · Vertex AI · MLflow · Databricks · Airflow
Azure · GCP · AWS · Kubernetes · Docker · Git

Data & Storage

PostgreSQL · MySQL · BigQuery · Redis · Cosmos DB
Spark · Dask · Ray · Milvus · Chroma

Education

🎓 M.S. Robotics & Autonomous Systems (AI & CS concentration)
Arizona State University · GPA 4.0/4.0 · Dec 2020


Awards & Recognition

Award Year
🥇 1st Place — AWS | State Farm Hackathon 2020
🥇 1st Place — MESCON 2014
🎖️ Engineering Graduate Fellowship — ASU 2019-2020
📚 Academic Scholarship — UE 2010-2015

Publications

📄 Scan & GoUS Patent No. 20240029119 (USPTO)

📄 Production And Measurement Of Four Degree Of Freedom Photonic StatesSemantic Scholar


Let’s Connect

I’m always open to discussing ML engineering roles, AI architecture consulting, or collaboration opportunities.

📬 Get in Touch

📧 jaehyuk0325@gmail.com
💼 LinkedIn
🐙 GitHub


Support This Blog ☕

If you’ve found my articles, tutorials, or insights helpful in your ML journey, consider buying me a coffee! Your support helps me continue creating quality content.

💝 Ways to Support

☕ Buy Me a Coffee

Every contribution, no matter how small, is deeply appreciated and goes directly toward creating more helpful content for the ML community.


About This Blog

“It is easily forgettable that we are running into death from birth.”

This blog is my space for reflections, technical deep-dives, and lessons learned from 8+ years in the trenches of production ML. I share what I wish I had known earlier — the failures, the wins, and everything in between.

Topics I write about:

  • 🧠 LLM fine-tuning & prompt engineering
  • 🔧 MLOps best practices & war stories
  • 📊 System design for ML at scale
  • 💡 Career advice for ML engineers

Happy to be criticized to mature. Iron sharpens iron.