Writing by Haoming Koo | Applied AI systems and product builds

Apr 12, 2026

I Went from 64 to 100/100 on Smithery. Here's Every Fix.

The quality fixes that took a working MCP server from 64 to 100/100 on Smithery: tool descriptions, dot-notation naming, annotations, caching, smithery.yaml, and the skill that automates the checks.

Building AI MCP TypeScript Open Source

15 min

Apr 10, 2026

PDF Parsing for Complaint Forms — Docling vs PyMuPDF vs PaddleOCR

Seven open-source PDF parsers tested on complaint forms: scanned pages, handwriting, AcroForm fields, and degraded fax. Where each parser breaks, and which one I would use in production.

Building ML OCR Open Source

14 min

Mar 27, 2026

I Built an AI Wine Deal Finder — Here's What 50 Bottles Taught Me

How I built a wine price comparison tool for Singapore. Amex Platinum gives you S$200 back on wine, but are the base prices actually competitive? I answered that with Selenium scraping, bundle-aware comparisons, Brave-assisted Vivino resolution, and a guarded daily refresh pipeline.

Building AI Full-Stack Singapore

8 min

Mar 26, 2026

Building Job Hunter SG: AI Resume Coaching for Singapore

How I built a Singapore job search and resume product with nightly crawls, RAG matching, a 7-stage tailoring pipeline, and 5 validation gates to keep edits factual.

Building AI Full-Stack Singapore

10 min

Mar 22, 2026

What Mood Is the Market In? An Interactive Guide to HMM Regime Detection

Markets have moods. A Hidden Markov Model reads the volatility regime (calm, normal, or stormy) so you can trade with the weather, not against it. Interactive walkthrough with live visualizations.

Building ML Visualization Trading

10 min

Mar 21, 2026

DPO Interactive Demo: Your Language Model is Secretly a Reward Model

An interactive walkthrough of Direct Preference Optimization: how to align LLMs with human preferences using a classification loss. No separate reward model, no RL, no PPO.

Research ML Alignment Visualization

12 min

Mar 16, 2026

What I Learned from a Live LLM Serving Gauntlet

19 engineers, 19 A100 GPUs, two hours of Locust fire. I tuned 40+ vLLM configs and scraped peer metrics. Then someone with a 30-line cache proxy beat all of us.

LLMOps vLLM Part 2

15 min

Mar 16, 2026

LLM Inference: The Theory You Need Before Deploying

Prefill vs decode. Why memory bandwidth matters more than FLOPS. VRAM budget math. Quantization methods (GPTQ, AWQ, FP8). Batching strategies. The theory behind every LLM serving decision.

LLMOps Inference Part 1

15 min

Mar 13, 2026

Visualizing Weak-Driven Learning: An Interactive WMSS Demo

An interactive walkthrough of the WMSS paper, from SFT saturation to logit mixing and gradient amplification. Use the sliders to see how weak agents can improve stronger agents.

Research ML Visualization

10 min

Mar 12, 2026

How 468 Facial Landmarks Decide If You're Passport-Ready

MediaPipe face mesh, rembg segmentation, 25+ compliance checks, and per-country rules for 6 countries. Here's how Photo ID Studio decides if your face is passport-worthy.

Building Computer Vision FastAPI OpenCV

16 min

Mar 12, 2026

How I Built an AI-Powered Stock Market Dashboard

A deep dive into Trader Koo: using a pre-trained YOLOv8 model to detect chart patterns, solving the coordinate-mapping problem, and running a five-layer detection ensemble every night.

Building YOLOv8 FastAPI Computer Vision

12 min

Mar 11, 2026

I Tried to Predict Singapore's Rain and It Humbled Me

Building LionWeather: tropical weather data from 2016 to 2024, LightGBM rainfall forecasting, SHAP explainability, animated radar, and a whole lot of lessons about Singapore weather.

Building ML FastAPI EDA

18 min