All Posts

50 posts

Reinforcement Learning in LLMs - Why and How

From imitation to optimization: when LLMs need RL, how verifiable rewards unlock reasoning, and a minimal GRPO playbook.

September 18, 2025•9 min read

•

Building a Custom Markdown Pipeline with Rich Embeds

How we built a custom markdown pipeline that handles LaTeX math, image galleries, and rich embeds while keeping content in plain .md files—no MDX required.

May 30, 2025•8 min read

•

List of Advice

A living collection of advice from mentors, friends, and books.

March 31, 2024•9 min read

•

Why High-Dimensional Gaussians Feel Like Soap Bubbles

Concentration of measure pushes Gaussian samples onto a thin shell—here's the intuition, the math, and why typicality matters for generative models.

September 15, 2023•6 min read

•

From Transformers to ChatGPT

This note provides a high-level summary of the progress in large language models (LLMs) covering major milestones from Transformers to ChatGPT. The note serves as a fast-paced recap for readers to catch up on this field quickly.

December 29, 2022•36 min read

•

Exponential-Min and Gumbel-Max

Exponential-min and Gumbel-max tricks for reformulating sampling from a discrete distribution as argmin and argmax, making the sampling operation differentiable.

January 1, 2019•2 min read

•

Expectation-Maximization Algorithm in 10 Minutes

A quick walk-through of the Expectation-Maximization (EM) algorithm and its cousins.

December 15, 2017•11 min read

•

From PPO to DPO (and GRPO)

PPO made RLHF work; DPO made it simple. This post derives DPO from PPO, explains why it’s a supervised alternative (not RL), where it shines, and where RL/GRPO still helps.

September 17, 2025•5 min read

•

Reparameterization vs REINFORCE

You know how to differentiate through a function—but how do you differentiate through a sampling step? Two estimators: score‑function (REINFORCE) and pathwise (reparameterization); pathwise backpropagates through the sampling transform with lower variance.

September 17, 2025•4 min read

•

Speculative Decoding: Exact Speedups with Draft Models

Can we speed up generation without changing the distribution? A small draft model proposes, the big model accepts/rejects—yielding exact samples, faster.

September 17, 2025•3 min read

•

Maximum Entropy and Maximum Likelihood

Why that particular sigmoid in logistic regression? This short post shows how simple moment constraints lead to exponential families (MaxEnt chooses the model) and how MLE fits them.

September 16, 2025•6 min read

•

Life Is Short, a Reading List

A living list of books, essays, and videos that help me keep perspective on life.

December 9, 2024•1 min read

•

The Physics–Music Dilemma

Music notation gives us a tidy grid of notes, but physics delivers a messy spectrum of vibrations. Here's why tuning is always a compromise.

June 15, 2024•3 min read

•

Quantitative Tech Interview Preparation Guide

A short list of interview preparation resources for Data Scientists, Machine Learning Engineers, Machine Learning Scientists, Quant Developers and Quant Researchers.

May 5, 2018•6 min read

•

Hume’s Law

A quick riff on Hume's is–ought gap—why facts don't dictate values, and how the leap from 'is' to 'ought' rests on sentiment.

February 5, 2025•1 min read

•

Dopamine

There has been a lot of confusing information about dopamine. I finally found a literature review-style article, and here is what I learned.

December 15, 2024•5 min read

•

London

A January dash to London to finally see Jay Chou live, with museums, parks, and good meals along the way.

January 10, 2024•1 min read

•

Italy

In October 2021, we spent two weeks traveling to various cities in Italy, including Rome, Cinque Terre, Florence, Tuscany, and Venice. This was our first trip to Italy, and we have documented our journey with a report and photos.

November 10, 2021•9 min read

•

Utah

Trip report (itinerary and photos) from our recent trip to southern Utah (Zion, Arches, Canyonlands and Bryce).

August 15, 2021•7 min read

•

Diffusion Models

This is the first post in what will hopefully be a series walking through diffusion models. It introduces the foundations, focusing on two foundational papers that many later papers build upon.

June 6, 2023•5 min read

•

Building LLM-Powered Products

A quick note on a few topics related to building LLM-powered products and applications: how to let LLMs use tools and become autonomous agents, how to incorporate domain adaptation, and the production hurdles.

April 23, 2023•8 min read

•

How Does Auto-GPT Work?

In this note, we'll take a look at how Auto-GPT works and discuss LLMs' ability to do explicit reasoning and to become autonomous agents. We'll touch upon a few related works such as WebGPT, Toolformer, and LangChain.

April 9, 2023•5 min read

•

Next.js: Firebase Authentication and Middleware for API Routes

Building a Next.js app with Firebase authentication on the client side, and using it on the server side with a middleware pattern similar to Express.js.

February 28, 2021•6 min read

•

Recent Progress in Language Modeling

This page is a high-level summary of recent results in language modeling, with little explanation.

October 9, 2018•3 min read

•

NLP Starter Resources

A list of starter resources for Natural Language Processing (NLP), mostly with deep learning.

June 30, 2018•1 min read

•

Recent Progress in Neural Variational Inference

A literature survey of recent papers on Neural Variational Inference (NVI) and its application in topic modeling.

March 8, 2018•1 min read

•

A Brief Survey of Generative Models

A high-level summary of various generative models, including Variational Autoencoders (VAE), Generative Adversarial Networks (GAN), and their notable extensions and generalizations, such as f-GAN, Adversarial Variational Bayes (AVB), Wasserstein GAN, Wasserstein Auto-Encoder (WAE), and Cramer GAN.

December 20, 2017•7 min read

•

iOS Dev Learning Pointers

Pointers for learning iOS development

January 31, 2026•1 min read

•

TIL: React Server Components

March 23, 2025•4 min read

•

Hiragana and Katakana for Chinese Speakers

A quick reference tying each hiragana and katakana character back to its Chinese origin, plus the patterns and mnemonics I lean on while studying.

December 6, 2024•2 min read

•

How to Add a Table of Contents in Ghost without Editing the Site Template

How to add a table of contents in Ghost without editing the site template

March 31, 2023•1 min read

•

How to Add EmailOctopus Form to a React App

The EmailOctopus form is a script tag; this post shows how to make it work with React (using useEffect and useRef).

May 28, 2022•1 min read

•

My Frontend Learning Plan - 2021

My plan and progress updates on learning web frontend development more or less from scratch. Will be semi-regularly updated.

February 12, 2021•9 min read

•

My Takeaways from "State of JS 2020"

My takeaways from the State of JS 2020 survey.

February 12, 2021•3 min read

•

How to Enable Preview for Member-Only Content in Ghost

Automatically add preview / teaser content for Member-Only posts in Ghost.

January 24, 2021•5 min read

•

Demo: Member-Only Content Preview

This is a demo for "How to Enable Preview for Member-Only Content in Ghost"

January 24, 2021•2 min read

•

Ghost Themes Local Development Setup

Local development setup for Ghost themes.

January 23, 2021•2 min read

•

Featured-First Post Order in Ghost

Making featured posts show up first in Ghost Casper theme (instead of the default reverse chronological order).

January 21, 2021•1 min read

•

Editing Tips for Ghost

I recently switched to Ghost to host my blog. Here are some editing tips as I learn to use this platform.

January 21, 2021•2 min read

•

Moving to Ghost

My onboarding experience with Ghost and some wishlist items for future improvements.

January 18, 2021•3 min read

•

Test: Malformed Gallery and Link-Preview Blocks

Testing error handling for malformed gallery and link-preview blocks

January 5, 2020•1 min read

•

Test: Basic Markdown Features

Testing core markdown syntax including headers, lists, tables, links, and formatting

January 4, 2020•3 min read

•

Test: Inline Code Escaping

Validates that backticked HTML comments and tags render literally as code.

September 3, 2025•1 min read

•

Test: Miscellaneous Features

July 29, 2025•1 min read

•

Test: Code Blocks and Syntax Highlighting

Testing code syntax highlighting with various programming languages and code block types

May 29, 2025•1 min read

•

Test: Iframe Embedding

Testing iframe embedding for videos, maps, and other external content

May 29, 2025•2 min read

•

Test: Image Galleries

Comprehensive test suite for both auto-detected sequential image galleries and explicit gallery markers, testing various image counts and edge cases.

May 29, 2025•9 min read

•

Test: Math Rendering

Comprehensive test suite for MathJax rendering with various equation types, LaTeX syntax, and edge cases to verify the markdown pipeline's math processing.

May 29, 2025•2 min read

•

Test: URL Preview Cards

Testing link preview card functionality for external URLs

May 29, 2025•2 min read

•

Inner Peace Requires External Validation

February 17, 2025•1 min read

•
