Welcome!

This is a blog primarily about topics in mathematics, machine learning, and technology, but occasionally about other things.

New posts every couple of months!

Latest Posts:

What's your house worth?

Something about heteroskedasticity
March 3, 2025

Over four million homes were sold in 2024, with a median sale price of $407,500 (source). That’s $1.66 trillion of residential real estate transactions in 2024 alone! And yet, house prices are notoriously difficult to predict—just ask Zillow! In this post, I’ll talk about some of the challenges in pricing illiquid assets (like homes), highlight how those challenges impact model design, and ultimately develop a fairly sophisticated house pricing model. Along the way, we’ll touch on topics like heteroskedasticity, hierarchical modeling, and Bayesian regression.

Keep reading...

Lobotomizing GPT

It's [contraint: clever remark]!
February 13, 2025

Modern LLMs are impressive pieces of machinery, capable of feats across many domains that were previously thought to be unique signifiers of human intelligence. This power and generality comes from their incredible size, which allows LLMs to compress huge quantities of information and recognize highly sophisticated patterns.

Keep reading...

Transfer learning with PyTorch and Huggingface Transformers

Almost exactly as easy as it sounds
September 10, 2024

One of the most powerful arguments for incorporating deep learning models into your workflow is the possibility of transfer learning: using a pre-trained model’s latent representations as a starting point for your own modeling task. This can be particularly useful when you have a fairly small number of labeled examples, but the task in question is similar to a pre-existing model’s task. So how easy is it to do transfer learning with an LLM? As we’ll see, with HuggingFace’s transformers library, it’s actually quite easy.

Keep reading...

Did I write something you want to use in your own work? Please do: All code is MIT licensed; any other content is licensed CC-BY-NC unless otherwise indicated. (If neither of those licenses work, let's get in touch.)