Blog Archive

machine learning

Floating-Point Formats and Deep Learning

10 minute read

Floating-point format is not a crucial consideration in deep learning, but it can make a significant difference. What is floating-point, why should you (a deep learning practictioner) care, and what can you do about it?

Why Latent Dirichlet Allocation Sucks

11 minute read

Latent Dirichlet allocation is a well-known and popular model in machine learning and natural language processing, but it really sucks sometimes. Here’s why.

Back to top ↑

open source

Benchmarks for Mass Matrix Adaptation

9 minute read

I benchmarked various mass matrix adaptation methods in PyMC3. Sane defaults are easy to take for granted: it’s more nuanced than I initially expected!

Introducing stan-vim

less than 1 minute read

A Vim plugin for Stan, offering syntax highlighting, automatic indentation and code folding. Check it out!

Anatomy of a Probabilistic Programming Framework

13 minute read

In this blog post, we’ll break down what probabilistic programming frameworks are made up of, and how the various pieces are organized and structured. We’ll take a look at some open source frameworks as examples.

Cookbook — Bayesian Modelling with PyMC3

24 minute read

This is a compilation of notes, tips, tricks and recipes for Bayesian modelling that I’ve collected from everywhere: papers, documentation, peppering my more experienced colleagues with questions.

Back to top ↑

natural language processing

Why Latent Dirichlet Allocation Sucks

11 minute read

Latent Dirichlet allocation is a well-known and popular model in machine learning and natural language processing, but it really sucks sometimes. Here’s why.

Back to top ↑

bayesianism

Cookbook — Bayesian Modelling with PyMC3

24 minute read

This is a compilation of notes, tips, tricks and recipes for Bayesian modelling that I’ve collected from everywhere: papers, documentation, peppering my more experienced colleagues with questions.

Back to top ↑

pymc

Adventures in Manipulating Python ASTs

6 minute read

I explored the possibility of simplifying PyMC4’s model specification API by manipulating the Python abstract syntax tree (AST) of the model code.

Benchmarks for Mass Matrix Adaptation

9 minute read

I benchmarked various mass matrix adaptation methods in PyMC3. Sane defaults are easy to take for granted: it’s more nuanced than I initially expected!

Anatomy of a Probabilistic Programming Framework

13 minute read

In this blog post, we’ll break down what probabilistic programming frameworks are made up of, and how the various pieces are organized and structured. We’ll take a look at some open source frameworks as examples.

Cookbook — Bayesian Modelling with PyMC3

24 minute read

This is a compilation of notes, tips, tricks and recipes for Bayesian modelling that I’ve collected from everywhere: papers, documentation, peppering my more experienced colleagues with questions.

Back to top ↑

deep learning

Floating-Point Formats and Deep Learning

10 minute read

Floating-point format is not a crucial consideration in deep learning, but it can make a significant difference. What is floating-point, why should you (a deep learning practictioner) care, and what can you do about it?

Autoregressive Models in Deep Learning — A Brief Survey

11 minute read

My current project involves working with a class of fairly niche and interesting neural networks that aren’t usually seen on a first pass through deep learning. I thought I’d write up my reading and research and post it.

Back to top ↑

life update

Joining Flatiron Health

less than 1 minute read

An exciting professional update: I’ve joined Flatiron Health as a data scientist!

Graduated Cooper Union, Joining Point72

less than 1 minute read

Some exciting personal news: I’ve graduated from The Cooper Union and I’m joining Point72 Asset Management as a data scientist/research analyst!

Hello World!

less than 1 minute read

This is the first post of what will (hopefully) be a cool and interesting blog. Hope you like it!

Back to top ↑

mathematics

Back to top ↑

python

Streaming Data with Tornado and WebSockets

7 minute read

WebSockets with the Tornado web framework is a simple, robust way to handle streaming data. I walk through a minimal example and discuss why these tools are good for the job.

Adventures in Manipulating Python ASTs

6 minute read

I explored the possibility of simplifying PyMC4’s model specification API by manipulating the Python abstract syntax tree (AST) of the model code.

Back to top ↑

reinforcement learning

Back to top ↑

talks

Back to top ↑

probabilistic programming

Anatomy of a Probabilistic Programming Framework

13 minute read

In this blog post, we’ll break down what probabilistic programming frameworks are made up of, and how the various pieces are organized and structured. We’ll take a look at some open source frameworks as examples.

Back to top ↑

dataset

Back to top ↑

quant finance

Back to top ↑

statistics

Back to top ↑

stan

Introducing stan-vim

less than 1 minute read

A Vim plugin for Stan, offering syntax highlighting, automatic indentation and code folding. Check it out!

Back to top ↑

streaming

Streaming Data with Tornado and WebSockets

7 minute read

WebSockets with the Tornado web framework is a simple, robust way to handle streaming data. I walk through a minimal example and discuss why these tools are good for the job.

Back to top ↑

tornado

Streaming Data with Tornado and WebSockets

7 minute read

WebSockets with the Tornado web framework is a simple, robust way to handle streaming data. I walk through a minimal example and discuss why these tools are good for the job.

Back to top ↑

websocket

Streaming Data with Tornado and WebSockets

7 minute read

WebSockets with the Tornado web framework is a simple, robust way to handle streaming data. I walk through a minimal example and discuss why these tools are good for the job.

Back to top ↑