and some change

LLM benchmarks like SWE-bench are not trustworthy
4 minute read

If you believe OpenAI’s marketing, their LLM products are automating an increasingly large fraction of software engineering jobs. They substantiate this, in part, by citing how their products perform against various LLM benchmarks.
Read more…
Jan 8, 2025 #tech , #ai , #llms , #science
Predictions for 2025
4 minute read

Following up on my predictions for 2024, here’s a bunch of predictions for this year. As before, all dollar values inflation-adjusted for start of 2025.
Read more…
Jan 5, 2025 #predictions
Following up on my 2024 predictions
5 minute read

Here’s how I did with my 2024 predictions.
Read more…
Dec 31, 2024 #predictions
Notes from Midtown East Casino Proposal Town Hall, 2024/01/11
7 minute read

Below are some notes I took during a town hall organized by Kristen Gonzalez, on the topic of a proposal for a casino in Midtown East (38th to 41st St, between First Ave and FDR).
Read more…
Jan 11, 2024 #nyc , #politics , #kirsten gonzalez , #notes
Predictions for 2024
2 minute read

Here’s a bunch of predictions for next year, along with rough probabilities. All dollar values inflation-adjusted for start of 2024.
Read more…
Dec 31, 2023 #predictions
The US government is 13x better at healthcare insurance than private companies
One minute read

Per a KFF report, the US government spent less than 1.3% of Medicare expenses on overhead in 2021 – the remaining 98.
Read more…
Nov 27, 2023 #us , #politics , #healthcare , #insurance , #medicare
Debugging stories: the inconsistent database
3 minute read

Recently at work I ran across a bug which I thought was kind of interesting, so I figured I’d write it up.
Read more…
Dec 5, 2022 #tech , #bugs , #debugging , #war-stories
Setting this blog up on Github Pages
6 minute read

As a birthday present to myself this year, I finally decided to bite the bullet and quit my Twitter addiction. I signed up for Mastodon, which seems nice so far (you can find me here).
Read more…
Nov 3, 2022 #tech , #github , #tutorials , #meta
Catching Us Up to 2018
4 minute read

Well, it’s been a year since my last blogpost, which caught us up to 2017. Let’s bring us up to speed again, shall we?
Read more…
Jan 2, 2018
Catching Us Up to 2017
5 minute read

It’s been a long time since my last real update in July 2014. What’s happened since then? Here’s the highlights, month by month.
Read more…
Jan 12, 2017