2025-03-15

[logs] March 15, 2025

I mostly browse social media apps like LinkedIn and Bluesky on my phone. Recently, LinkedIn started surfacing a popup asking me to download the app after a bit of scrolling in the browser. This prompts me to close LinkedIn entirely. I’m curious whether my behavior is unusual, or whether they’re getting enough conversions to app downloads for it to be worth it. It’s certainly caused me to use LinkedIn less, which I am guessing is not the point.

2025-03-09

[logs] March 9, 2025

So much of the world is outside of what can be specified. https://podcasts.apple.com/us/podcast/ai-and-i/id1719789201?i=1000696284548

2025-03-07

[logs] March 7, 2025

cursorrules

I feel like .cursorrules are finally starting to snap into place for me. I’ve made many messes in Cursor, and once I exhaust what is possible without structure, I need to create some if I want to continue. There’s nothing like a painful refactor to reinforce how and why you should define conventions for your codebase. Usually, when diving into a new idea, I don’t love to think about this stuff, but it’s always time well spent. With this in mind, my plan is to try to build rules and structure as I go, rather than pushing my projects to the limit and then needing to clean up.

2025-03-05

[logs] March 5, 2025

Stumbled upon the react-three-fiber library today and now I am building a game where a lander can fly around a mini-solar system.

2025-03-04

[logs] March 4, 2025

language_models

Learned more about how LLMs are trained. The model first goes through a pre-training phase. From there, a post-training phase fine-tunes it to take turns in a token stream with a human user, using special tokens to demarcate whether a message was written by the user or the assistant.

For example

<|im_start|>user
Hi there!<|im_end|>
<|im_start|>assistant
Nice to meet you!<|im_end|>
<|im_start|>user
Can I ask a question?<|im_end|>

Finally, labs have continued to improve model benchmark performance using further fine-tuning, like RLHF, where humans pick the best of a set of responses from the model, and that preference data is used (typically via a learned reward model) to fine-tune the model further.
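
The chat format shown above can be sketched in a few lines of Python. This is only an illustration of the idea: the function name `render_chat` and the `add_generation_prompt` flag are my own, the delimiters are handled as plain strings here (a real tokenizer maps them to dedicated special tokens), and exact whitespace conventions differ between models.

```python
from typing import Dict, List

# Delimiters copied from the example above. Real tokenizers treat these as
# single special tokens rather than ordinary text.
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def render_chat(messages: List[Dict[str, str]], add_generation_prompt: bool = False) -> str:
    """Flatten {"role", "content"} messages into one token-stream string."""
    turns = [
        f"{IM_START}{message['role']}\n{message['content']}{IM_END}"
        for message in messages
    ]
    text = "\n".join(turns)
    if add_generation_prompt:
        # Open an assistant turn so the model continues as the assistant.
        text += f"\n{IM_START}assistant\n"
    return text


conversation = [
    {"role": "user", "content": "Hi there!"},
    {"role": "assistant", "content": "Nice to meet you!"},
    {"role": "user", "content": "Can I ask a question?"},
]

print(render_chat(conversation, add_generation_prompt=True))
```

With `add_generation_prompt=True` the string ends on an open assistant turn, which is what gets the model to write the next assistant message instead of continuing the user's text.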

2025-03-02

[logs] March 2, 2025

I found an interesting library for building 3D games specifically “built for Cursor” called viber3d. I assume the name is a reference to “vibe coding”. This is the first library I have seen ship a starter scaffold with Cursor rules, which is an interesting development in how frameworks are being built. Language models don’t know about brand-new frameworks, so frameworks are starting to ship with content that helps models use them, as coding with models seems to be an increasingly popular way to code.

2025-02-28

[logs] February 28, 2025

I’ve built a few prototypes with the OpenAI voice-to-text API, with the code largely written using Cursor. It has been fast and easy to incorporate into Next.js apps. I can add an audio-recording-to-text feature to any app in a couple of minutes, ready for use in a production environment.

There are several other options for voice-to-text as well. macOS has a built-in voice-to-text feature, and there are several other Whisper wrappers available, some of which can run locally. Talking is much faster than typing and lets me capture raw thoughts quickly, which I can then refine later. LLMs are also quite good at structuring these raw thoughts into a more refined form that I can then edit.
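
For a sense of how small the transcription call is, here is a minimal sketch in Python rather than the Next.js setup described above. It assumes the official `openai` Python package and its `audio.transcriptions.create` method; the helper name `transcribe`, the `whisper-1` model choice, and the file name in the usage comment are assumptions for illustration, not what my prototypes use.

```python
from typing import Any, BinaryIO


def transcribe(client: Any, audio_file: BinaryIO, model: str = "whisper-1") -> str:
    """Send one audio file to the transcription endpoint and return the text.

    `client` is expected to be an `openai.OpenAI()` instance; it is passed in
    so this helper has no import-time dependency on the package.
    """
    result = client.audio.transcriptions.create(model=model, file=audio_file)
    return result.text.strip()


# Usage (needs `pip install openai` and an OPENAI_API_KEY environment variable):
#
#   from openai import OpenAI
#   with open("note.m4a", "rb") as f:  # "note.m4a" is a placeholder path
#       print(transcribe(OpenAI(), f))
```

Passing the client in as an argument also makes it easy to swap in a stub for tests or a different Whisper wrapper that exposes the same call.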

2025-02-22

[logs] February 22, 2025

I’d like to see authors being surprised by what readers end up learning from their material. Because the author is not just sending out something static. They’re sending out a program which is capable of emergent behavior. So the reader will be able to try out different things and discover things the author hadn’t intended.

Bret Victor, The Humane Representation of Thought, Oct 2014

2025-02-20

[logs] February 20, 2025

index

I’ve been playing around with this idea I am calling “idea projection”. The concept is that you can take raw ideas, logs or notes (like all the logs on my site, for example) and create projections of them into different forms using a model and a target structure.

The prompting approach looks something like this

<files>
{files_xml}
</files>

<structure>
{structure}
</structure>

<instructions>
Given the above files, your job is to create a new document that structures the contents of the files in adherence with the <structure>, maintaining the original voice and phrasing as much as possible.

Output the complete document content directly, without any replacement blocks.
</instructions>
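
As a rough sketch, the template above could be filled in with a few lines of Python. The `<file path="...">` wrapper used for `{files_xml}` is an assumption on my part (the template does not pin down that format), and the helper names `files_to_xml` and `build_projection_prompt` are hypothetical.

```python
from typing import Dict

# The prompt template from above, with the two placeholders left intact.
PROMPT_TEMPLATE = """<files>
{files_xml}
</files>

<structure>
{structure}
</structure>

<instructions>
Given the above files, your job is to create a new document that structures the contents of the files in adherence with the <structure>, maintaining the original voice and phrasing as much as possible.

Output the complete document content directly, without any replacement blocks.
</instructions>"""


def files_to_xml(files: Dict[str, str]) -> str:
    """Wrap each file in a <file> tag (assumed format; no XML escaping is done)."""
    return "\n".join(
        f'<file path="{path}">\n{content}\n</file>' for path, content in files.items()
    )


def build_projection_prompt(files: Dict[str, str], structure: str) -> str:
    """Fill the template with the serialized files and the target structure."""
    return PROMPT_TEMPLATE.format(files_xml=files_to_xml(files), structure=structure)


prompt = build_projection_prompt(
    {"2025-03-05.md": "Stumbled upon the react-three-fiber library today."},
    "# Projects\n\nOne short paragraph per project.",
)
print(prompt)
```

The resulting string is what gets sent to the model as a single message; the file contents are inserted as-is, so braces or angle brackets inside a note pass through untouched.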

What this allows you to do is give the model a ton of files and have it transform that input content into the target structure. This approach can work for everything from

2025-02-18

[logs] February 18, 2025

A great read by Harper about writing code with LLMs.

One passage that particularly resonated with me

When I describe this process to people I say “you have to aggressively keep track of what’s going on because you can easily get ahead of yourself.”
For some reason I say “over my skies” a lot when talking about LLMs. I don’t know why. It resonates with me. Maybe it’s because it is beautiful smooth powder skiing, and then all of a sudden you are like “WHAT THE FUCK IS GOING ON!,” and are completely lost and suddenly fall off a cliff.